User profile picture Busy

Mukunda Katta

@mukunda.vjcs6
🀘 Photon in a double slit
  • mukunda.vjcs6
  • README.md
Typing SVG

Active across 50+ public developer, AI, open source, and research surfaces
Code, datasets, package registries, preprints, reproducible demos, agent workflows, and OSS contribution traces.



[GitHub](https://github.com/MukundaKatta) [Bitbucket](https://bitbucket.org/personal-agent-harness/bitbucket-profile-readme) [Kaggle](https://www.kaggle.com/mukundakatta) [Codeberg](https://codeberg.org/MukundaKatta) [Apache](https://github.com/pulls?q=is%3Apr+author%3AMukundaKatta+org%3Aapache) [Hugging%20Face](https://huggingface.co/mukunda1729) [ORCID](https://orcid.org/0009-0007-6071-3896) [Zenodo](https://doi.org/10.5281/zenodo.20034550) [SSRN](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6718599) [Academia.edu](https://independent.academia.edu/MukundaKatta) [Authorea](https://www.authorea.com/users/1042134-mukunda-katta) [GPT Store](https://chatgpt.com/g/g-69f949f9653c819188b87ab44205fa74-agent-eval-lab) [Poe](https://poe.com/AgentEvalLab) [Replicate](https://replicate.com/mukundakatta/agent-eval-lab) [Modal](https://mukunda-vjcs6--agent-eval-lab-evaluate-agent.modal.run) [OpenRouter](https://openrouter.ai/apps?url=https%3A%2F%2Fmukunda1729-agent-eval-lab.hf.space%2F) [Codeberg Pages](https://mukundakatta.codeberg.page) [Portfolio](https://mukunda-ai.vercel.app) [LinkedIn](https://www.linkedin.com/in/mukunda-katta-728155220/) [X](https://x.com/katta_mukunda) [Email](mailto:[email protected])

β–°β–°β–°  The Lab  β–°β–°β–°  Publications  β–°β–°β–°  Bridge  β–°β–°β–°  Stack  β–°β–°β–°  Track Record  β–°β–°β–°  Credentials  β–°β–°β–°


  8+ years building production systems at Fortune 100 scale
  Former SDE at Amazon Web Services  β€’  Currently at Southwest Airlines
  Deep expertise in ML systems, distributed architectures, and full-stack engineering

Now: shipped the @mukundakatta/agent* reliability stack (fit β†’ guard β†’ snap β†’ vet β†’ cast), 6 matching MCP servers in the official MCP Registry, 3 new GitHub Actions on the Marketplace, and published four new GitLab-born agent packages on PyPI. Plus 40+ open PRs across MCP SDKs, FastMCP, claude-code-action, and Anthropic's agent SDK.


🦊  THIS GITLAB                                    πŸ™  GITHUB
─────────────────────                             ─────────────────────
4 agent-infra repos                               222 original repos
All public + published                            105 merged upstream PRs
GitLab Registry + PyPI                            npm + PyPI package work

The Lab

Four sibling repos under mukunda.vjcs6-group. Each one solves a single concrete problem; together they form a personal agent stack.


Fresh Contributions

Surface Latest proof
OpenAI GPT Store Agent Eval Lab - public GPT for lightweight agent evaluation and scenario walkthroughs
Poe AgentEvalLab - public bot for agent-eval prompts and scoring flows
Poe OpsScorecardLab - public bot for turning eval scenarios into operations scorecards
Poe RepoLandscapeLab - public bot for mapping premium agent repo surfaces
Replicate agent-eval-lab - public model/app page for eval-oriented agent interactions
Replicate ops-scorecard-lab - public app page for ops scorecard generation
Replicate repo-landscape-lab - public app page for repo-surface mapping
Hugging Face Agent Labs Portfolio - curated collection tying together the live Spaces and datasets
Hugging Face Ops Scorecard Lab - public Space for turning rough workflows into operator-facing scorecards
Modal agent-eval-lab endpoint - live API endpoint returning structured eval JSON
Modal ops-scorecard-lab endpoint - live API endpoint for scorecard generation
Modal repo-landscape-lab endpoint - live API endpoint for repo-landscape mapping
Modal Agent Labs Portal - public two-panel demo surface for evaluation plans and ops scorecards
OpenRouter Agent Eval Lab - public OpenRouter app analytics page seeded from the live Hugging Face Space
OpenRouter Ops Scorecard Lab - public OpenRouter app analytics page for the scorecard Space
Codeberg Pages MukundaKatta.codeberg.page - public portfolio page routing across the non-GitHub footprint
Kaggle Premium Agent Repo Landscape - public dataset mapping premium agent repos by surface, stack, and focus
Kaggle Agent Eval Scenarios - public eval dataset for lightweight agent benchmarking
Kaggle building-a-lightweight-agent-eval-benchmark - clean public notebook replacement with a successful run and resilient dataset loading
Codeberg premium-agent-landscape - public showcase repo for agent portfolio mapping and presentation
Codeberg agent-eval-lab - public repo for evaluation artifacts and benchmark framing
Codeberg apache-contribution-atlas - public tracker for Apache-facing contribution work
Codeberg Documentation PR #784 - clarified HTTPS auth with 2FA and token-based Git usage
Apache fluss PR #3243 - added a blog contribution guide for the Fluss website community docs
Apache fluss PR #3244 - added an FIP contribution guide for the Fluss contributor workflow
Apache pulsar-site PR #1139 - fixed failover standby mapping in the 3.0.x docs

Publications

Type Title Venue
Landing Page Lightweight Evaluation and Operational Scorecards for Tool-Using AI Agents GitHub Pages
Preprint Lightweight Evaluation and Operational Scorecards for Tool-Using AI Agents Zenodo
Artifact Repo lightweight-agent-eval-paper GitHub
Archive lightweight-agent-eval-paper Software Heritage, archived successfully
Preprint Submission Lightweight Evaluation and Operational Scorecards for Tool-Using AI Agents SSRN, in process (PRELIMINARY_UPLOAD)
Preprint Submission Lightweight Evaluation and Operational Scorecards for Tool-Using AI Agents Research Square, declined as not suitable for posting
Preprint Submission Lightweight Evaluation and Operational Scorecards for Tool-Using AI Agents MetaArXiv on OSF Preprints, declined as out of scope
Preprint AI Eval Forge: Mixed-Check Regression Testing for LLM and Agent Workflows Zenodo
Artifact Repo ai-eval-forge-paper GitHub
Preprint Submission AI Eval Forge: Mixed-Check Regression Testing for LLM and Agent Workflows SSRN, public abstract page
Preprint Submission AI Eval Forge: Mixed-Check Regression Testing for LLM and Agent Workflows MetaArXiv on OSF Preprints, submitted
Preprint Agent Trajectory Replay for Debugging Tool-Using AI Workflow Regressions Zenodo
Preprint Submission Agent Trajectory Replay for Debugging Tool-Using AI Workflow Regressions SSRN, submitted for review
Preprint Small-Rule Guardrails for Retrieval-Augmented Generation: Prompt Injection and Vector Poisoning Checks Zenodo
Preprint Chetana: A Theory-Indexed Probe Framework for AI Consciousness Indicator Scoring Zenodo
Preprint ML Intern Lab: A Minimal Agentic Workflow for Reproducible Machine Learning Experiment Reports Zenodo
Preprint Submission ML Intern Lab: A Minimal Agentic Workflow for Reproducible Machine Learning Experiment Reports SSRN, submitted for review
Preprint Mirror ML Intern Lab: A Minimal Agentic Workflow for Reproducible Machine Learning Experiment Reports Academia.edu
Preprint Submission Citation Traceability for Web-Native AI Research Workflows MetaArXiv on OSF Preprints, pending moderator review (m9j4g_v1)
Preprint Submission Context Forge: A Lightweight Method for Diversity-Aware Context Packing and Prompt-Injection-Aware Retrieval Research Square, QA/QC check
Research Profile Mukunda Katta ORCID
Research Profile Mukunda Katta Academia.edu
Research Profile Mukunda Katta Authorea profile, new submissions paused during platform migration

🦊  agent-skills-playbook

BEHAVIOR PACK

Production-grade AI agent skills, prompts, and operating playbooks. Reusable behavior packs for research, code review, README writing, handoff briefs, and security passes β€” each one a small SKILL.md with before/after examples.

PyPI

pip install agent-skills-playbook

skills Β· SKILL.md Β· coding agents

🦊  personal-agent-harness

RUNTIME

Lightweight personal agent runtime with memory, task loops, tool adapters, and local-first safety rails. Repeatable workflows without becoming a giant framework β€” JSON memory, approval gates, replayable run logs.

PyPI

pip install personal-agent-harness

runtime Β· memory Β· tool adapters Β· safety

🦊  browser-research-agent

SPECIALIST Β· WEB

An AI research agent that searches, reads, summarizes, and cites web sources for repeatable market and repository intelligence. Source-quality labels, recency filters, GitHub trend analysis, markdown briefs.

PyPI

pip install browser-research-agent

research Β· citations Β· trend analysis

🦊  ml-intern-lab

SPECIALIST Β· ML

An ML-engineer agent sandbox for reading papers, running experiments, and shipping model reports. Paper notes β†’ experiment plan β†’ baseline run β†’ metrics β†’ report, all tracked.

PyPI

pip install ml-intern-lab

agentic ML Β· experiments Β· paper-to-report

                  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                  β”‚   agent-skills-playbook   β”‚   ← reusable behaviors
                  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                                β”‚ loaded by
                                β–Ό
                  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                  β”‚   personal-agent-harness  β”‚   ← the runtime
                  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                                β”‚ specialized into
                  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                  β–Ό                           β–Ό
        β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”        β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
        β”‚ browser-research β”‚        β”‚   ml-intern-lab  β”‚
        β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜        β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Published Packages

pip install agent-skills-playbook
pip install personal-agent-harness
pip install browser-research-agent
pip install ml-intern-lab
Package PyPI GitLab
agent-skills-playbook PyPI Repo
personal-agent-harness PyPI Repo
browser-research-agent PyPI Repo
ml-intern-lab PyPI Repo

Bridge to the Wider Portfolio

πŸ™ GitHub

@MukundaKatta

222 originals Β· 105 upstream PRs
OpenAI Β· Anthropic Β· Google Β· MS
Apache Β· HuggingFace Β· Pydantic

πŸ“¦ Registries

npm Β· PyPI

52 npm packages Β· 52 PyPI ports
fit Β· guard Β· snap Β· vet Β· cast
kavach Β· streamparse Β· skillint

πŸ”Œ Integrations

MCP Β· Marketplace Β· πŸ€—

6 MCP-Registry servers
7 GitHub-Marketplace Actions
14 HF Spaces Β· 13 HF Datasets


Stack

 β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
 β”‚  ML Systems         β”‚  Fault prediction Β· embedding pipelines Β· evaluation       β”‚
 β”‚  Agentic AI         β”‚  RAG Β· LangGraph Β· query routing Β· hallucination detection β”‚
 β”‚  Cloud              β”‚  AWS Bedrock/SageMaker Β· GCP Β· Azure Β· K8s Β· Terraform     β”‚
 β”‚  Full-Stack         β”‚  React/TS Β· Java/Python APIs Β· CI/CD Β· zero-downtime       β”‚
 β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Track Record

Era Role Β· Company What I owned
2025 β€” now AI/ML Engineer Β· Southwest Airlines production ML, agentic RAG, Bedrock migration
2024 β€” 2025 AI/ML Engineer Β· GPS IT Solutions RAG platforms, model-risk governance, vector search
2022 β€” 2024 SDE Β· Amazon Web Services enterprise cloud systems, React/Java/Python, CI/CD
2022 β€” 2022 Data Engineer Β· GPS IT Solutions AWS Glue, PySpark, on-prem β†’ cloud pipelines
2017 β€” 2020 Software Engineer Β· American Express Python REST APIs at high-volume transaction scale
Numbers worth showing
  • 78% infra cost reduction on the SageMaker β†’ Bedrock migration ($1,740 β†’ $371/mo)
  • 600x retrieval-latency improvement on the ML prediction system
  • 30K+ entries in the 9-stage agentic RAG pipeline (LangGraph + Bedrock Nova + FAISS + BM25)
  • 5 prediction types in the aircraft-maintenance fault-prediction system
  • 23 automated evaluation tests for the AI model-risk governance framework
  • 40% content production-time reduction on the GPT-4 + RAG content platform

Credentials

Education  Β·  M.S. Big Data Analytics & IT, University of Central Missouri (2021–2022)  Β·  B.Tech Mechanical Engineering, SRM University (2012–2016)


ANTHROPIC

MCP Advanced Claude Β· Bedrock Claude Β· Vertex Intro to MCP Claude Code Claude API Agent Skills Subagents

AWS

AWS GenAI Apps AWS AI Solutions AWS AI Fundamentals Amazon Q


🦊  Mirrored on GitHub  Β·  agent workspace on Bitbucket  Β·  refreshed 2026-05-03

Activity

View all
Loading
There was an error loading users activity calendar.
  • Loading

Personal projects

View all
  • Loading
Loading

Info

Member since April 30, 2026