Open Agentic Schema Framework

Natural Language Understanding [101] Natural Language Processing

Natural Language Understanding (NLU) focuses on the ability to interpret and comprehend human language, including understanding context, semantics, and identifying key entities within text.

Contextual Comprehension [10101] Natural Language Processing

Understanding the context and nuances of text input to provide relevant responses.

Semantic Understanding [10102] Natural Language Processing

Grasping the meaning and intent behind words and phrases.

Entity Recognition [10103] Natural Language Processing

Identifying and categorizing key entities within the text, such as names, dates, or locations.

Natural Language Generation [102] Natural Language Processing

Natural Language Generation (NLG) describes the ability to generate human-like text from structured data or other inputs.

Text Completion [10201] Natural Language Processing

Continuing a given text prompt in a coherent and contextually appropriate manner to generate fluent and contextually relevant content.

Text Summarization [10202] Natural Language Processing

Condensing longer texts into concise summaries while preserving essential information and maintaining coherence.

Text Paraphrasing [10203] Natural Language Processing

Rewriting text to express the same ideas using different words and structures while maintaining the original meaning.

Dialogue Generation [10204] Natural Language Processing

Producing conversational responses that are contextually relevant and engaging within a dialogue context.

Question Generation [10205] Natural Language Processing

Automatically generating relevant and meaningful questions from a given text or context.

Text Style Transfer [10206] Natural Language Processing

Rewriting text to match the style of a given reference text while preserving the original content.

Story Generation [10207] Natural Language Processing

Generating a piece of text given a description or a first sentence to complete.

Information Retrieval and Synthesis [103] Natural Language Processing

Capabilities for retrieving relevant information from various sources and synthesizing it into coherent, contextually appropriate responses. This includes searching, extracting, combining, and presenting information in a meaningful way.

Fact Extraction [10301] Natural Language Processing

Capability to identify and extract factual information from text documents or knowledge bases, including entities, relationships, and key data points.

Question Answering [10302] Natural Language Processing

System capability to understand questions and provide accurate, relevant answers by analyzing available information sources.

Knowledge Synthesis [10303] Natural Language Processing

Capability to aggregate and combine information from multiple sources, creating comprehensive and coherent responses while maintaining context and relevance.

Sentence Similarity [10304] Natural Language Processing

Capability to analyze and determine the semantic similarity between sentences, supporting tasks like search, matching, and content comparison.

Document and Passage Retrieval [10305] Natural Language Processing

Capability to identify and retrieve relevant documents or text passages based on specific criteria or queries from a larger collection of texts.

Search [10306] Natural Language Processing

Capability to perform efficient and accurate searches within large textual databases based on various criteria, including keywords, semantic meaning, or complex queries.

Creative Content Generation [104] Natural Language Processing

Capabilities for generating various forms of creative content, including narratives, poetry, and other creative writing forms.

Storytelling [10401] Natural Language Processing

Creating narratives, stories, or fictional content with creativity and coherence.

Poetry and Creative Writing [10402] Natural Language Processing

Composing poems, prose, or other forms of creative literature.

Language Translation and Multilingual Support [105] Natural Language Processing

Capabilities for handling multiple languages, including translation and multilingual text processing.

Translation [10501] Natural Language Processing

Converting text from one language to another while maintaining meaning and context.

Multilingual Understanding [10502] Natural Language Processing

Recognizing and processing text in multiple languages.

Personalisation and Adaptation [106] Natural Language Processing

Capabilities for adapting and personalizing content based on user context and preferences.

User Adaptation [10601] Natural Language Processing

Tailoring responses based on user preferences, history, or context.

Tone and Style Adjustment [10602] Natural Language Processing

Modifying the tone or style of generated text to suit specific audiences or purposes.

Analytical and Logical Reasoning [107] Natural Language Processing

Capabilities for performing logical analysis, inference, and problem-solving tasks.

Inference and Deduction [10701] Natural Language Processing

Making logical inferences based on provided information.

Problem Solving [10702] Natural Language Processing

Assisting with solving problems by generating potential solutions or strategies.

Fact and Claim Verification [10703] Natural Language Processing

Verifying facts and claims given a reference text.

Ethical and Safe Interaction [108] Natural Language Processing

Capabilities for ensuring ethical, unbiased, and safe content generation and interaction.

Bias Mitigation [10801] Natural Language Processing

Reducing or eliminating biased language and ensuring fair and unbiased output.

Content Moderation [10802] Natural Language Processing

Avoiding the generation of harmful, inappropriate, or sensitive content.

Text Classification [109] Natural Language Processing

Capabilities for classifying and categorizing text into predefined categories or labels.

Topic Labelling and Tagging [10901] Natural Language Processing

Classifying a text as belong to one of several topics, which can be used to tag a text.

Sentiment Analysis [10902] Natural Language Processing

Classify the sentiment of a text, that is, a positive movie review.

Natural Language Inference [10903] Natural Language Processing

Classifying the relation between two texts, like a contradiction, entailment, and others.

Module Extraction [110] Natural Language Processing

Capabilities for extracting and representing textual features as vectors for downstream tasks.

Model Module Extraction [11001] Natural Language Processing

Representing parts of text with vectors to be used as input to other tasks.

Token Classification [111] Natural Language Processing

Capabilities for classifying individual tokens or words within text.

Named Entity Recognition [11101] Natural Language Processing

Task to recognize names as entity, for example, people, locations, buildings, and so on.

Part-of-Speech Tagging [11102] Natural Language Processing

Tagging each part of a sentence as nouns, adjectives, verbs, and so on.

Image Segmentation [201] Images / Computer Vision

Assigning labels or categories to images based on their visual content.

Video Classification [202] Images / Computer Vision

Assigning labels or categories to entire videos or segments based on their visual and audio content.

Image Classification [203] Images / Computer Vision

Assigning labels or categories to images based on their visual content.

Object Detection [204] Images / Computer Vision

Identifying and locating specific objects within an image or video, often by drawing bounding boxes around them.

Keypoint Detection [205] Images / Computer Vision

Identifying and locating specific points of interest within an image or object.

Image Generation [206] Images / Computer Vision

Creating new images from learned patterns or data using machine learning models.

Depth Estimation [207] Images / Computer Vision

Predicting the distance or depth of objects within a scene from a single image or multiple images.

Image Module Extraction [208] Images / Computer Vision

Identifying and isolating key characteristics or patterns from an image to aid in tasks like classification or recognition.

Mask Generation [209] Images / Computer Vision

Producing segmented regions in an image to highlight specific areas or objects, typically represented as separate layers or overlays.

Image-to-Image [210] Images / Computer Vision

Transforming one image into another using a learned mapping, often for tasks like style transfer, colorization, or image enhancement.

Image-to-3D [211] Images / Computer Vision

The process of converting a 2D image into a 3D representation or model, often by inferring depth and spatial relationships.

Audio Classification [301] Audio

Assigning labels or classes to audio content based on its characteristics.

Audio to Audio [302] Audio

Transforming audio through various manipulations including cutting, filtering, and mixing.

Tabular Classification [401] Tabular / Text

Classifying data based on attributes using classical machine learning approaches.

Tabular Regression [402] Tabular / Text

Predicting numerical values based on tabular attributes and features.

Mathematical Reasoning [501] Analytical skills

Capabilities for solving mathematical problems and proving theorems.

Pure Mathematical Operations [50101] Analytical skills

Executing pure mathematical operations, such as arithmetic calculations.

Math Word Problems [50102] Analytical skills

Solving mathematical exercises presented in natural language format.

Geometry [50103] Analytical skills

Solving geometric problems and spatial reasoning tasks.

Automated Theorem Proving [50104] Analytical skills

Proving mathematical theorems using computational methods.

Coding Skills [502] Analytical skills

Capabilities for code generation, documentation, and optimization.

Text to Code [50201] Analytical skills

Translating natural language instructions into executable code.

Code to Docstrings [50202] Analytical skills

Generating natural language documentation for code segments.

Code Template Filling [50203] Analytical skills

Automatically filling in code templates with appropriate content.

Code Refactoring and Optimization [50204] Analytical skills

Rewriting and optimizing existing code through refactoring techniques.

Retrieval of Information [601] Retrieval Augmented Generation

Retrieval of information is the process of fetching relevant data or documents from a large dataset or database based on a specific query or input.

Indexing [60101] Retrieval Augmented Generation

Depth estimations the task of predicting the distance or depth of objects within a scene from a single image or multiple images.

Search [60102] Retrieval Augmented Generation

Search is the process of exploring a dataset or index to find relevant information or results based on a given query.

Document Retrieval [60103] Retrieval Augmented Generation

Document retrieval is the process of retrieving relevant documents from a collection based on a specific query, typically through indexing and search techniques.

Document or Database Question Answering [602] Retrieval Augmented Generation

Document or database question answering is the process of retrieving and using information from a document or database to answer a specific question.

Generation of Any [603] Retrieval Augmented Generation

Generation of any is augmenting the creation of text, images, audio, or other media by incorporating retrieved information to improve or guide the generation process.

Image Processing [701] Multi-modal

Capabilities for processing and generating images from various inputs and generating textual descriptions of visual content.

Image to Text [70101] Multi-modal

Generating textual descriptions or captions for images.

Text to Image [70102] Multi-modal

Generating images based on textual descriptions or instructions.

Text to Video [70103] Multi-modal

Generating video content based on textual descriptions or instructions.

Text to 3D [70104] Multi-modal

Generating 3D objects or scenes based on textual descriptions.

Visual Question Answering [70105] Multi-modal

Answering questions about images using natural language.

Audio Processing [702] Multi-modal

Capabilities for processing audio, including speech synthesis and recognition.

Text to Speech [70201] Multi-modal

Converting text into natural-sounding speech audio.

Automatic Speech Recognition [70202] Multi-modal

Converting spoken language into written text.

Any to Any Transformation [703] Multi-modal

Converting between any supported modalities (text, image, audio, video, or 3D).

Threat Detection [801] Security & Privacy

Identifying indicators of malicious activity, suspicious patterns, or emerging threats across logs and data sources.

Vulnerability Analysis [802] Security & Privacy

Reviewing code, configurations, or dependency manifests to surface potential security weaknesses and misconfigurations.

Secret Leak Detection [803] Security & Privacy

Scanning artifacts (code, logs, documents) to identify exposed credentials, tokens, or other sensitive secrets.

Privacy Risk Assessment [804] Security & Privacy

Evaluating data handling or user flows to surface potential privacy risks and recommend mitigations.

Data Cleaning [901] Data Engineering

Detecting and correcting errors, inconsistencies, and missing values to improve dataset quality.

Schema Inference [902] Data Engineering

Deriving structural metadata (fields, types, relationships) from raw or semi-structured data.

Feature Engineering [903] Data Engineering

Constructing informative transformed variables to improve downstream model performance.

Data Transformation Pipeline [904] Data Engineering

Designing or explaining multi-step sequences that extract, transform, and load datasets.

Data Quality Assessment [905] Data Engineering

Evaluating datasets for completeness, validity, consistency, and timeliness.

Task Decomposition [1001] Agent Orchestration

Breaking complex objectives into structured, atomic subtasks.

Role Assignment [1002] Agent Orchestration

Allocating responsibilities to agents based on capabilities and task requirements.

Multi-Agent Planning [1003] Agent Orchestration

Coordinating plans across multiple agents, resolving dependencies and optimizing sequencing.

Agent Coordination [1004] Agent Orchestration

Managing real-time collaboration and state synchronization among agents.

Negotiation & Resolution [1005] Agent Orchestration

Facilitating negotiation, conflict handling, and consensus-building between agents.

Benchmark Execution [1101] Evaluation & Monitoring

Running standardized benchmarks or evaluation suites and summarizing results.

Test Case Generation [1102] Evaluation & Monitoring

Creating targeted test inputs or scenarios to probe system behavior and edge cases.

Quality Evaluation [1103] Evaluation & Monitoring

Assessing outputs for accuracy, relevance, coherence, safety, and style adherence.

Anomaly Detection [1104] Evaluation & Monitoring

Identifying unusual patterns, drifts, or deviations in data or model outputs.

Performance Monitoring [1105] Evaluation & Monitoring

Tracking latency, throughput, resource utilization, and service reliability over time.

Infrastructure Provisioning [1201] DevOps / MLOps

Defining or explaining steps to allocate and configure compute, storage, and networking resources.

Deployment Orchestration [1202] DevOps / MLOps

Coordinating multi-stage application or model deployments, rollbacks, and version transitions.

CI/CD Configuration [1203] DevOps / MLOps

Designing or modifying continuous integration and delivery workflows and pipelines.

Model Versioning [1204] DevOps / MLOps

Tracking, promoting, and documenting different iterations of models and their artifacts.

Monitoring & Alerting [1205] DevOps / MLOps

Configuring and interpreting telemetry signals, thresholds, and alerts for operational health.

Policy Mapping [1301] Governance & Compliance

Translating organizational or regulatory policies into structured, enforceable rules or checklists.

Compliance Assessment [1302] Governance & Compliance

Evaluating processes or outputs against defined standards (e.g., GDPR, HIPAA) and identifying gaps.

Audit Trail Summarization [1303] Governance & Compliance

Condensing system event or transaction logs into human-readable compliance or oversight summaries.

Risk Classification [1304] Governance & Compliance

Categorizing potential operational or data-related risks by impact and likelihood for prioritization.

API Schema Understanding [1401] Tool Interaction

Interpreting and explaining API specifications, endpoints, parameters, and expected payloads.

Workflow Automation [1402] Tool Interaction

Designing or describing automated sequences integrating multiple tools or services.

Tool Use Planning [1403] Tool Interaction

Selecting and ordering tool invocations to accomplish a specified goal efficiently.

Script Integration [1404] Tool Interaction

Linking custom scripts or functions with external tools to extend capabilities.

Strategic Planning [1501] Advanced Reasoning & Planning

Formulating high-level multi-phase strategies aligned with long-term objectives.

Long-Horizon Reasoning [1502] Advanced Reasoning & Planning

Maintaining coherent reasoning chains over extended sequences of steps or time.

Chain-of-Thought Structuring [1503] Advanced Reasoning & Planning

Organizing intermediate reasoning steps into clear, justifiable sequences.

Hypothesis Generation [1504] Advanced Reasoning & Planning

Proposing plausible explanations or solution pathways for incomplete or uncertain scenarios.