Feb 27, 202611 min readArticleThynker

How to Give Your AI Agent Real Capabilities

How Agent Skills are giving AI systems downloadable capabilities — and why structured expertise is the future of enterprise AI.

AIAgent SkillsEnterprise AIStrategy

How to Give Your AI Agent Real Capabilities

I Know Kung Fu

One of the most iconic scenes in The Matrix is when Neo downloads martial arts directly into his brain. Seconds later: "I know Kung Fu."

We're entering that era. But instead of martial arts, we're loading capabilities. And one of the most underrated tools in this new world is something called Agent Skills — and SKILL.md.

What are Agent Skills?

Agent Skills are a simple, open format for giving AI agents new capabilities through structured, reusable expertise. At their core, a skill is just a folder containing a SKILL.md file with clear metadata and step-by-step instructions for performing a specific task.

Skills can also include scripts, templates, references, and other assets, allowing teams to bundle procedural knowledge, domain expertise, and repeatable workflows into portable packages.

Instead of relying purely on generic model intelligence, skills give agents access to company-, team-, or user-specific context that they can load when needed. They work through progressive disclosure: agents first read only a skill's name and description to know when it's relevant, then load the full instructions when a matching task appears, and finally execute the workflow.

This keeps agents efficient while enabling deeper specialization on demand.

Why Context Matters

Large Language Models have limited context windows (although these are growing massively with each new iterative launch), and for them to be useful, user interfaces like chat interfaces and agentic tools ingest system prompts and other useful content.

But context bloat is a real problem. The more instructions, examples, guardrails, and background you stuff into the window, the more you dilute focus and increase cost.

So the idea is to be useful and lightweight. Instead of loading everything all the time, you structure capabilities so they can be discovered and activated only when needed. That keeps agents efficient, reduces noise, and preserves high-quality reasoning within a constrained context window.

Domain Expertise in the World of Generative AI

In the context of domain expertise, there are three different ways to make AI "smarter" about a specific field:

Fine-tuning means retraining a model on specialized data so it internalizes a domain's patterns (e.g., legal contracts, medical notes, financial reports). The knowledge becomes baked into the model's weights. It's powerful but heavier, slower to update, and less flexible.

RAG (Retrieval-Augmented Generation) means the model doesn't memorize the domain, it looks things up at runtime. It retrieves relevant documents from a knowledge base and uses them to answer accurately. This is great for up-to-date, factual, organization-specific knowledge.

Skills sit slightly differently. They don't just give the model knowledge, they give it structured procedural capability. A skill defines how to do something in a domain (workflows, rules, constraints, steps), and the agent loads it only when needed.

So instead of just knowing finance, it knows how to execute a financial analysis process your way. In other words, it turns vague claims like "I'm good at strategy" into operational intelligence. It allows AI systems to understand and execute based on defined capability boundaries.

The Difference

It's the difference between:

"Help me with marketing."

and

"Apply B2B enterprise positioning frameworks with pricing psychology and GTM sequencing for emerging markets."

One is generic. The other is downloadable Kung Fu.

Why This Matters

We are moving from simply prompting models to defining structured capability layers. AI will not just respond to prompts; it will operate through clearly defined skills.

Companies that formalize their skills, both internally and digitally, will compound their leverage. Individuals who define their skills precisely will become more capable.

This isn't about replacing people — it's about compressing expertise into structured intelligence that can be deployed, scaled, and activated on demand.

An Interesting Observation

I wanted to create an equity research skill for one of my agents. Using one LLM to help "train" another LLM seems like the only rational choice.

To use this as an opportunity to grade some of these SOTA models on a practical task, I gave the same prompt to three models, Claude Sonnet 4.6, ChatGPT 5.2 and Gemini 3:

"I need to create a SKILL.md file for an equity research agent. I need the agent to be able to do equity research on a stock... look at call transcripts, earnings reports, EDGAR filings, and news articles and anything else you can think of."

The difference in outputs was striking.

Claude blew it out of the park.

It delivered a structured, procedural, step-by-step framework: identify the company, pull EDGAR filings, analyze earnings reports, review call transcripts, scan news coverage, synthesize outputs — and for each step it referenced relevant digital resources.

It was properly formatted in clean markdown, aligned to the SKILL.md structure, and felt operational. 9/10.

ChatGPT was a mediocre generalist.

The response was coherent and useful at a high level, but it lacked concrete digital resources and procedural depth. It explained what equity research involves, but didn't fully translate that into a structured, executable skill. 6/10.

Gemini was the weakest.

The output fell below expectations. Gemini seemed to be more focused on generating images than producing structured data. 4/10.

The Takeaway

The takeaway isn't that one model "wins" universally. I've had vastly different experiences with each of them. In this instance, Claude is the clear winner.

For reference, here is the SKILL.md I went with:

name: equity-research description: > Comprehensive equity research agent that analyzes stocks using earnings call transcripts, earnings reports, SEC/EDGAR filings, and news articles to produce investment-grade research.

Use this skill whenever a user asks to research a stock, analyze a company, evaluate an investment, review earnings, summarize 10-K or 10-Q filings, pull call transcripts, assess valuation, or generate an equity research report.

Trigger even for partial requests like "what do you think about NVDA?" or "help me understand Apple's latest earnings" — any question about a public company's fundamentals, outlook, or recent developments should use this skill.

Equity Research Agent

A skill for producing thorough, investment-grade equity research on publicly traded companies using primary sources: SEC filings, earnings call transcripts, earnings reports, and news.

Workflow Overview

When asked to research a stock, follow this pipeline in order. Adapt depth to the user's request (quick take vs. full report).

Identify the company & ticker
Pull recent EDGAR filings (10-K, 10-Q, 8-K)
Find earnings call transcripts (last 1–4 quarters)
Review latest earnings releases / press releases
Search recent news & analyst coverage
Synthesize into structured research output

Step 1 — Identify the Company

Confirm the correct ticker and exchange before proceeding. If the user is ambiguous (e.g., "research Apple"), confirm AAPL on NASDAQ. Note the fiscal year end date, as it affects which filings are most current.

Step 2 — EDGAR Filings

Base URL: https://www.sec.gov/cgi-bin/browse-edgar

Key URLs to fetch:

Filing	What to look for	URL pattern
10-K (Annual)	Revenue/margins trends, risk factors, MD&A, capex guidance	https://efts.sec.gov/LATEST/search-index?q="%22{TICKER}%22"&dateRange=custom&startdt={YEAR}-01-01&forms=10-K
10-Q (Quarterly)	Sequential revenue, operating leverage, balance sheet changes	Same but forms=10-Q
8-K (Current)	Earnings releases, guidance updates, material events	Same but forms=8-K
DEF 14A (Proxy)	Executive comp, insider ownership, board composition	Same but forms=DEF+14A

EDGAR full-text search: https://efts.sec.gov/LATEST/search-index?q="%22{COMPANY_NAME}%22"&forms=10-K

Always extract from filings:

Revenue, gross margin, operating margin, net income (last 3 years minimum)
Free cash flow and capex
Debt levels, cash, and liquidity position
Management guidance (if provided)
Key risk factors specific to the company (not just boilerplate)
Segment breakdown if the company has multiple business lines

Step 3 — Earnings Call Transcripts

Transcripts reveal management tone, forward-looking commentary, and analyst concerns that don't appear in the formal filings.

Sources to search (in priority order):

Seeking Alpha — https://seekingalpha.com/symbol/{TICKER}/earnings/transcripts
Motley Fool — https://www.fool.com/earnings-call-transcripts/
The Motley Fool Transcripts search — web search "{COMPANY} earnings call transcript Q{N} {YEAR}"
Investor Relations page — many companies post transcripts directly; search "{COMPANY} investor relations earnings transcript"

Pull the most recent 2 quarters minimum. For a full report, pull 4 quarters.

Key things to extract from transcripts:

Management prepared remarks: Tone, strategic priorities, what they emphasize
Guidance: Revenue, EPS, margin guidance for next quarter/year
Analyst Q&A: What questions are analysts asking? What did management dodge or hedge?
Key metrics called out: GMV, ARR, NRR, DAU, backlog — whatever the company tracks
Changes quarter-over-quarter: Did management change their language on key topics?
Red flags: Defensive answers, excessive caveats, sudden change in CFO/CEO tone

Step 4 — Earnings Reports & Press Releases

Beyond the formal 10-Q/10-K, earnings press releases (typically filed as 8-K exhibits) contain:

Headline revenue and EPS vs. consensus
Non-GAAP metrics (adjusted EBITDA, adjusted EPS, free cash flow)
Management commentary
Updated guidance tables

Search: "{COMPANY} Q{N} {YEAR} earnings results press release" or fetch directly from the IR page.

Always note: Beat/miss on revenue, EPS, and guidance raise/lower vs. prior quarter.

Step 5 — News & Analyst Coverage

Web searches to run: "{TICKER} analyst rating price target {YEAR}" "{COMPANY} news last 30 days" "{COMPANY} competitive landscape {YEAR}" "{COMPANY} {key product or segment} growth" "{COMPANY} insider buying selling {YEAR}"

Sources to prioritize:

CNBC, Reuters, Bloomberg, WSJ for news
Barron's, Seeking Alpha for analysis
Yahoo Finance for quick consensus data: https://finance.yahoo.com/quote/{TICKER}/analysis

Extract:

Current analyst consensus (Buy/Hold/Sell ratio, average price target)
Recent rating changes
Major recent news events (acquisitions, leadership changes, product launches, regulatory)
Competitive dynamics and market share trends
Macro factors affecting the sector

Step 6 — Synthesis & Output

Standard Output Structure

For a full equity research report, produce the following sections. Adjust for shorter requests (e.g., if user just wants "quick take", condense to 2–3 paragraphs).

🏢 Company Overview

Business model in plain English (2–3 sentences)
Primary revenue streams and their relative size
Key competitive advantages / moat
Geographic exposure

📊 Financial Snapshot

Present a clean table of the most recent data available:

Metric	FY{N-2}	FY{N-1}	FY{N} (LTM)	Commentary
Revenue				YoY growth %
Gross Margin %				Expanding/contracting?
Operating Margin %
Net Income
Free Cash Flow				FCF margin %
Net Cash/(Debt)
EPS (diluted)

📞 Earnings Call Highlights

Key themes from management in the most recent call
What analysts are focused on
Forward guidance (exact numbers if given)
Notable changes from prior quarter's tone or guidance

📁 Key Filing Findings

Standout items from 10-K/10-Q (risks, MD&A, segment data)
Any material 8-K events (acquisitions, guidance updates, leadership)
Balance sheet health summary

📰 Recent News & Catalysts

Top 3–5 news items from the last 30–90 days
Upcoming catalysts (earnings date, product launches, investor day)

🎯 Analyst Sentiment

Consensus rating and average price target vs. current price
Bull case and bear case as articulated by analysts/management
Recent rating changes

⚖️ Investment Considerations

Strengths:

[List key strengths with supporting data]

Risks:

[List key risks from filings and news — be specific, not generic]

Key Questions to Monitor:

[What are the 2–3 things to watch in the next quarter?]

📌 Summary

2–3 paragraph synthesis of the overall picture. Present the facts and the range of views objectively. Do not make a buy/sell recommendation — instead, lay out the key factors an investor would need to weigh.

Research Principles

Be source-specific: Always cite which filing, which quarter's transcript, or which news article you're drawing from. Don't blend sources without attribution.
Use actual numbers: Avoid vague language like "revenue grew." Say "revenue grew 18% YoY to $24.7B in Q3 FY2024."
Flag data gaps: If you couldn't find a transcript or a filing is unavailable, say so clearly rather than inferring.
Be current: Prioritize the most recent 12 months of data. Note the date of the most recent filing or transcript used.
Distinguish facts from interpretation: Separate what the documents say from what analysts/commentators interpret. Label opinions as such.
Non-GAAP vs. GAAP: Always note when a metric is non-GAAP/adjusted and whether it differs materially from the GAAP equivalent.

Quick Reference — Common Metrics by Sector

Sector	Key Metrics Beyond Standard Financials
SaaS/Software	ARR, NRR, CAC, LTV, Rule of 40, Churn
E-commerce	GMV, Take rate, Active buyers, AOV
Fintech	NIM, NCO rate, Loan growth, Deposits
Semiconductors	Book-to-bill, ASP, Lead times, Capacity utilization
Retail	SSS (same-store sales), Inventory turns, Gross margin per sq ft
Healthcare	Pipeline milestones, Payer mix, ARPU
Media/Ad	DAU/MAU, ARPU, Ad impressions, Engagement
Energy	Reserve replacement ratio, Production cost/boe, Hedging

Useful Base URLs

SEC EDGAR Search: https://efts.sec.gov/LATEST/search-index?q="{TICKER}"&forms={FORM}
SEC EDGAR Filings: https://www.sec.gov/cgi-bin/browse-edgar?action=getcompany&CIK={TICKER}
Yahoo Finance: https://finance.yahoo.com/quote/{TICKER}
Yahoo Analysis: https://finance.yahoo.com/quote/{TICKER}/analysis
Yahoo Financials: https://finance.yahoo.com/quote/{TICKER}/financials
Macrotrends: https://www.macrotrends.net/stocks/charts/{TICKER}/{COMPANY}/revenue
Seeking Alpha: https://seekingalpha.com/symbol/{TICKER}
CNBC: https://quote.cnbc.com/quote-html-webservice/restQuote/symbolType/symbol?symbols={TICKER}&requestMethod=itv&noform=1&partnerId=2&fund=1&exthrs=1&output=json&events=1

Scope Calibration

Adjust research depth based on user request:

Quick take (5 min): Latest earnings release + analyst ratings + 1 news item
Standard (15 min): Above + 1 quarter of transcripts + key 10-Q findings
Full report (30+ min): Above + 4 quarters of transcripts + 10-K deep dive + multiple news sources

Always confirm scope with the user if unclear.