How AI Transforms Investment Workflows and Why Data Quality Determines Whether It Holds Up

Published: May 16, 2026

Modified: June 11, 2026

Key Takeaways

The bottleneck in investment workflows is everything that has to happen before analysis can begin. The agent layer eliminates that overhead and returns analysts to the work they were hired to do.
An LLM without a specialized MCP is a general reasoner with no domain footing. The MCP is what converts a capable model into a trustworthy one and the data behind it is what determines whether the output holds up to scrutiny.
The architecture is convergent. Every serious vendor will offer some version of it within the next few years. The questions worth asking are about the quality of the data that supports the architecture.

The AI Architecture Series – Part 3

The bottleneck in investment workflows is often not the analysis itself, but rather everything that has to happen before analysis can begin: running compliance checks, mapping portfolio positions against mandate criteria, classifying a fund against a regulatory standard, surfacing controversies across a universe, amongst many other such tasks.

These are not analytical duties. They are retrieval and formatting tasks wearing analytical work’s loose-fitting clothing. The architecture we described in the previous piece was built to address exactly this problem.

In parts 1 and 2 of the AI Architecture series, we established why architecture choices compound over time and how the four layers (the LLM, MCPs, skills, and agents) fit together in theory. In this piece, we get practical, exploring what each layer contributes to real investment workflows, what a concrete use case reveals about present possibilities, and the constraints of model context protocols (MCPs).

April 20, 2026

What AI Systems Are Actually Made Of: The Architecture Explained

The problem worth naming in investment analysis

Data availability once acted as a constraint on investment analysis and compliance. However, as corporate disclosure matured and data sources multiplied, the constraint inverted for many firms. The quantity of data available to them exceeded their ability to act upon it. The result is a sort of “stranded insight” defined by the gap between information that exists in a system and a decision that can actually be made with it.

Consider an analyst checking mandate compliance. The steps that precede their analysis include navigating to the correct dataset or file, selecting the right portfolio, locating the relevant mandate, launching the compliance check, interpreting the metric-level output, and identifying the violations and associated drivers. None of those steps involve deep judgment or critical thinking. They involve knowing where to click and being able to reliably repeat the process.

This is not a criticism of existing systems. It is a description of how most structured data products have always worked: powerful systems that require users to know how to operate them.

AI, and specifically the architecture toward which this series of articles has been building, allows us to reimagine this status quo. An LLM’s general reasoning, paired with the domain-specific data and methodologies of a specialized MCP collapses the distance between question and answer. The “overhead” work that once consumed an analyst’s morning (the navigation, filtering, exporting, formatting) disappears. What remains is judgment; the differentiated work that the analyst was hired to do.

An LLM’s general reasoning, paired with the domain-specific data and methodologies of a specialized MCP collapses the distance between question and answer.

AI-powered mandate monitoring: A real-world example

As a tech company, Clarity AI’s philosophy is to meet clients where they work, rather than requiring them to come to us.

Our API-first foundation makes the agent layer a natural extension of that guiding principle. Our suite of MCPs, including one tuned to monitor portfolio compliance with investment mandates, integrates Clarity AI’s data and methodology directly into the preferred AI applications or internal systems our clients already use. These MCPs facilitate workflows that run end-to-end with simple natural language prompts like “please check the compliance of my portfolio with my investment mandate”.

Good afternoon, Austin

Please check the compliance of my portfolio against my investment mandate.

A single sentence in plain language sets a full workflow in motion. Absent any additional user specification, Claude reasons about the requirements of the prompts and, in response, calls a series of distinct tools, pulling the user’s mandates and portfolio holdings, running the compliance checks at the metric level, and assembling the results.

The outputs render as a structured, interactive table inside the chat, including metric-level pass/fail outcomes and enumerated breaching companies. All values are traceable via follow-up questions or direct linkout to Clarity AI’s web application for deeper investigation.

What used to require navigating a data portal, locating the right portfolio and mandate, interpreting metric-level outputs, formatting the results, and filtering for offenders is now resolved with a single exchange.

iShares SDG ETF · Ritzel Trust Reputational Responsibility Mandate

Passing

Failing

Breaching companies

Metrics

Metric	%	Threshold	Status
Exposure: Anti-Personnel Landmines	100.0%	100	Pass
Exposure: Chemical & Biological Weapons	100.0%	100	Pass
Exposure: Cluster Bombs Production	100.0%	100	Pass
Exposure: Tobacco Production	100.0%	100	Pass
Exposure: Thermal Coal Production	100.0%	100	Pass
Exposure: Oil Fuels Production	100.0%	100	Pass
Exposure: Gas Fuels Production	100.0%	100	Pass
Controversy: Ungc Violation	100.0%	100	Pass
Controversy: Oecd Violation	100.0%	100	Pass
Revenue Alignment: Total SDG Revenue Alignment (Ne…	97.0%	99	Fail

Breaching companies

Filter… 4 results

A well-designed web application is the right tool for exploring data, specifically for open-ended questions where visual scaffolding helps the user navigate an environment rich with possibilities. After all, many of the best questions arise as a result of mere data exploration. However, the workflow described above is not exploratory.

The portfolio manager knows the question, portfolio, and mandate, and wants the answer. For this kind of concrete-outcome work, even the best-designed interface adds a translation step in which the user must convert a plain-language question into a sequence of clicks. MCP architecture removes that translation and compresses the path from question to answer. The user asks the question in the language they would have used with an analyst, and the system returns the answer in seconds.

That compression depends on the MCP doing something that the LLM itself cannot do alone. Foundation models like Claude carry enormous general knowledge but lack the specialized, curated data and methodologically-grounded information that regulated industries depend upon.

Foundation models like Claude carry enormous general knowledge but lack the specialized, curated data and methodologically-grounded information that regulated industries depend upon.

Consider a portfolio manager running a faith-based values mandate against a portfolio of funds.

A general-purpose LLM with web access would have to identify each fund’s holdings (rarely publicly disclosed and usually a quarter stale), reconcile naming conventions across PDFs and factsheets, locate twenty metric values for each of perhaps a hundred holdings per fund, decompose composite metrics inconsistently reported across firms, estimate values for non-disclosers, normalize units across reporting standards and fiscal calendars, and preserve a regulator-defensible audit trail through every step. That’s 2,000 data points per fund, in a portfolio that may hold dozens, with no governance layer to adjudicate definitions and no way of knowing which holdings were silently skipped because the data wasn’t findable or the page wasn’t fetchable.

February 4, 2026

The Real ROI Challenge of AI in Finance: Accuracy, Trust, and the Verification Tax

The task is structurally impossible for a model working from web access alone. Thus, the specialized MCP acts as the “power-up” that endows the general reasoner with domain-specific command, in this case, of the verified fund holdings, mandate criteria, regulatory context, and methodology-encoded data the workflow requires.

This is just the primitive first step toward truly transformational value. Because this work now runs inside of a powerful application layer replete with potential connections (schedules, inboxes, CRMs, calendars, etc.), it can be integrated into a truly end-to-end automated workflow. A scheduled task in Claude can run the same check each weekday morning, flag and substantiate relevant changes, draft a summary according to your voice guidelines, align the alert visuals with your firm’s design style, and send it to the portfolio manager’s inbox by 9 am.

That workflow, which previously required dedicated tooling or significant manual effort, is configurable in under thirty minutes. That is what the agent layer, the marriage of an LLM, MCPs, and skills, makes possible.

Anatomy of an agent workflow: The four primitives

The workflow described above decomposes into four primitives. Each does one job. In isolation, none of them is sufficient; composition is what makes them work. Below are those elements as applied to a mandate monitoring workflow. Each primitive is paired with something concrete you can try yourself.

Claude, the LLM, is the reasoning engine. It interprets the prompt, decides which tools to call, sequences them, reads the results, and drafts the output. The user does not need to specify any of this in advance.

Ask Claude what it would need to run a meaningful mandate compliance check on your portfolio. Be sure to disengage related MCPs first. The gaps it identifies, like your actual holdings, metric methodologies, and the actual extra-financial data, are exactly what the rest of this architecture provides.

Back at it, Austin

What would you need to run a meaningful investment mandate compliance check on my portfolio, which includes both funds and individual equities?

MCPs handle access. For example, a database MCP retrieves the portfolio and fund holdings, associated extra-financial data and methodologies, and runs the compliance check. A Gmail or Outlook connector delivers the alert. Tools govern which systems the LLM can access.

With the relevant MCP connected (for example, Clarity AI’s MCP), ask Claude to “check the compliance of my portfolio against my investment mandate”. The system will make distinct tool calls without you specifying any of them. That is the MCP layer doing exactly what Part 2 of this series described: handling access so the model can focus on reasoning.

Skills govern how the system should execute the work. They guide the LLM to replicably undertake a task according to a specific set of criteria rather than ad hoc every time. In an example in which Claude is provided with both an email-drafting skill and a report-formatting skill, the former encodes the firm’s preferred voice; the latter encodes how the firm communicates analysis, like the structure of an investment committee memo, the order in which risks are surfaced, and the way recommendations are presented. Skills allow LLMs to actually execute on firms’ institutional ways of working.

Ask Claude to draft three portfolio manager briefings on topics of your choosing. Then start a new chat, paste in three real briefings your team has produced, and ask for the same three drafts in the structure and voice of the originals. The second pass will land much closer to your standard. When you are satisfied with the consistency, ask Claude to package the process as a skill. It will then automatically apply those guidelines the next time you ask for a similar deliverable.

Scheduling is the trigger. It shifts the workflow from reactive (you ask) to proactive (the system acts according to your established cadence). Without it, the generation of this example’s briefing sits inert, waiting for someone to remember.

Set up a scheduled task in Claude to run a recurring check of your portfolio’s compliance against your investment mandate. Include details on the skills you want Claude to invoke, if any.

Create scheduled task

Name *

Daily Investment Mandate Briefing

Description *

A daily review of Northbridge Capital’s Res

Will be saved as “daily-investment-mandate-briefing”

Run a compliance check on the Global Innovators Equity Fund against our Reputational Risk Mandate. Flag any new breaches since yesterday’s run. Draft the result as a Daily Mandates Briefing email using the email-drafting skill and the Northbridge report formatting skill. Send the final output to compliance@northbridge.com.

⎙ C:\Users\AustinRitz… ⌄

Default model ⌄

Frequency

Daily ⌄

09:00 AM

Scheduled tasks use a randomized delay of several minutes for server performance.

Cancel

Save

The output below is what those primitives generate when unified by a user: a branded compliance briefing in the portfolio manager’s inbox by 9 am, written in the firm’s voice, formatted to its standards, using verified holdings, mandate, and extra-financial data sourced from the MCP-connected system, without anyone having initiated it.

Daily Mandates Briefing Inbox

Compliance Monitoring <compliance@northbridge.com>

to me ↓

9:00 AM (0 minutes ago) ☆ ↩

NORTHBRIDGE

CAPITAL PARTNERS

Daily Mandates Briefing

Global Innovators Equity Fund · Reputational Risk Mandate · April 27, 2026

● 2 new breaches detected since yesterday

Good morning,

Today’s automated review of the Global Innovators Equity Fund against the Reputational Risk Mandate flagged two new breaches. The remainder of the portfolio (96.6% by weight) is aligned with the mandate’s constraints.

Breaches

Zirlet Industries Inc. 2.1% of portfolio

UNGC / OECD violation.

View breach details →

Tinsuna Enterprises 1.3% of portfolio

Tobacco participation at 8.4% of revenue, exceeding the mandate’s 5% threshold.

View breach details →

Recommended actions

Remove or replace the two implicated holdings to bring the fund back into compliance. Suggested replacements, matched on market-cap band, industry, and domicile:

Zirlet Industries Inc. → Acme Inc.

Cap band $2–10bn · Construction materials · Germany

Tinsuna Enterprises → Locatil Co

Cap band $300m–$2bn · Distributors · Japan

Reply to acknowledge or request adjustments. The next briefing is scheduled for tomorrow at 9:00 AM.

Generated automatically by Northbridge Compliance Monitoring · This briefing is internal and confidential. Not intended for redistribution.

↩ Reply

↪ Forward

Taken in isolation, each primitive is limited, but in connection, they compound. If you strip out the schedule, the analyst still has to remember to trigger the workflow. Without the skills, the briefing is generic, unactionable, and stochastic. Discard the MCP, and the data is either entirely unavailable or completely unreliable.

Anatomy of an Agent Workflow

Four primitives turn one prompt into a recurring report

Scheduling

The Trigger

Every weekday at 9AM

Claude (the LLM)

The Reasoning Engine

Decides what to do, then acts

MCPs

The access

Connects Claude to systems

🗄️ Extra-financial data MCP

Gmail / Outlook MCP

Skills

The style

Encodes your firm’s voice and approach

📋 Email drafting

📋⚙️ Report formatting

Output

Branded compliance summary in the PM’s inbox by 9AM

Why data quality is the real constraint in AI-powered workflows

MCPs are not magic. The quality of the output they produce is a direct reflection of the quality of the data on the other end. In the workflow example detailed above, what counts is the quality of the extra-financial and holdings data. If that data is suspect, the entire workflow is worthless.

Each necessary element of data quality represents its own discipline:

Data coverage requires enormous computational power and primary research operations, along with machine learning prowess for estimations.
Freshness demands continuous dedicated data operations covering ingestion, validation, and refresh cycles across thousands of sources.
Methodological rigor requires regulatory monitoring, deep domain expertise, and disciplined versioning across thousands of metrics.
Auditability requires infrastructure that preserves provenance through every transformation, so that any figure can be traced back to its source.

THE AI ARCHITECTURE SERIES – PART 3

The ingredients of a trustworthy MCP

	What it Determines	What it Requires
Coverage	Whether the answer is comprehensive or silently incomplete	Dedicated computational power, primary research operations, machine learning models for estimations and data reliability
Freshness	Whether the answer reflects the present or the past	Continuous data operations, including monitoring, ingestion, validation, and refresh cycles running across thousands of sources
Methodological Rigor	Whether “UNGC violation” means what your mandate (or the regulator) actually intended	Constant regulatory monitoring, dedicated domain expertise, disciplined versioning across thousands of metrics
Auditability	Whether the answer holds up to regulator or stakeholder scrutiny	Infrastructure that preserves provenance through every transformation, so any figure can be traced back to its origin

These disciplines do not fall out of a more capable model. Rather, they are developed slowly and deliberately, with the kind of expertise that does not compress into a context window. The implication is structural. As models commoditize reasoning, the moat shifts to the substrate: the data.

Beyond architecture: Four questions to ask any AI vendor

The architecture in this article is convergent. Every serious vendor will offer some version of it within the next few years. The questions worth carrying into vendor conversations should, therefore, focus on what supports that architecture.

THE AI ARCHITECTURE SERIES – PART 3

Four questions to ask any AI data vendor

Before you trust the output

Coverage

What percentage of your relevant universe was excluded from the system’s answer and why?

Do you have any way to even ascertain these numbers?

Silent omissions are a particularly dangerous mode of failure.

Freshness

What is the stalest data point informing an output?

Is there any way to know?

What is the lag between source publication and availability?

Methodology

Are the underlying data and ultimate outputs supported by transparent methodologies aligned with associated regulations and standards?

Provenance

For any specific figure in the output, can you trace it back to the source document?

Can you differentiate between estimated and reported values?

The model does the reasoning. The data determines whether the reasoning is worth trusting.

Stay updated

Get market insights from our experts directly in your inbox.

Author Information

Austin Ritzel

Senior Manager, AI Strategy & Strategic Projects, Clarity AI

Austin leads strategic engagements at Clarity AI, managing partnerships with global financial institutions to help them scale with tech-driven solutions. Previously, he led consulting at 17 Asset Management, where he worked with heads of state, national foundations, and tech startups to build impact frameworks, raise capital, and grow valuations.

Research and Insights

How Investors Are Navigating Geopolitical Risk

Geopolitical risk has always been priced into investment decisions, but rarely has it demanded a rethink of the assumptions beneath them. Today it does. The question facing long-term investors is no longer whether geopolitical events move markets. It is whether the frameworks built over decades to guide portfolio construction, exclusion policy, and asset allocation still…

Read Article

ESG Risk, Gender Equality

The diversity say-do gap: Two-thirds of companies with discrimination violations also claim diversity initiatives

June is a month when corporate communications are filled with Pride messaging, diversity commitments, and inclusion statements. But beyond the visibility of these declarations, a more complex question remains: do these commitments consistently align with companies’ actual conduct? At Clarity AI, we looked at whether companies with active discrimination controversies in practice also publicly emphasize…

Read Article

Climate

The physical risk gap: What today’s datasets are missing

Access to physical risk data is no longer the problem. Most asset managers who need it have it. Far fewer have data that holds up when it matters: under regulatory scrutiny, in client reporting, or when trying to act on it. Taking place in the heart of the climate week season, after Zurich and London,…

Read Article

Talk to an Expert

How AI Transforms Investment Workflows and Why Data Quality Determines Whether It Holds Up

The AI Architecture Series – Part 3

What AI Systems Are Actually Made Of: The Architecture Explained

The problem worth naming in investment analysis

AI-powered mandate monitoring: A real-world example

The Real ROI Challenge of AI in Finance: Accuracy, Trust, and the Verification Tax

Anatomy of an agent workflow: The four primitives

Why data quality is the real constraint in AI-powered workflows

Beyond architecture: Four questions to ask any AI vendor

The 2026 Guide to Navigate AI Adoption in Regulated Workflows.

Stay updated

Author Information

Research and Insights

How Investors Are Navigating Geopolitical Risk

The diversity say-do gap: Two-thirds of companies with discrimination violations also claim diversity initiatives

The physical risk gap: What today’s datasets are missing

Download the Resource