top of page

AI Jockey - August 2025

AI Jockey | JonathanMHerman.com

Jonathan Herman

Aug 15, 2025

Artificial Intelligence Updates & Integrated Web3 Strategies for Founders, Technologists & Professional Service Providers

Why “AI Jockey”?  Artificial Intelligence can feel like a powerful steed that can take you far and fast, but mastery of its unbridled power is required to succeed in the new competitive landscape.  This newsletter is designed to help you choose the right horse, stay in the race, and finish ahead of the pack…



1. Unified AI Architecture: Foundations for Agile Startups


Happenings: The latest frontier models—such as GPT-5, Claude, and Gemini—are merging lightning-fast “main” modes, deep-reasoning capabilities, and intelligent routing into single adaptive systems. Alex Wissner-Gross, on the AI Insiders Podcast (Aug 2025), called this evolution “the closest thing yet to a cognitive Swiss Army knife,” and Emad Mostaque added, “The AI that adapts to the problem in real time will win the race.” As Andrej Karpathy noted in Software 2.0 (Medium), “Neural networks are not just another classifier… They are Software 2.0,” underscoring the fundamental shift in how software—and now entire technology stacks—are designed and deployed.  In parallel, major cloud providers are introducing native AI orchestration layers that allow startups to deploy and manage multi-model strategies without complex engineering overhead.


Startup Impact: Founders can now build with one AI backbone that flexibly handles everything from rapid customer queries to multi-layer strategic planning. This reduces the need for multiple stitched-together APIs and models, simplifying the tech stack and speeding go-to-market. Integrating with mixed reality (MR) and blockchain tokenization opens the door to holistic platforms—AI orchestrating MR interfaces for immersive experiences while handling blockchain-secured transactions or token-gated access in real time. For professional service providers—lawyers, consultants, and accountants—this unified approach makes it easier to integrate AI into operational workflows without having to constantly adapt to disparate tool behaviors.


What’s Ahead: Expect tighter integration between adaptive model routing and domain-specific fine-tuning. In the coming months, more startups will offer verticalized frontier model deployments—AI cores trained on specialized legal, medical, or financial data that automatically choose the optimal reasoning depth for each task. Model providers will also release APIs for dynamic routing customization, giving founders finer control over cost-performance tradeoffs.


2. Expanded Context Windows: Unlocking Intelligent, Contextual Applications


Happenings: Many frontier models, including GPT-5, Claude 3, and Gemini 2.x, now support massive context windows—up to 400K tokens or more. Anthropic announced on Aug 12, 2025: “Claude Sonnet 4 now supports up to 1 million tokens of context… a 5x increase…”—a leap that dramatically expands the potential for maintaining continuity across huge datasets and complex workflows. Dave Blundin, on the AI Insiders Podcast (Aug 2025), remarked: “We’re looking at AI that can read your entire corporate knowledge base and apply it instantly.”  Additionally, new compression techniques are emerging to store and retrieve these large contexts more efficiently, reducing compute costs and latency.


Startup Impact: This enables AI copilots that truly understand a business’s full history—contracts, product documentation, support tickets, codebases—without relying on separate memory systems. A legal-tech startup could have the AI instantly cross-reference old and new case law; a compliance SaaS could scan thousands of pages of regulatory updates; a fintech could have the AI ingest entire market reports before producing investment recommendations. When linked with MR, AI could recall relevant visual or spatial data during live interactions, and blockchain could authenticate data sources or verify content integrity—creating end-to-end trusted, immersive workflows.


What’s Ahead: Look for even larger and more efficient context windows—pushing beyond 1M tokens—alongside hybrid architectures that combine long-context reasoning with embedded search. Startups will likely integrate these capabilities with private knowledge graphs, enabling near-instant, accurate recall of a company’s institutional memory. Expect providers to enhance memory persistence across sessions, reducing repetitive prompts and boosting productivity.


3. Agentic Workflows & Integrated Productivity: Beyond Feature to Autonomy


Happenings: Each of the leading frontier models have advanced agentic capabilities—tool orchestration, multi-step execution, and autonomous task handling—mature enough to embed directly into startup offerings.  As OpenAI described in Introducing ChatGPT agent: bridging research and action (Jul 17, 2025), “ChatGPT now thinks and acts, proactively choosing from a toolbox of agentic skills…”—highlighting the synergy between large context comprehension and agentic execution.  Recently, enterprise vendors have begun launching marketplaces for pre-built AI agents, enabling faster integration of domain-specific autonomous capabilities.


Startup Impact: Founders can deliver AI that acts, not just advises. Imagine a contract-drafting AI that not only creates a draft but cross-checks it against precedent, runs it through a compliance filter, and emails it to the counterparty—all without further prompting. In finance, an AI could reconcile ledgers, generate audit-ready statements, and update the client’s tax prep file. When paired with blockchain, these agents could automatically execute smart contracts or log verified milestones—bridging traditional and digital transactions.


What’s Ahead: Expect rapid expansion of multi-agent collaboration, where different AI agents handle separate parts of a workflow and coordinate in real time. Model providers will likely introduce native integration with enterprise systems like ERP, CRM, and compliance platforms, allowing autonomous execution at scale. Regulatory bodies may also start issuing guidelines for agent accountability, influencing how startups deploy these capabilities.


4. AI-Generated Intellectual Property as a Valuation Lever


Happenings: The latest AI models, including GPT-5 and other top-tier systems, offer coding, reasoning, and creative generation capabilities that allow startups to rapidly create patentable or copyright-eligible assets—from innovative algorithms to novel process designs.  The U.S. Patent and Trademark Office has clarified its stance on AI-assisted inventions. As published in the Federal Register, Inventorship Guidance for AI-Assisted Inventions (Feb 13, 2024), patents may cover AI-assisted inventions “for which a natural person provided a significant contribution.” Kathi Vidal, USPTO Director, emphasized in Holland & Knight (Feb 14, 2024): “The right balance must be struck between awarding patent protection… while not unnecessarily locking up innovation…”  This signals that human creativity remains a legal requirement, but AI can play a pivotal supporting role.  Internationally, several jurisdictions are piloting “AI co-inventor” recognition frameworks, signaling a shift toward more inclusive IP laws.


Startup Impact: Founders can now treat their AI not just as a tool, but as an IP-producing partner. By working with attorneys experienced in the U.S. Patent and Trademark Office (USPTO) process, startups can file for patents on AI-generated inventions or register trademarks tied to unique AI-created branding. Linking MR and blockchain into these plays could mean patenting spatial interaction methods or securing blockchain-verified provenance for AI-created assets—strengthening IP claims and adding monetization paths.


What’s Ahead: We’ll likely see clearer legal frameworks around AI inventorship and copyright ownership, with some jurisdictions formalizing how machine-assisted IP can be registered. Startups should anticipate new opportunities to license AI-generated technologies and processes, and investors may start valuing companies partly on their “AI IP portfolios” alongside traditional metrics.


5. Token-Efficient Tiering and Cost-Control Strategies


Happenings: Across the major frontier models, modular variants offer different speed, depth, and cost profiles, with pricing that can range from a few cents to over $10 per million tokens depending on tier. Alex Wissner-Gross, on the AI Insiders Podcast (Aug 2025), stated: “Cost discipline will be a defining factor in whether AI-native startups survive beyond their Series A.”   New AI-FinOps platforms are emerging to automate token routing decisions, forecast usage spikes, and suggest optimal provider-switching strategies in real time.


Startup impact: Founders should design intelligent routing systems that send simple tasks to cheaper models and reserve premium tiers for high-value or high-risk workloads. This not only controls costs but also ensures users experience consistent performance where it matters most. In MR-linked businesses, this routing could prioritize high-fidelity visual generation when needed, and in blockchain contexts, allocate processing to secure token-related computations—creating an optimized balance between immersive experience quality and secure transactional throughput.


What’s ahead: Expect dynamic model-switching APIs to become more sophisticated, allowing real-time routing decisions based on latency, accuracy needs, and cost. We may also see cross-provider orchestration platforms emerge, enabling startups to seamlessly switch between GPT, Claude, Gemini, and open-source models for optimal pricing and performance. Token optimization strategies will likely be built into development frameworks, making efficiency a default rather than an afterthought.


Takeaways

Stakeholder

Strategic Actions and Considerations

Founders

Prioritize a unified AI architecture with adaptive routing; Explore integrated Mixed Reality (MR) + blockchain applications; Pilot industry-specific fine-tunes; Integrate IP creation into R&D to boost valuation; Implement dynamic cost controls for sustainable scaling.

Product Architects

Design for long-context processing with MR data streams; Enable agentic workflows integrated with enterprise tools and blockchain triggers; Leverage modular model tiers to balance performance and cost.

Lawyers

Develop frameworks for AI-assisted IP registration that incorporate blockchain provenance; Craft liability and compliance policies for autonomous AI; Negotiate flexible, scalable model usage contracts.

Accountants

Embed token economics in financial planning; Track and value AI-generated IP as intangible assets, including those from MR or blockchain use cases; Model cost-performance tradeoffs across providers.

Wealth Managers

Incorporate AI-driven personalization while safeguarding trust; Assess startups’ AI asset portfolios and MR/blockchain integrations; Advise on diversification of model providers to mitigate operational risk.

 

Note: This report has been fact-checked for accuracy, but please inform us of any suggested changes

 

 

For more about Jonathan, who Crunchbase ranks among the Top 10 People in Semantic Web (‘Web3”)…

 

JonathanMHerman.com

 


bottom of page