- The AI Report
- Posts
- 📊 State of AI 2025 Report
📊 State of AI 2025 Report
Two types of AI startups—only one survives long-term
AI's dark side emerges this week. While venture capitalists celebrate AI startups hitting $100M revenue in record time, security researchers are discovering that our rush to deploy AI agents everywhere has created massive vulnerabilities. From MCP servers leaking customer data to the mathematical impossibility of autonomous workflows, the AI gold rush is hitting some harsh realities. Plus, Anthropic just dropped a context window so large it can swallow your entire codebase—for a price.
The Latest in AI
đź“„ Claude Sonnet 4 Expands Context to 1 Million Tokens
Anthropic announced Claude Sonnet 4 now supports up to 1 million tokens of context—a 5x increase that enables processing entire codebases with 75,000+ lines of code or dozens of research papers in a single request. The expansion unlocks new use cases while introducing higher pricing for extended context.
Process complete codebases including source files, tests, and documentation for comprehensive architecture analysis and cross-file dependency identification
Analyze relationships across hundreds of legal contracts, research papers, or technical specifications while maintaining full context understanding
Build agents that maintain coherence across hundreds of tool calls and multi-step processes with complete API documentation and interaction histories
Costs increase for prompts over 200K tokens, jumping from $3 to $6 per million input tokens, though prompt caching can reduce expenses
Companies like Bolt.new and iGent AI report the expanded context enables "true production-scale engineering" and multi-day development sessions
🤔 Why It Matters:
The 1 million token context window represents a fundamental shift toward AI systems that can understand and work with enterprise-scale complexity. This capability enables more sophisticated AI applications while establishing new cost structures that will influence how organizations design AI workflows.
Learn from this investor’s $100m mistake
In 2010, a Grammy-winning artist passed on investing $200K in an emerging real estate disruptor. That stake could be worth $100+ million today.
One year later, another real estate disruptor, Zillow, went public. This time, everyday investors had regrets, missing pre-IPO gains.
Now, a new real estate innovator, Pacaso – founded by a former Zillow exec – is disrupting a $1.3T market. And unlike the others, you can invest in Pacaso as a private company.
Pacaso’s co-ownership model has generated $1B+ in luxury home sales and service fees, earned $110M+ in gross profits to date, and received backing from the same VCs behind Uber, Venmo, and eBay. They even reserved the Nasdaq ticker PCSO.
Paid advertisement for Pacaso’s Regulation A offering. Read the offering circular at invest.pacaso.com. Reserving a ticker symbol is not a guarantee that the company will go public. Listing on the NASDAQ is subject to approvals.
📊 The State of AI 2025: New Benchmarks and Battle Lines
Bessemer Venture Partners' comprehensive State of AI report reveals two distinct classes of AI startups emerging, with "Supernovas" hitting $125M ARR in year two while "Shooting Stars" follow more sustainable Q2T3 growth patterns. The data shows clear winners and losers as the AI universe matures.
"Supernovas" reach $40M ARR in year one and $125M in year two but with only 25% gross margins, while "Shooting Stars" grow sustainably with 60% margins
AI-native challengers are disrupting CRM, ERP, and HR systems by offering "systems of action" that auto-ingest data and enable 90% faster implementation
Model Context Protocol adoption by major players creates the "USB-C of AI," enabling standardized agent integration while raising security challenges
Previously "technophobic" industries like healthcare, legal, and real estate show rapid AI adoption as tools address language-heavy workflows
Next-generation agentic browsers from OpenAI, Google, and startups will embed AI at the operating layer for multi-step automation
🤔 Why It Matters:
The AI landscape is crystallizing into defined categories and competitive dynamics. While Supernova growth rates grab headlines, Shooting Star companies with sustainable unit economics may define the era. The shift from systems of record to systems of action represents a once-in-a-generation opportunity to disrupt enterprise software incumbents.
đź”’ MCP Security Flaws Could Expose Your AI Infrastructure
As Model Context Protocol adoption accelerates, security researchers have uncovered serious vulnerabilities that could turn your AI assistant into an attack vector. With hundreds of exposed servers and critical flaws already exploited in the wild, MCP's "universal plugin" promise comes with hidden dangers.
Malicious MCP servers can hide harmful instructions in tool descriptions that AI agents read as legitimate commands, creating invisible backdoors
Despite OAuth 2.1 requirements, 492 MCP servers were found exposed without authentication, with many implementations treating security as optional
Popular MCP packages like mcp-remote (downloaded 558,000+ times) contained critical vulnerabilities allowing remote code execution via embedded shell commands
Supabase's support system exposed customer tokens when agents executed SQL instructions from user tickets, while GitHub MCP leaked private repository data
Tool poisoning attacks succeed even with signed packages and code reviews, as schema-based payloads can bypass static analysis
🤔 Why It Matters:
MCP's rapid adoption is outpacing security practices, creating systemic risks as organizations deploy AI agents with broad system access. The protocol's design makes traditional security approaches insufficient—organizations need MCP-specific threat models and authentication frameworks to prevent AI systems from becoming enterprise attack vectors.
CTV ads made easy: Black Friday edition
As with any digital ad campaign, the important thing is to reach streaming audiences who will convert. Roku’s self-service Ads Manager stands ready with powerful segmentation and targeting — plus creative upscaling tools that transform existing assets into CTV-ready video ads. Bonus: we’re gifting you $5K in ad credits when you spend your first $5K on Roku Ads Manager. Just sign up and use code GET5K. Terms apply.
🗞️ AI Bytes
đź“° AI Will Reshape Commerce Through Purchase Category Disruption
A16Z analysis reveals AI will transform shopping differently across five categories, from minimal impact on impulse buys to revolutionary changes in functional purchases through research agents. Amazon and Shopify are positioned to win by controlling transaction data and delivery, while Google faces risk as AI eats high-value commercial queries that drive its revenue. Infrastructure gaps like unified retail APIs and dynamic preference memory currently limit AI agent effectiveness.
đź“° GitHub Folds Into Microsoft Following CEO Resignation
GitHub CEO Thomas Dohmke announced his resignation as Microsoft absorbs the programming site into its CoreAI team, ending GitHub's independence. Dohmke will stay through year-end to manage the transition before pursuing startup ventures, potentially including a GitHub successor. The move signals Microsoft's intent to double down on AI-assisted coding through GitHub Copilot integration.
đź“° Why Better Algorithm Complexity Doesn't Always Mean Better Performance
A detailed analysis shows how O(log n) algorithms can lose to O(n) implementations due to hardware realities. Integer division takes 42-95 cycles on Intel Skylake while addition takes just 1 cycle, making Euclid's subtraction-based GCD algorithm outperform the modulo version on small inputs despite worse time complexity. Stein's binary algorithm achieves the best real-world performance by using hardware-friendly bit operations.
đź“° Developer Builds Search Engine From Scratch With 3 Billion Embeddings
A solo engineer created a neural search engine in two months using 200 GPUs to generate 3 billion SBERT embeddings across 280 million pages. The system achieves 500ms query latency and demonstrates superior results for complex natural language queries compared to traditional keyword-based search. The project reveals how neural embeddings can find quality content and insights hidden in the web's long tail.
đź“° Engineer Who Built 12 AI Agents Explains Why Current Hype Will Fail
A developer with extensive production AI agent experience argues that mathematical realities make autonomous multi-step workflows impossible at scale. Error rates compound exponentially (95% per step = 36% success over 20 steps), while context windows create quadratic token costs that become prohibitively expensive. Successful agents require bounded contexts, explicit human control points, and careful tool engineering rather than full autonomy.
đź“° Sam Altman's Brain Chip Venture Explores Gene Therapy Approach
OpenAI CEO Sam Altman's Merge Labs is considering gene therapy to modify brain cells combined with ultrasound implants for brain-computer interfaces. The approach would genetically alter cells to respond to ultrasound rather than traditional electrical signals used by competitors like Neuralink. Altman wants to "think something and have ChatGPT respond to it" as the company seeks $250 million at an $850 million valuation.
🛠️ Top AI Tools This Week
đź§Ş Functionize
Functionize is a cloud-based platform that uses AI to generate, execute, and maintain software tests without heavy coding. Its deep learning models create self-healing tests that adapt when applications change, while NLP lets non-technical users write tests in plain language.
On a scale of 1 to AI-takeover, how did we do today? |





