AI Startup Engram Raises $98 Million to Slash Token Costs for Enterprise Models

Amid increasing expenses from next-generation AI models, Engram, a young AI memory company, secured $98 million to develop solutions that reduce token consumption by up to 100x, optimizing enterprise AI efficiency.

Engram's AI models use up to 100x fewer tokens, cutting query costs dramatically.
Client roster includes Microsoft, Notion, and legal AI startup Harvey.
Raised $98 million led by top investors including Kleiner Perkins and General Catalyst.

Market signal

The AI sector is confronting a critical cost issue as advanced models require substantially more compute resources and tokens for effective operation. Contrary to earlier assumptions that bigger AI models would reduce per-query costs through scale, the industry now faces an 'explosion' of data and expenses. Engram's approach highlights a novel market response by focusing on optimizing AI memory and token usage to alleviate these rising costs.

By securing significant funding from prominent venture capitalists and AI experts, Engram's rapid ascent signals strong investor confidence in solutions addressing cost efficiency in AI operations. Its early traction with enterprise clients across diverse sectors underscores a clear demand for more cost-effective AI tooling that does not compromise contextual intelligence or workflow integration.

Operator impact

For operators and buyers deploying AI at scale, Engram's technology presents a potential paradigm shift in managing AI inference expenses. Incorporating Engram’s memory-focused models could reduce the token consumption necessary for internal workflows, which directly translates into lower usage costs and improved ROI on AI investments.

This technology enhances AI responsiveness by storing organizational context and anticipating questions within workflows, resulting in smarter, faster interactions. Operators integrating such specialized models may benefit from tailored AI outputs that better align with specific business needs while controlling spiraling costs associated with more general, large foundation models.

What to watch next

Monitor Engram’s client expansion and partnerships in the coming months to assess adoption dynamics and real-world performance impacts on operational AI budgets. Evaluating how effectively Engram’s models integrate with existing AI stacks and enterprise applications will inform the viability of memory-optimized approaches in broader markets.

It will also be crucial to watch how competitors respond to the growing cost pressures in AI model deployments, especially major AI service providers like OpenAI and Anthropic. Their strategies either to improve token efficiency internally or to partner with emerging players like Engram will shape the economics and capabilities landscape of AI solutions tailored for enterprises.

Source assisted: This briefing began from a discovered source item from CNBC Technology. Open the original source.

How SignalDesk reports: feeds and outside sources are used for discovery. Public briefings are edited to add context, buyer relevance and attribution before they are published. Read the standards