TL;DR
Semble is a code search tool designed for AI agents that significantly reduces token consumption—by approximately 98%—compared to traditional methods like grep+read. It offers rapid, accurate code retrieval on CPU without external dependencies. This development could improve agent performance and efficiency in code understanding tasks.
Semble, a new code search library designed for AI agents, claims to reduce token usage by approximately 98% compared to traditional methods like grep+read, while offering faster and accurate code retrieval on CPU without external dependencies.
Semble is built to enable agents such as Claude Code, Codex, and OpenCode to access code repositories instantly and efficiently. It indexes repositories in about 250 milliseconds and answers queries in roughly 1.5 milliseconds, all on CPU hardware. According to its creators, Semble achieves a retrieval quality comparable to specialized transformer models, with a normalized discounted cumulative gain (NDCG@10) score of 0.854. The system returns only relevant code chunks, drastically reducing token consumption—about 98% fewer tokens than traditional grep+read searches. It can be deployed as an MCP server or used via shell commands, supporting local and remote repositories, and requires no API keys, GPU, or external services.
Setup involves installing via pip or uv, with integration options for popular AI agents like Claude Code, Codex, and OpenCode. Users can perform natural language searches, such as “How is authentication handled?” or find related code snippets based on a specific file and line number. The tool also provides token savings metrics, showing significant efficiency gains over time; for example, users report saving around 1.2 million tokens across all searches.
Why It Matters
This development matters because it offers a more efficient way for AI agents to perform code searches, reducing computational costs and latency. By minimizing token usage, Semble can improve the responsiveness and scalability of code-aware AI systems, making it easier to integrate code search into various workflows without reliance on external APIs or expensive hardware. This could accelerate AI-assisted development, code review, and debugging processes, especially in environments with resource constraints.

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light
【Vehicle CEL Doctor】The NT301 obd2 scanner enables you to read DTCs, access to e-missions readiness status, turn off…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Background
Traditional code search methods, such as grep or transformer-based models, often involve high token consumption and latency, limiting scalability and responsiveness for AI agents. Existing solutions may require external services or GPUs, adding complexity and cost. Semble emerges as a local, CPU-based alternative that combines speed, accuracy, and token efficiency. Its release follows ongoing efforts to optimize AI tool performance and reduce operational costs, aligning with trends toward more self-sufficient AI systems that do not depend on external APIs.
“Semble reduces token consumption by approximately 98%, enabling faster and more efficient code searches for AI agents without external dependencies.”
— Semble development team
“The speed and token savings are impressive; it could significantly improve how agents interact with large codebases.”
— Hacker News user

Inateck 2D Barcode Scanner, Wireless Bluetooth QR Code Scanner with AI APP & SDK, 180-Day Battery Life, Fast & Accurate Scanning, Compatible with iOS/Android/Windows
Powerful Scanning Capability: The Inateck 2D barcode scanner accurately reads almost all 1D and 2D barcodes within a…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
What Remains Unclear
It is not yet clear how Semble performs across diverse and very large codebases in real-world scenarios, or how it compares with transformer-based models in complex search tasks beyond benchmark metrics. Long-term stability and integration ease with various agent frameworks remain to be tested.

ResumeMaker Professional Deluxe 20 – Software to Create Professional Resumes Includes Sample Resumes Written by Certified Resume Writers, Career Advice, Job Searches & Interview Questions – CD – PC
Works on Windows 11, 10, & 8
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
What’s Next
Further adoption and testing by the developer community are expected. Future updates may include expanded features, broader agent integration, and performance benchmarks across different environments. Monitoring user feedback will determine its impact on AI-assisted coding workflows.

4 Pack Telescoping Magnet Pick-up Tool Set – Retrieving Pickup Tools,Extendable Pick Up Tools,Bendable Spring Magnet Stick,Flexible Extra Long Reach Bendable Curve Grabber with 4 Claws
【Quality material】These telescoping magnet sticks are made of telescopic stainless steel tubes, which are hard to break, and…
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
How does Semble achieve such low token usage?
Semble returns only the relevant code chunks needed for a query, avoiding reading entire files or performing exhaustive searches, thus reducing token consumption by about 98% compared to grep+read.
Can Semble be used with any AI agent?
Yes, Semble supports integration with various agents like Claude Code, Codex, and OpenCode via MCP or shell commands, making it versatile for different workflows.
Does Semble require external services or GPUs?
No, it runs entirely on CPU with no API keys or external dependencies needed, simplifying setup and reducing operational costs.
What are the performance benchmarks for Semble?
It indexes repositories in about 250 milliseconds and answers queries in approximately 1.5 milliseconds, with a retrieval quality comparable to specialized transformer models.
What is the future potential of Semble?
As adoption grows, it could become a standard tool for efficient code search in AI workflows, especially in resource-constrained environments, with ongoing updates to improve features and scalability.