Add fast memory search mode by Amhitox · Pull Request #188 · XortexAI/XMem

Amhitox · 2026-05-20T13:16:49Z

This PR adds a fast search path for memory lookup, so callers no longer have to go through the full agentic retrieval flow on every request. By default the endpoint returns raw search results; callers can opt into answer synthesis with answer=true when they still want an LLM-generated response.
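As a rough illustration of the two modes described above, the sketch below builds a request body for raw search and for answer mode. The field names query, top_k, domains, org_id, and repo appear in the diff under review; the function name build_search_payload, the default values, and whether answer travels in the body or as a query parameter are assumptions made for this example only.

```python
from typing import Any, Optional, Sequence

# Domain names taken from the PR description.
ALL_DOMAINS = ("profile", "temporal", "summary", "snippet", "code")


def build_search_payload(
    query: str,
    *,
    domains: Sequence[str] = ("profile", "temporal", "summary"),  # assumed default
    top_k: int = 5,                                                # assumed default
    answer: bool = False,  # raw results unless the caller opts in
    org_id: Optional[str] = None,
    repo: Optional[str] = None,
) -> dict[str, Any]:
    """Build a hypothetical request body for the memory search endpoint."""
    unknown = set(domains) - set(ALL_DOMAINS)
    if unknown:
        raise ValueError(f"unknown domains: {sorted(unknown)}")
    # Mirrors the check in the diff: code search needs org_id and repo.
    if "code" in domains and (not org_id or not repo):
        raise ValueError("org_id and repo are required when domains includes 'code'.")
    payload: dict[str, Any] = {
        "query": query,
        "domains": list(domains),
        "top_k": top_k,
        "answer": answer,
    }
    if org_id:
        payload["org_id"] = org_id
    if repo:
        payload["repo"] = repo
    return payload
```

A caller would pass answer=True only when an LLM-generated response is actually needed; everything else stays on the raw path.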

What changed:

expanded /v1/memory/search to return ranked hits from profile, temporal, summary, snippet, and code memory
kept raw search as the default path, with optional answer generation behind answer=true
added per-domain timing data and p50/p95/p99 latency snapshots for raw vs answer mode
added short-lived caching for profile catalogs and retrieval plans to avoid repeated planning work
covered the raw and answer search behavior with focused API tests
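The p50/p95/p99 snapshots mentioned in the list above could be computed along these lines. This is a minimal sketch, not the PR's implementation: the class name LatencyTracker, the bounded window, and the nearest-rank percentile method are all assumptions.

```python
from collections import defaultdict, deque


class LatencyTracker:
    """Keeps a bounded window of latency samples per mode (e.g. "raw", "answer")."""

    def __init__(self, window: int = 1000) -> None:
        self._samples: dict[str, deque] = defaultdict(lambda: deque(maxlen=window))

    def record(self, mode: str, elapsed_ms: float) -> None:
        self._samples[mode].append(elapsed_ms)

    def snapshot(self, mode: str) -> dict[str, float]:
        """Return nearest-rank p50/p95/p99 for one mode, or {} with no samples."""
        ordered = sorted(self._samples.get(mode, ()))
        if not ordered:
            return {}

        def pct(p: int) -> float:
            # Nearest-rank: the smallest sample with at least p% of samples <= it.
            # Integer ceiling division avoids floating-point rounding surprises.
            rank = max(1, (p * len(ordered) + 99) // 100)
            return ordered[rank - 1]

        return {"p50": pct(50), "p95": pct(95), "p99": pct(99)}
```

Tracking raw and answer mode under separate keys keeps the slower LLM path from skewing the raw-search percentiles.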

Tested with:

.venv/bin/python -m pytest tests/api/test_dependencies_and_routes.py
.venv/bin/python -m py_compile src/api/schemas.py src/api/routes/memory.py src/pipelines/retrieval.py tests/api/test_dependencies_and_routes.py

Fixes #163

gemini-code-assist

Code Review

This pull request enhances the /search endpoint by adding optional LLM-based answer synthesis, expanding search domains to include code and snippets, and implementing detailed latency tracking. It also introduces TTL-based caching for retrieval plans and profile catalogs to improve efficiency. Review feedback identifies an opportunity to further optimize search performance by executing domain queries concurrently and suggests removing an unnecessary side effect in the profile search logic.
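For readers unfamiliar with TTL-based caching, the idea can be sketched as follows. The class name TTLCache, its get_or_compute method, and the injectable clock are hypothetical; the PR's actual cache for retrieval plans and profile catalogs may be structured differently.

```python
import time
from typing import Any, Callable, Hashable


class TTLCache:
    """Caches computed values and recomputes them once they are older than the TTL."""

    def __init__(self, ttl_seconds: float, clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._entries: dict[Hashable, tuple[float, Any]] = {}

    def get_or_compute(self, key: Hashable, compute: Callable[[], Any]) -> Any:
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]  # still fresh: skip the expensive call
        value = compute()
        self._entries[key] = (now + self._ttl, value)
        return value
```

A short TTL trades a small window of staleness for skipping repeated planning or catalog fetches on back-to-back searches by the same user.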

gemini-code-assist · 2026-05-20T13:20:00Z

+        if "profile" in domains:
+            all_results.extend(await _timed_sync("profile", timings, _search_profile, pipeline, user_id))
+        if "temporal" in domains:
+            all_results.extend(await _timed_sync("temporal", timings, _search_temporal, pipeline, req.query, user_id, req.top_k))
+        if "summary" in domains:
+            all_results.extend(await _timed_async("summary", timings, _search_summary, pipeline, req.query, user_id, req.top_k))
+        if "snippet" in domains:
+            all_results.extend(await _timed_async("snippet", timings, _search_snippet, pipeline, req.query, user_id, req.top_k))
+        if "code" in domains:
+            if not req.org_id or not req.repo:
+                return _error(request, "org_id and repo are required when domains includes 'code'.", 400, 0)
+            all_results.extend(await _timed_async("code", timings, _search_code, req, user_id))


The domain searches are currently executed sequentially. To improve performance for this 'fast search' endpoint, these searches should be run concurrently using asyncio.gather.

Suggested change

-        if "profile" in domains:
-            all_results.extend(await _timed_sync("profile", timings, _search_profile, pipeline, user_id))
-        if "temporal" in domains:
-            all_results.extend(await _timed_sync("temporal", timings, _search_temporal, pipeline, req.query, user_id, req.top_k))
-        if "summary" in domains:
-            all_results.extend(await _timed_async("summary", timings, _search_summary, pipeline, req.query, user_id, req.top_k))
-        if "snippet" in domains:
-            all_results.extend(await _timed_async("snippet", timings, _search_snippet, pipeline, req.query, user_id, req.top_k))
-        if "code" in domains:
-            if not req.org_id or not req.repo:
-                return _error(request, "org_id and repo are required when domains includes 'code'.", 400, 0)
-            all_results.extend(await _timed_async("code", timings, _search_code, req, user_id))
+        tasks = []
+        if "profile" in domains:
+            tasks.append(_timed_sync("profile", timings, _search_profile, pipeline, user_id))
+        if "temporal" in domains:
+            tasks.append(_timed_sync("temporal", timings, _search_temporal, pipeline, req.query, user_id, req.top_k))
+        if "summary" in domains:
+            tasks.append(_timed_async("summary", timings, _search_summary, pipeline, req.query, user_id, req.top_k))
+        if "snippet" in domains:
+            tasks.append(_timed_async("snippet", timings, _search_snippet, pipeline, req.query, user_id, req.top_k))
+        if "code" in domains:
+            if not req.org_id or not req.repo:
+                return _error(request, "org_id and repo are required when domains includes 'code'.", 400, 0)
+            tasks.append(_timed_async("code", timings, _search_code, req, user_id))
+        results_from_domains = await asyncio.gather(*tasks)
+        for domain_results in results_from_domains:
+            all_results.extend(domain_results)
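The concurrent pattern suggested above can be tried in isolation with the self-contained sketch below. The helpers timed_sync, timed_async, and search_domains are stand-ins written for this example, not the PR's _timed_sync and _timed_async, whose bodies are not shown here. Two assumptions are worth calling out: synchronous searchers are pushed to a worker thread with asyncio.to_thread so they do not block the event loop, and request validation runs before any coroutine is created, since returning early after building coroutine objects would leave them un-awaited and trigger a RuntimeWarning.

```python
import asyncio
import time
from typing import Any, Callable


async def timed_async(name: str, timings: dict, func: Callable, *args: Any) -> list:
    """Await an async searcher and record its wall-clock time in milliseconds."""
    start = time.perf_counter()
    try:
        return await func(*args)
    finally:
        timings[name] = (time.perf_counter() - start) * 1000


async def timed_sync(name: str, timings: dict, func: Callable, *args: Any) -> list:
    """Run a blocking searcher in a worker thread and record its time."""
    start = time.perf_counter()
    try:
        return await asyncio.to_thread(func, *args)
    finally:
        timings[name] = (time.perf_counter() - start) * 1000


async def search_domains(
    domains: list[str],
    searchers: dict[str, tuple[bool, Callable]],  # name -> (is_async, callable)
    query: str,
) -> tuple[list, dict]:
    # Validate first, so no coroutine objects exist if we bail out early.
    unknown = [d for d in domains if d not in searchers]
    if unknown:
        raise ValueError(f"unknown domains: {unknown}")

    timings: dict[str, float] = {}
    tasks = []
    for name in domains:
        is_async, func = searchers[name]
        wrapper = timed_async if is_async else timed_sync
        tasks.append(wrapper(name, timings, func, query))

    # gather runs the searches concurrently and preserves the input order.
    per_domain = await asyncio.gather(*tasks)
    merged = [hit for hits in per_domain for hit in hits]
    return merged, timings
```

With this shape, total latency is roughly that of the slowest domain rather than the sum of all domains, while the per-domain timings remain available for reporting.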

gemini-code-assist · 2026-05-20T13:20:01Z

-        )
-        return [SourceRecord(domain="profile", content=r.content, score=r.score, metadata=r.metadata) for r in raw]
+        _, raw = pipeline._fetch_profile_catalog(user_id)
+        pipeline._cached_profile_records = raw


This line appears to be unnecessary. The _cached_profile_records attribute is not used within this file's logic, and setting it here creates a potentially confusing side effect on the pipeline object. Please consider removing it.
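One way to read this suggestion is that the profile search can build its results from the fetched catalog without writing anything back onto the pipeline. The sketch below is illustrative only: the SourceRecord fields follow the removed line in the diff, while the public fetch_profile_catalog method, its two-element return value, and the attribute names on the raw records are assumptions.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class SourceRecord:
    domain: str
    content: str
    score: float
    metadata: dict = field(default_factory=dict)


def search_profile(pipeline: Any, user_id: str) -> list[SourceRecord]:
    """Map catalog records to SourceRecord without mutating the pipeline."""
    # Assumed to return a (catalog, records) pair, as the diff's tuple unpacking suggests.
    _, raw = pipeline.fetch_profile_catalog(user_id)
    return [
        SourceRecord(domain="profile", content=r.content, score=r.score, metadata=r.metadata)
        for r in raw
    ]
```

Keeping the function free of side effects means repeated or concurrent searches cannot observe stale state left behind by an earlier call.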

Add fast memory search mode

b4343eb

Amhitox requested review from ishaanxgupta and ved015 as code owners May 20, 2026 13:16

github-actions Bot added the tests, api, and pipelines labels May 20, 2026

gemini-code-assist Bot reviewed May 20, 2026


Amhitox mentioned this pull request May 20, 2026

Add low-latency raw search path separate from agentic answer synthesis #163

Open

Amhitox wants to merge 1 commit into XortexAI:main from Amhitox:bounty/xmem-163-search-fastpath
