memory

dan/memory

mirror of https://github.com/mruwnik/memory.git synced 2026-01-02 09:12:58 +01:00

History

mruwnik 782b56939f Refactor search: add LLM query analysis, extract constants

- Add query_analysis.py for LLM-based query preprocessing
  - Detects modalities from natural language ("on lesswrong" -> forum)
  - Cleans meta-language ("I remember reading..." -> core query)
  - Generates query variants for better recall
  - Dynamically discovers modalities and domains from database

- Extract constants to constants.py
  - STOPWORDS, RRF_K, boost values, etc.
  - Cleaner separation of configuration from logic

- Refactor search_chunks into focused helper functions
  - _run_llm_analysis: parallel query analysis + HyDE
  - _apply_query_analysis: apply analysis results
  - _build_search_data: construct search data with variants
  - _run_searches: embedding + BM25 with RRF fusion
  - _fetch_chunks: database retrieval with scoring
  - _apply_boosts: title, popularity, recency boosts
  - _apply_reranking: cross-encoder reranking

- Remove redundant regex-based modality detection
- Remove static QUERY_EXPANSIONS (LLM handles this better)
- Add comprehensive tests for query_analysis module

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

2025-12-21 14:43:17 +00:00

data

better emails embedding + format search results

2025-06-09 13:51:58 +02:00

integration

fix tests

2025-06-10 15:32:34 +02:00

memory

Refactor search: add LLM query analysis, extract constants

2025-12-21 14:43:17 +00:00

providers

fix linting