Overview
Retrieve relevant facts from the knowledge graph based on current conversation context and an optional focal entity. This is the preferred method for context-aware memory retrieval during conversations.Use this endpoint when you have conversation messages for context. For standalone queries, use
/search.Authentication
Your API key for authentication
Request Body
User identifier (group ID) to retrieve memories for
UUID of entity to focus retrieval around. Set to
null for general retrieval across all user’s memory.Use
null when you don’t have a specific focal entity. Use a UUID when retrieving facts about a specific project, person, or concept.Current conversation messages to build the retrieval query from
Maximum number of facts to return. Typical range: 10-20.
Reranking method for result ordering:
rrf- Reciprocal Rank Fusion (good general purpose)mmr- Maximal Marginal Relevance (balances relevance + diversity)cross_encoder- Deep learning reranker (most accurate, slower)
Minimum relevance score threshold:
- For
rrfandcross_encoder: 0.0 to 1.0 (defaults to 0.0) - For
mmr: -1.0 to 1.0 (defaults to -1.0)
Limit facts to specific conversation/source
MMR diversity parameter (0.0-1.0, only used when
reranker="mmr"):0.0= Maximum diversity1.0= Maximum relevance0.7= Balanced (recommended)
Response
Returns an array of relevant facts with relevance scores.Array of fact objects relevant to the conversation context
Example Request
Example Response
Error Responses
Request validation failed - check your parameters
Best Practices
Use center_node_uuid strategically - Set to
null for general retrieval, or provide a specific entity UUID when you want facts centered around a project/person/conceptInclude recent conversation context - Provide the last few messages for best semantic matching
Adjust max_facts based on use case - Use 10-20 for chat applications, more for comprehensive analysis
Use MMR reranker -
reranker="mmr" with mmr_lambda=0.7 provides good balance between relevance and diversityResponse time: Typically 200-800ms. This is a synchronous endpoint - you’ll get results immediately.