Developed Embedding and Visualization Techniques for LlamaIndex

  • Day: 2025-07-22
  • Time: 23:20 to 00:00
  • Project: Dev
  • Workspace: WP 2: Operational
  • Status: Completed
  • Priority: MEDIUM
  • Assignee: Matías Nehuen Iglesias
  • Tags: Llamaindex, Embedding, Markdown, Visualization, TMAP

Description

Session Goal: The session aimed to explore and develop techniques for embedding and visualizing data using LlamaIndex and related tools.

Key Activities:

  • Conducted search queries on AI tools and techniques for 2025, focusing on hierarchical markdown parsing, TMAP visualization, and context tree indexing with embeddings.
  • Investigated the TMAP library for visualizing embeddings, including minimum spanning tree visualization.
  • Explored MarkdownNodeParser metadata and its integration with LlamaIndex, focusing on header paths and metadata mode.
  • Developed a structured approach for parsing Markdown documents into hierarchical nodes, creating embeddings, and clustering these embeddings for visualization.
  • Outlined processes for embedding header paths in LlamaIndex, including multi-scale parsing and embedding.
  • Documented an end-to-end workflow for processing JSONL chat logs into Markdown, embedding them, and visualizing results using TMAP and dendrograms.
  • Debugged and embedded nodes for a LinkedIn bio project, including thematic mapping.

Achievements:

  • Established a comprehensive understanding of embedding techniques and visualization using LlamaIndex and MarkdownNodeParser.
  • Developed practical workflows with code snippets for embedding and visualizing data.

Pending Tasks:

Evidence

  • source_file=2025-07-22.sessions.jsonl, line_number=2, event_count=0, session_id=ef1f4674590a4a3cf11bc6e07c202461342d8fbb275353ae8fcd7ef97945fa74
  • event_ids: []