📅 2023-01-04 — Session: Enhanced BibTeX to JSON conversion and data manipulation

🕒 09:40–10:45
🏷️ Labels: Python, BibTeX, Data Manipulation, Pandas, VS Code
📂 Project: Dev
⭐ Priority: MEDIUM

Session Goal

The goal was to extend a Python script that converts BibTeX files to JSON and CSV, alongside data manipulation tasks in Pandas.

Key Activities

  • Developed a Python script to convert BibTeX entries into JSON and CSV formats, focusing on rearranging and sorting entries by year.
  • Added error handling to the bibtex_to_json function, plus options for sorting by a chosen field and specifying output file names.
  • Used Pandas for data manipulation: converting DataFrame columns to numeric types, removing unwanted characters, and replacing ‘null’ values with np.nan.
  • Explored text editing techniques in Visual Studio Code, such as indenting/un-indenting and converting cells to Markdown.
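
The conversion step can be sketched as follows. This is a minimal illustration, not the script written in the session: only the name bibtex_to_json comes from the log, while parse_bibtex, the regular expressions, and the sort_by, json_path, and csv_path parameters are assumptions. The parser handles only flat fields (no nested braces or string concatenation).

```python
import csv
import json
import re

# Matches the "@type{key," header that starts each entry.
ENTRY_RE = re.compile(r"@(\w+)\s*\{\s*([^,\s]+)\s*,")
# Matches flat fields: name = {value}, name = "value", or name = 1234.
FIELD_RE = re.compile(r'(\w+)\s*=\s*(?:\{([^{}]*)\}|"([^"]*)"|(\d+))')


def parse_bibtex(text):
    """Parse simple BibTeX text into a list of dicts (hypothetical helper)."""
    entries = []
    headers = list(ENTRY_RE.finditer(text))
    for i, header in enumerate(headers):
        # An entry body runs from its header to the next header (or the end).
        end = headers[i + 1].start() if i + 1 < len(headers) else len(text)
        entry = {"type": header.group(1).lower(), "key": header.group(2)}
        for field in FIELD_RE.finditer(text[header.end():end]):
            value = field.group(2) or field.group(3) or field.group(4) or ""
            entry[field.group(1).lower()] = value.strip()
        entries.append(entry)
    return entries


def bibtex_to_json(text, sort_by="year", json_path=None, csv_path=None):
    """Sort parsed entries by one field and optionally write JSON and CSV."""
    entries = parse_bibtex(text)
    if not entries:
        raise ValueError("no BibTeX entries found")

    def sort_key(entry):
        value = entry.get(sort_by, "")
        # Numeric values (such as years) sort numerically and come first;
        # anything else, including a missing field, sorts as text after them.
        return (0, int(value), "") if value.isdigit() else (1, 0, value)

    entries.sort(key=sort_key)

    if json_path:
        with open(json_path, "w", encoding="utf-8") as fh:
            json.dump(entries, fh, indent=2, ensure_ascii=False)
    if csv_path:
        # Entries may have different fields, so use the union as the header.
        fieldnames = sorted({name for entry in entries for name in entry})
        with open(csv_path, "w", newline="", encoding="utf-8") as fh:
            writer = csv.DictWriter(fh, fieldnames=fieldnames)
            writer.writeheader()
            writer.writerows(entries)
    return entries
```

Calling bibtex_to_json(text, sort_by="year", json_path="refs.json", csv_path="refs.csv") would return the sorted entries and write both files; the file names here are placeholders. Real-world .bib files with nested braces or @string macros would need a dedicated parser.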

Achievements

  • Produced a working Python script that converts BibTeX data, sorts it, and saves it as JSON or CSV.
  • Improved the Pandas workflow for data cleaning and type conversion.
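
The Pandas cleaning steps noted in this log (replacing ‘null’ with np.nan, removing unwanted characters, converting to numeric) might look roughly like the sketch below. The function name clean_numeric_columns, the column names, the sample values, and the default set of characters to strip are all illustrative assumptions, not the session's actual data or code.

```python
import numpy as np
import pandas as pd


def clean_numeric_columns(frame, columns, unwanted=r"[{},]"):
    """Return a copy of frame with the given text columns converted to numbers."""
    out = frame.copy()
    for col in columns:
        # Turn the literal string 'null' into a real missing value.
        cleaned = out[col].replace("null", np.nan)
        # Strip unwanted characters (here: braces and thousands separators).
        cleaned = cleaned.str.replace(unwanted, "", regex=True)
        # Anything that still cannot be parsed becomes NaN instead of raising.
        out[col] = pd.to_numeric(cleaned, errors="coerce")
    return out


# Hypothetical sample data in the shape a BibTeX export might produce.
df = pd.DataFrame({
    "year": ["2019", "{2021}", "null"],
    "citations": ["1,200", "null", "35"],
})
cleaned_df = clean_numeric_columns(df, ["year", "citations"])
```

Using errors="coerce" trades strictness for convenience: malformed values silently become NaN, so it is worth checking cleaned_df.isna().sum() afterwards to see how many values were lost.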

Pending Tasks

  • Extend the bibtex_to_json function to support additional BibTeX entry types and parse more fields.
  • Continue exploring Python scripting for data management and conversion tasks.