📅 2023-05-18 — Session: Executed Data Manipulation and Visualization Tasks
🕒 02:20–03:00
🏷️ Labels: Python, Pandas, Seaborn, Git, Data Visualization
📂 Project: Dev
⭐ Priority: MEDIUM
Session Goal: The session aimed to perform various data manipulation and visualization tasks using Python, focusing on Git repository management, pandas DataFrame operations, and data visualization techniques.
Key Activities:
- Configured a
.gitignorefile to exclude thedatosfolder from a Git repository, including staging and committing changes. - Utilized pandas to group and aggregate a DataFrame by monthly frequency using
groupbyandresamplefunctions. - Created a date table by aggregating earliest and latest dates for each
serie_idusing pandas. - Formatted dates in a pandas DataFrame to ‘yyyy-mm’ format.
- Developed a correlation matrix heatmap using Seaborn, with customization options for color maps and annotations.
- Updated the heatmap code for larger cells and custom row labels.
- Applied hierarchical clustering to time series data and visualized it with a dendrogram using
scipyandseaborn. - Aligned time series data for hierarchical clustering to ensure accuracy.
- Implemented Dynamic Time Warping (DTW) for comparing time series of different lengths.
Achievements: Successfully executed and documented multiple data manipulation and visualization tasks, enhancing the understanding of pandas and Seaborn for data analysis.
Pending Tasks: None identified.