Overview
This guide documents the exploratory data analysis performed on the Historia Para Gandules video dataset, which includes 121 videos analyzed across multiple engagement metrics and content categories.Dataset Summary
The analysis covers:- Total Videos: 121
- Metrics Analyzed: Likes, Comments, Views, Video Duration
- Content Categories: 5 main categories
- Time Period: Historical content from Las Palmas de Gran Canaria and Canary Islands
Loading and Preparing Data
Key Metrics
Descriptive Statistics
The dataset shows the following distribution across 121 videos:| Metric | Mean | Std Dev | Min | 25% | Median | 75% | Max |
|---|---|---|---|---|---|---|---|
| Likes | 1,316 | 1,930 | 304 | 640 | 828 | 1,250 | 14,659 |
| Comments | 39 | 49 | 3 | 18 | 27 | 39 | 361 |
| Views | 15,392 | 39,250 | 2,277 | 4,926 | 6,294 | 10,244 | 337,001 |
| Duration (s) | 50.08 | 18.22 | 26 | 38.13 | 45.90 | 56.20 | 133.49 |
Top Performing Videos
The top 5 videos by likes demonstrate exceptional engagement:- 14,659 likes - Canarian identity video (255,191 views, 361 comments)
- 14,514 likes - Puerto de la Luz construction (337,001 views, 333 comments)
- 5,259 likes - Las Canteras beach history (82,054 views, 99 comments)
- 4,497 likes - Historic train on maritime avenue (69,165 views, 179 comments)
- 4,384 likes - Historical event from 1755 (38,606 views, 110 comments)
Data Distribution Analysis
Engagement by Category
Visualizing Category Performance
Interactive Visualization
Likes vs Comments Scatter Plot
- Strong positive correlation between likes and comments
- Bubble size represents view count
- Color coding shows content category distribution
- Hover to see video titles and exact metrics
Key Insights
Engagement Patterns: Videos about Canarian identity and major historical infrastructure projects generate the highest engagement, with views exceeding 250,000 and comments reaching over 300.
Next Steps
- Statistical Analysis - Deep dive into correlations and statistical tests
- Category Analysis - Detailed breakdown by content type
Code Repository
All analysis code is available in the project’sEDA.ipynb notebook with reproducible results and visualizations.