Skip to content

rroy30/Spotify-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 

Repository files navigation

🎧 Spotify Data Analysis Project

This project explores and analyzes Spotify's top songs dataset from 2023 and previous years, offering insights into trends, artist performances, streaming patterns, and more. Using Python libraries such as Pandas, Matplotlib, and Seaborn, we extract meaningful visualizations and statistical summaries from over 950 songs. πŸ“Š Features

Data Cleaning & Preprocessing
Handled missing values and formatted data for accurate analysis.

Descriptive Statistics
Generated comprehensive statistical summaries including artist count, release years, and playlist/chart appearances.

Exploratory Data Analysis (EDA)

    Most featured artists and their frequency

    Number of songs released per year

    Distribution of stream counts

    Relationships between various features (e.g., artist count vs. streams)

Visualizations
Included histograms, box plots, line plots, count plots, and pair plots to uncover trends and patterns in the data.

πŸ“ Dataset

Source: [Spotify Top Tracks 2023]

Rows: 953 songs

Columns: 24 features including track_name, artist(s)_name, streams, bpm, energy_%, and more.

πŸ› οΈ Tools & Libraries

Python

Pandas

NumPy

Matplotlib

Seaborn

πŸ“ˆ Key Insights

Taylor Swift, The Weeknd, and Bad Bunny were among the most frequent artists.

2022 and 2023 saw the highest number of releases in the dataset.

Stream counts vary widely, with several outliers skewing the distribution.

Most songs had a single artist, but collaborative tracks also performed well.

πŸ“Œ Future Enhancements

Predictive modeling to forecast song popularity

Genre-based analysis

About

A short and simple data analysis project on spotify data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published