Dataset
The catalog metadata comes from the Free Music Archive. The frontend uses static assets hosted in the music-sim-capstone-data S3 bucket.
This project uses machine learning embeddings and nearest-neighbor search to surface songs that are acoustically or stylistically related. Browse the catalog, search by song title, view spectrograms, and inspect each track’s 20 most similar songs.
Tracks are embedded into a shared vector space. A FAISS index is then used to retrieve the 20 nearest neighbors for each track.
Each song page can display a precomputed spectrogram and a short audio clip so that visual patterns and audible texture can be compared side by side.
Each track is passed through the trained audio pipeline to produce a normalized embedding vector.
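The page does not say which normalization the pipeline applies. As a minimal sketch, assuming L2 (unit-length) normalization, the step could look like the following. The function names are illustrative and not taken from the project.

```python
import math


def l2_normalize(vec):
    """Scale vec to unit Euclidean length (assumed normalization scheme)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        # A zero vector has no direction; return a copy unchanged.
        return list(vec)
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """For unit-length inputs, the dot product equals the cosine similarity."""
    return sum(x * y for x, y in zip(a, b))


# Toy 2-dimensional "embedding"; real embeddings would have many more dimensions.
unit = l2_normalize([3.0, 4.0])
```

With unit-length vectors, ranking by dot product and ranking by Euclidean distance give the same order, which is one common reason to normalize before nearest-neighbor search.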
For every track, the top 20 closest songs are stored so the application can return results quickly.
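The idea of storing each track's neighbors ahead of time can be sketched as below. This is a brute-force stand-in written with the standard library only, not the FAISS index the project uses, and it assumes the embeddings are already unit-length so that a dot product serves as the similarity score. The name `top_k_neighbors` and the toy data are hypothetical.

```python
import heapq


def top_k_neighbors(embeddings, k=20):
    """Map each track id to its k most similar other tracks, best first.

    embeddings: dict of track_id -> unit-length vector (list of floats).
    Brute force: compares every track with every other track.
    """
    table = {}
    for tid, vec in embeddings.items():
        scored = (
            (sum(x * y for x, y in zip(vec, other)), oid)
            for oid, other in embeddings.items()
            if oid != tid  # a track is not its own neighbor
        )
        table[tid] = [oid for _, oid in heapq.nlargest(k, scored)]
    return table


# Toy unit vectors standing in for real track embeddings.
toy = {
    "a": [1.0, 0.0],
    "b": [0.8, 0.6],
    "c": [0.0, 1.0],
    "d": [-1.0, 0.0],
}
neighbors = top_k_neighbors(toy, k=2)
```

Because the table is built once, answering a request for similar songs becomes a dictionary lookup rather than a search. The brute-force loop grows quadratically with catalog size, which is where an index library such as FAISS becomes useful.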
Static pages call the backend for search, genre listings, track metadata, and similar song results.
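The four kinds of backend request listed above can be illustrated with an in-memory sketch. Everything here is hypothetical: the function names, the toy catalog entries, and the case-insensitive substring matching are assumptions for illustration, and the project's actual routes and storage are not described on this page.

```python
# Made-up catalog entries and a precomputed neighbor table, for illustration only.
CATALOG = {
    1: {"title": "Night Drive", "genre": "Electronic"},
    2: {"title": "Morning Rain", "genre": "Folk"},
    3: {"title": "Drive Home", "genre": "Rock"},
}
SIMILAR = {1: [3, 2], 2: [3, 1], 3: [1, 2]}


def search_titles(query):
    """Return ids of tracks whose title contains the query (case-insensitive)."""
    q = query.strip().lower()
    return [tid for tid, track in CATALOG.items() if q in track["title"].lower()]


def list_genres():
    """Return the distinct genres in alphabetical order."""
    return sorted({track["genre"] for track in CATALOG.values()})


def track_metadata(tid):
    """Return the metadata record for one track, or None if the id is unknown."""
    return CATALOG.get(tid)


def similar_tracks(tid):
    """Return the titles of the stored neighbors for one track."""
    return [CATALOG[n]["title"] for n in SIMILAR.get(tid, [])]
```

In a deployed service each function would sit behind an HTTP route that the static pages call, with the results serialized as JSON.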