Data Mining is in Fashion
Few magazines can boast being continuously published for over a century, familiar and interesting to almost everyone, full of iconic pictures — and also completely digitized and marked up as both text and images. What can you do with over 2,700 covers, 400,000 pages, 6 TB of data? Students, librarians and faculty are excited about the possibilities of working with Vogue to explore questions in fields from gender studies to computer science. We highlight some early experiments below:
Using temporal word embeddings to study the shifting notion of beauty in Vogue, by Sydney Bowen '21.
View figures on circulation, ratio of articles to advertisements, price per issue, and number of pages per year.
Using Word Embedding Models (word2vec) to explore hierarchical clusters of fabric types.