If you’re using the Yahoo Flickr Creative Commons 100 Million (YFCC100M) dataset in your research, please cite:

Bart Thomee, David A. Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, and Li-Jia Li. 2016. YFCC100M: The New Data in Multimedia Research. Communications of the ACM 59(2), pp. 64-73.

@article{thomee2016yfcc100m,
  author = "Bart Thomee and David A. Shamma and Gerald Friedland and Benjamin Elizalde and Karl Ni and Douglas Poland and Damian Borth and Li-Jia Li",
  title = "{YFCC100M}: The New Data in Multimedia Research",
  journal = "Communications of the {ACM}",
  volume = "59",
  number = "2",
  pages = "64--73",
  year = "2016",
  url = "",
}

Note: A preprint version of this article was made available as a PDF under a different title: “The New Data and New Challenges in Multimedia Research”. Please cite the published version instead.

For the preferred citations for computed features, annotations, and tools, please see the pages for those resources.