CC Open Source Blog
CC Search is out of beta with 300M images and easier attribution
Today CC Search comes out of beta, with over 300 million images indexed from multiple collections, a major redesign, and faster, more relevant search. It’s the result of a huge amount of work from the engineering team at Creative Commons and our community of volunteer developers.
CC Search searches images across 19 collections pulled from open APIs and the Common Crawl dataset, including cultural works from museums (the Metropolitan Museum of Art, Cleveland Museum of Art), graphic designs and art works (Behance, DeviantArt), photos from Flickr, and an initial set of CC0 3D designs from Thingiverse.
Aesthetically, you’ll see some key changes — a cleaner home page, better navigation and filters, design alignment with creativecommons.org, streamlined attribution options, and clear channels for providing feedback on both the overall function of the site and on specific image reuses. It’s also now linked directly from the Creative Commons homepage as the default method to search for CC-licensed works, and replaces the old search portal (though that tool is still online here).
Under the hood, we improved search loading times and search phrase relevance, implemented analytics to better understand when and how the tools are used, and fixed many critical bugs our community helped us to identify.
We will continue to grow the number of images in our catalog, prioritizing key image collections such as Europeana and Wikimedia Commons. We also plan to index additional types of CC-licensed works, such as open textbooks and audio, later this year. While our ultimate goal remains the same (to provide access to all 1.4 billion works in the commons), we are initially focused on images that creators desire to reuse in meaningful ways, learning about how these images are reused in the wild, and incorporating that learning back into CC Search.
Feature-wise, we have specific deliverables for this quarter listed in our roadmap, which includes advanced filters on the home page, the ability to browse collections without entering search terms, and improved accessibility and UX on mobile. In addition, we expect some work related to CC Search will be done by our Google Summer of Code students starting in May.
We’re also presenting the “State of CC Search” at the CC Global Summit next month in Lisbon, Portugal, where we’ll host a global community discussion around desired features and collections for CC Search.
All of our code, including the code behind CC Search, is open source (CC Search, CC Catalog API, CC Catalog) and we welcome community contribution. If you know how to code, we invite you to join the growing CC developer community.
CC Search is also made possible by a number of institutional and individual supporters and donors. Specifically, we would like to thank Arcadia – a charitable fund of Lisbet Rausing and Peter Baldwin, Mozilla, and the Brin Wojcicki Foundation for their support.
This post was originally posted on the primary Creative Commons blog.