Then we put the data up, but the problem with Solr was it didn’t have a user interface, so we used Project Blacklight, which is open source software normally used by librarians. We used it for the journalists. It’s simple because it allows you to do faceted search—so, for example, you can facet by the folder structure of the leak, by years, by type of file. There were more complex things—it supports queries in regular expressions, so the more advanced users were able to search for documents with a certain pattern of numbers that, for example, passports use. You could also preview and download the documents. ICIJ open-sourced the code of our document processing chain, created by our web developer Matthew Caruana Galizia.
We also developed a batch-searching feature. So say you were looking for politicians in your country—you just run it through the system, and you upload your list to Blacklight and you would get a CSV back saying yes, there are matches for these names—not only exact matches, but also matches based on proximity. So you would say “I want Mar Cabra proximity 2” and that would give you “Mar Cabra,” “Mar whatever Cabra,” “Cabra, Mar,”—so that was good, because very quickly journalists were able to see… I have this list of politicians and they are in the data!
6More
The People and Tech Behind the Panama Papers - Features - Source: An OpenNews project - 0 views
1More
All Nations Lose with TPP's Expansion of Copyright Terms | Electronic Frontier Foundation - 0 views
1More
European Press Prize: The Awards for Excellence in Journalism. - 0 views
1More
ACLU to appellate court: Please halt NSA's resumed bulk data collection | Ars Technica - 0 views
1More
Samsung's Linux-Based Tizen Phone Proves a Success [# ! see url note] - 0 views
1More
Google DMCA Notice Record Smashed Again - But Why? - TorrentFreak - 1 views
3More
Obama lawyers asked secret court to ignore public court's decision on spying | US news ... - 0 views
8More
Google Chrome Listening In To Your Room Shows The Importance Of Privacy Defense In Depth - 0 views
3More
How Social Media is Upending the Enterprise - 0 views
1More
Big media fails to turn ISPs into copyright cops | Media Maverick - CNET News - 1 views
1More
Red 4.0 - A Full Ruby Runtime in Your Browser « Trek - 0 views
1More
Inside Citizen Lab, the "Hacker Hothouse" protecting you from Big Brother | Ars Technica - 0 views
‹ Previous
21 - 40 of 109
Next ›
Last »
Showing 20▼ items per page