Skip to main content

Home/ DJCamp2011/ Group items tagged analytic journalism

Rss Feed Group items tagged

Tom Johnson

Data journalism at the Guardian: what is it and how do we do it? | News | guardian.co.uk - 0 views

  • Data journalism at the Guardian: what is it and how do we do it? Simon Rogers: Our 10 point guide to data journalism and how it's changing Share  reddit this omnitracker.omniTrackEVarEvent( 12, 16, 'News: Reddit', 'click', '.reddit a' ); Comments (2) Data journalism. What is it and how is it changing? Photograph: Alamy Here's an interesting thing: data journalism is becoming part of the establishment. Not in an Oxbridge elite kind of way (although here's some data on that) but in the way it is becoming the industry standard.Two years ago, when we launched the Datablog, all this was new. People still asked if getting stories from data was really journalism and not everyone had seen Adrian Holovaty's riposte. But once you've had MPs expenses and Wikileaks, the startling thing is that no-one asks those questions anymore. Instead, they want to know, "how do we do it?"
  •  
    Data journalism at the Guardian: what is it and how do we do it? Simon Rogers: Our 10 point guide to data journalism and how it's changing Share reddit this Comments (2) Data abstract Data journalism. What is it and how is it changing? Photograph: Alamy Here's an interesting thing: data journalism is becoming part of the establishment. Not in an Oxbridge elite kind of way (although here's some data on that) but in the way it is becoming the industry standard. Two years ago, when we launched the Datablog, all this was new. People still asked if getting stories from data was really journalism and not everyone had seen Adrian Holovaty's riposte. But once you've had MPs expenses and Wikileaks, the startling thing is that no-one asks those questions anymore. Instead, they want to know, "how do we do it?"
Tom Johnson

Shorenstein Center paper argues for collaboration in investigative reporting | Harvard ... - 0 views

  • Shorenstein Center paper argues for collaboration in investigative reporting Thursday, June 2, 2011 Sandy Rowe, former editor of The Oregonian, and Knight Fellow at the Shorenstein Center fall 2010 and spring 2011. Photograph by Martha Stewart Shorenstein Center, Harvard Kennedy School Contact: Janell Simsjanell_sims@harvard.eduhttp://www.hks.harvard.edu/presspol/index.html Media organizations may be able to perform their watchdog roles more effectively working together than apart. That is one conclusion in a new paper, “Partners of Necessity: The Case for Collaboration in Local Investigative Reporting,” authored by Sandy Rowe, former editor of Portland’s The Oregonian. The paper is based on interviews and research that Rowe conducted while serving as a Knight Fellow at the Shorenstein Center on the Press, Politics and Public Policy at Harvard Kennedy School. Rowe’s research examines the theory underpinning collaborative work and shows emerging models of collaboration that can lead to more robust investigative and accountability reporting in local and regional markets. “Growing evidence suggests that collaborations and partnerships between new and established news organizations, universities and foundations may be the overlooked key for investigative journalism to thrive at the local and state levels,” Rowe writes. “These partnerships, variously and often loosely organized, can share responsibility for content creation, generate wider distribution of stories and spread the substantial cost of accountability journalism.” Rowe was editor of The Oregonian from 1993 until January 2010. Under her leadership, the newspaper won five Pulitzer Prizes including the Gold Medal for Public Service. Rowe chairs the Board of Visitors of The Knight Fellowships at Stanford University and is a board member of the Committee to Protect Journalists. From 1984 until April 1993, Rowe was executive editor and vice president of The Virginian-Pilot and The Ledger-Star, Norfolk and Virginia Beach, Virginia. The Virginian-Pilot won the Pulitzer Prize for general news reporting under her leadership. Rowe’s year-long fellowship at the Shorenstein Center was funded by the John S. and James L. Knight Foundation. Read the full paper on the Shorenstein Center’s website.
  •  
    Shorenstein Center paper argues for collaboration in investigative reporting Thursday, June 2, 2011 Sandy Rowe, former editor of The Oregonian, and Knight Fellow at the Shorenstein Center fall 2010 and spring 2011. Photograph by Martha Stewart Shorenstein Center, Harvard Kennedy School Contact: Janell Sims janell_sims@harvard.edu http://www.hks.harvard.edu/presspol/index.html Media organizations may be able to perform their watchdog roles more effectively working together than apart. That is one conclusion in a new paper, "Partners of Necessity: The Case for Collaboration in Local Investigative Reporting," authored by Sandy Rowe, former editor of Portland's The Oregonian. The paper is based on interviews and research that Rowe conducted while serving as a Knight Fellow at the Shorenstein Center on the Press, Politics and Public Policy at Harvard Kennedy School. Rowe's research examines the theory underpinning collaborative work and shows emerging models of collaboration that can lead to more robust investigative and accountability reporting in local and regional markets. "Growing evidence suggests that collaborations and partnerships between new and established news organizations, universities and foundations may be the overlooked key for investigative journalism to thrive at the local and state levels," Rowe writes. "These partnerships, variously and often loosely organized, can share responsibility for content creation, generate wider distribution of stories and spread the substantial cost of accountability journalism." Rowe was editor of The Oregonian from 1993 until January 2010. Under her leadership, the newspaper won five Pulitzer Prizes including the Gold Medal for Public Service. Rowe chairs the Board of Visitors of The Knight Fellowships at Stanford University and is a board member of the Committee to Protect Journalists. From 1984 until April 1993, Rowe was executive editor and vice president of The Virginian-Pi
Tom Johnson

MDA Analytics - 0 views

  •  
    An interesting example of yet another "next generation" data analysis and presentation tool. You can see the demos at http://www.lavastorm.com/ Emphasis is on visualizing the data analytic method while doing the analysis.
Tom Johnson

Data-driven journalism: What is there to learn? - 0 views

  • Data-drivenjournalism:Whatistheretolearn?Apaperonthedata-drivenjournalismroundtableheldinAmsterdamon24August2010.Withadditionalmateri class=
  •  
    Data-drivenjournalism:Whatistheretolearn?Apaperonthedata-drivenjournalismroundtableheldinAmsterdamon24August2010.Withadditionalmaterialondatatools,DDJinnovators,andrecommendedwebsitesandarticles.Theimmediategoalsaretoimproveaccessforinterestedjournalistsandtoidentifytrainingneedsforthefuture
Tom Johnson

Mining of Massive Datasets - 0 views

  •  
    Mining of Massive Datasets The book has now been published by Cambridge University Press. A hardcopy can be obtained Here. By agreement with the publisher, you can still download it free from this page. Cambridge Press does, however, retain copyright on the work, and we expect that you will acknowledge our authorship if you republish parts or all of it. We are sorry to have to mention this point, but we have evidence that other items we have published on the Web have been appropriated and republished under other names. It is easy to detect such misuse, by the way, as you will learn in Chapter 3. --- Anand Rajaraman (@anand_raj) and Jeff Ullman Downloads Download the Complete Book (340 pages, approximately 2MB) Download chapters of the book: Preface and Table of Contents Chapter 1 Data Mining Chapter 2 Large-Scale File Systems and Map-Reduce Chapter 3 Finding Similar Items Chapter 4 Mining Data Streams Chapter 5 Link Analysis Chapter 6 Frequent Itemsets Chapter 7 Clustering Chapter 8 Advertising on the Web Chapter 9 Recommendation Systems Index
Tom Johnson

The Overview Project » VIDEO: document mining with Overview - 0 views

  •  
    VIDEO: document mining with Overview by Jonathan Stray on 10/31/2012 0 With the release of the new, web-only version of Overview that runs in your browser, we thought it was time to make a little video showing how to use it. If that doesn't answer your questions, see also the help page, and the FAQ.
Tom Johnson

ELAN description | The Language Archive - 0 views

  • ELAN description ELAN is a professional tool for the creation of complex annotations on video and audio resources. With ELAN a user can add an unlimited number of annotations to audio and/or video streams. An annotation can be a sentence, word or gloss, a comment, translation or a description of any feature observed in the media. Annotations can be created on multiple layers, called tiers. Tiers can be hierarchically interconnected. An annotation can either be time-aligned to the media or it can refer to other existing annotations. The textual content of annotations is always in Unicode and the transcription is stored in an XML format. ELAN provides several different views on the annotations, each view is connected and synchronized to the media playhead. Up to 4 video files can be associated with an annotation document. Each video can be integrated in the main document window or displayed in its own resizable window. ELAN delegates media playback to an existing media framework, like Windows Media Player, QuickTime or JMF (Java Media Framework). As a result a wide variety of audio and video formats is supported and high performance media playback can be achieved. ELAN is written in the Java programming language and the sources are available for non-commercial use. It runs on Windows, Mac OS X and Linux.
  •  
    ELAN description ELAN is a professional tool for the creation of complex annotations on video and audio resources. With ELAN a user can add an unlimited number of annotations to audio and/or video streams. An annotation can be a sentence, word or gloss, a comment, translation or a description of any feature observed in the media. Annotations can be created on multiple layers, called tiers. Tiers can be hierarchically interconnected. An annotation can either be time-aligned to the media or it can refer to other existing annotations. The textual content of annotations is always in Unicode and the transcription is stored in an XML format. ELAN provides several different views on the annotations, each view is connected and synchronized to the media playhead. Up to 4 video files can be associated with an annotation document. Each video can be integrated in the main document window or displayed in its own resizable window. ELAN delegates media playback to an existing media framework, like Windows Media Player, QuickTime or JMF (Java Media Framework). As a result a wide variety of audio and video formats is supported and high performance media playback can be achieved. ELAN is written in the Java programming language and the sources are available for non-commercial use. It runs on Windows, Mac OS X and Linux.
Tom Johnson

Palantir- Our Work - What We Do - 0 views

  •  
    WHAT WE DO We build software that allows organizations to make sense of massive amounts of disparate data. We solve the technical problems, so they can solve the human ones. Combating terrorism. Prosecuting crimes. Fighting fraud. Eliminating waste. From Silicon Valley to your doorstep, we deploy our data fusion platforms against the hardest problems we can find, wherever we are needed most.
Tom Johnson

Statistical Reasoning I - 0 views

  •  
    Statistical Reasoning 1 http://ocw.jhsph.edu/index.cfm/go/viewCourse/course/StatisticalReasoning1/coursePage/index/ Most people could probably use a bit of a refresher on statistical reasoning and its methods, and this free course from Johns Hopkins University is a great way to get started on the road back to statistical literacy. The course was originally taught by John McGready and provides "a broad overview of biostatistical methods and concepts used in the public health sciences." Users will find that the home page includes links to the course syllabus, schedule, lecture materials, readings, and additional assignments. The Lecture Materials area includes course notes from the seven modules here. The topics include "Describing Data," "An Introduction to Hypothesis Testing," and "When Time Is of Interest: The Case for Survival Analysis." Visitors can also take advantage of the assignments, which correspond to the readings and the lecture materials. The site is completed by the Other Resources area, which includes a special lecture on the software package Stata and a flowchart designed to help students learn how to choose the correct statistical procedure for the task at hand. [KMG]
Tom Johnson

Michelle Minkoff » Learning to love…grep (let the computer search text for you) - 0 views

  • Blog Learning to love…grep (let the computer search text for you) Posted by Michelle Minkoff on Aug 9, 2012 in Blog, Uncategorized | No Comments I’ve gotten into the habit of posting daily learnings on Twitter, but some things require a more in-depth reminder. I also haven’t done as much paying as forward as I’d like (but I’m having a TON of fun!  and dealing with health problems!  but mostly fun!) I’d like to try to start posting more helpful tips here, partially as a notebook for myself, and partially to help others with similar issues. Today’s problem: I needed to search for a few lines of text, which could be contained in any one of nine files with 100,000 lines each. Opening all of the files took a very long time on my computer, not to mention executing a search. Enter the “grep” command in Terminal, that allows you to quickly search files using the power of the computer.
  •  
    Blog Learning to love…grep (let the computer search text for you) Posted by Michelle Minkoff on Aug 9, 2012 in Blog, Uncategorized | No Comments I've gotten into the habit of posting daily learnings on Twitter, but some things require a more in-depth reminder. I also haven't done as much paying as forward as I'd like (but I'm having a TON of fun! and dealing with health problems! but mostly fun!) I'd like to try to start posting more helpful tips here, partially as a notebook for myself, and partially to help others with similar issues. Today's problem: I needed to search for a few lines of text, which could be contained in any one of nine files with 100,000 lines each. Opening all of the files took a very long time on my computer, not to mention executing a search. Enter the "grep" command in Terminal, that allows you to quickly search files using the power of the computer.
  •  
    An easy to use method for content analysis
Tom Johnson

Google Developers Live - 0 views

  •  
    Welcome to Google Developers Live Connect with Googlers, other developers, and industry leaders face-to-face over Google+ Hangouts. Learn more All Categories Check out our upcoming shows
Tom Johnson

Reporters' Lab @ Duke University - 0 views

  •  
    The site now has reviews of common (and some uncommon) tools that promise to help your reporting, projects for the future and soon, we hope, news of promising and interesting use of new methods for reporting.
Tom Johnson

Software for Content Analysis - 0 views

  •  
    "Software for Content Analysis: Links to external sites The list below provides links to web sites where one can find information (often including purchasing information) regarding content analysis software as well as other types of software that are often utilized by content analysts. The list was last updated in December 2008. Some links may change. You might also find Will Lowe's Review of Software for Content Analysis useful. "
Tom Johnson

Want to help fact-check breaking news like the Malaysian airplane disaster? Here's how ... - 0 views

  •  
    "Want to help fact-check breaking news like the Malaysian airplane disaster? Here's how and where you can do it"
Tom Johnson

Visualization contests around the corner - 0 views

  •  
    Visualization contests around the corner May 25, 2011 to Contests | Comments (3) The best way to learn how to visualize data is to grab a dataset and see what you can do with it. You can read as many tips and tricks as you want, but you're not going to get any better until you actually try. Contests are a fun way to do this. Participate So here are a handful of visualization contests to get your hands dirty. Hey you might even win a couple of thousand dollars. Not that money matters to you, because as well all know, learning is your reward. Hacking Education - A contest for developers and data crunchers. DonorsChoose.org has inspired $80 million in giving from 400,000 donors, helping 165,000 teachers at 43,000 schools, and the donation site has opened up this data. Can do you do something with it? Deadline: June 30, 2011. Data In Sight - A hands-on competition in San Francisco's SoMa district with surprise data sources. Some talks, lunch, dinner, and a 24-hour hackathon. Event date: June 24, 2011 (better to register your team early). Tableau Interactive Viz Contest - This one is coming up the quickest, but is the most straightforward. Plus, you get a t-shirt just for entering. Grab some business, finance, or real estate data and go to town with Tableau Public. Deadline: June 3, 2011.
Tom Johnson

Google Correlate - 0 views

  •  
    Google Correlate lets you see how your data relates to search queries Posted: 25 May 2011 11:27 AM PDT Influenza search - Google Correlate A while back, Google showed how Influenza outbreaks correlated to searches for flu-related terms with Google Flu Trends. It helped researchers and policy-makers estimate flu activity much sooner than with previous methods. Google Correlate is the evolution of Flu Trends in that now you can correlate search trends with not just flu cases, but with your own data or other search queries. The above, which you already know about, matches flu cases with searches for "treatment for flu." Similarly, the search phrase that correlates highest with "Toyota for sale" is "used Hyundai," as shown below. You can also see how your data is related geographically. For example, annual rainfall (left) strongly correlates with searches for "disney vacation package." Although, it looks like distance is a strong factor in the latter, which should be a reminder that correlation is different from causation. Google is careful to point this out in their FAQ and explanation of the tool. Nevertheless, it's fun to poke around and sometimes see the non-sensical correlations. For example, the strongest correlation with "flowingdata" is "how to scan a document," because the growth rates of both seem similar. There's also a search by drawing function. You draw a time series, and Correlate finds terms that best match that trend. In the below chart, I drew a line (blue) that had steady growth, but plateaued towards present day. What weird correlations can you find? [Google Correlate]
Tom Johnson

What's in your document? « The Reporter's Lab - 0 views

  •  
    Sarah Cohen, Duke University
Tom Johnson

New: World Health Organization (WHO) Releases Database on Nutrition, Obesity ... - 0 views

  • New: World Health Organization (WHO) Releases Database on Nutrition, Obesity and Physical Activity (NOPA) Posted on June 5, 2011 by Gary D. Price From a WHO Announcement: The NOPA database has been created in close collaboration with health ministries and with support from the European Commission. It includes details on more than 300 national and sub-national policies that address nutrition, physical activity or obesity. Most of these policy documents have been developed in the past six years and refer to lifestyle trends such as the increasing consumption of industrially produced foods high in fat and sugar and declining physical activity. Physical inactivity and poor nutrition are considered to be risk factors to many non-communicable diseases, such as heart diseases and stroke, diabetes, cancer, and mental health disorders. The measurable indicators for policy development have been identified and monitored at the national level in accordance with the commitments made through the European Charter on Counteracting Obesity and the WHO European Action Plan for Food and Nutrition Policy 2007–2012. As an example of these commitments, the Nutrition, Obesity and Physical Activity Database shows that 28 Member States have stated that they plan to take action on physical activity in the school setting, 11 plan to address the adequate labeling of food products and 9 plan to deal with appropriate food marketing practices. Direct to NOPA Database
  •  
    New: World Health Organization (WHO) Releases Database on Nutrition, Obesity and Physical Activity (NOPA) Posted on June 5, 2011 by Gary D. Price From a WHO Announcement: The NOPA database has been created in close collaboration with health ministries and with support from the European Commission. It includes details on more than 300 national and sub-national policies that address nutrition, physical activity or obesity. Most of these policy documents have been developed in the past six years and refer to lifestyle trends such as the increasing consumption of industrially produced foods high in fat and sugar and declining physical activity. Physical inactivity and poor nutrition are considered to be risk factors to many non-communicable diseases, such as heart diseases and stroke, diabetes, cancer, and mental health disorders. The measurable indicators for policy development have been identified and monitored at the national level in accordance with the commitments made through the European Charter on Counteracting Obesity and the WHO European Action Plan for Food and Nutrition Policy 2007-2012. As an example of these commitments, the Nutrition, Obesity and Physical Activity Database shows that 28 Member States have stated that they plan to take action on physical activity in the school setting, 11 plan to address the adequate labeling of food products and 9 plan to deal with appropriate food marketing practices. Direct to NOPA Database http://infodocket.com/2011/06/05/new-world-health-organization-who-releases-database-on-nutrition-obesity-and-physical-activity-nopa/?utm_source=twitterfeed&utm_medium=twitter
Tom Johnson

Download PowerPivot - Excel - Office.com - 0 views

  •  
    Tom Torok (NYT) writes: After years of looking down my nose at Excel because of its limitations, I have to say that I'm very impressed with Excel 2010 when used with a free Microsoft add-in called PowerPivot. http://office.microsoft.com/en-us/excel/download-powerpivot-HA101959985.aspx In a PowerPivot tutorial (link below), I imported eight tables  from several sources and joined them - yes, you can join relational data. It uses some magical data compression that allows for lightning fast sorts, filters and calculated fields. The largest table in the tutorial has about 2 million rows. A calculated field on that table took seconds. A did a pivot table on the table and the answers appeared as soon as I selected the fields. In one of  the training videos (http://www.powerpivot.com/) an MS guy works with a 101 million-record table on his laptop. It's really amazing. http://powerpivotsdr.codeplex.com/ If you install, be sure to read the prerequisites or you'll be installing and uninstalling both PowerPivot and Excel. I'm running it on a 32-bit XP machine (it won't run on a 64-bit XP but will work on Windows 7 64-bit). The tutorial is for a Windows 7 setup, but there are items in the menu bar that match the reference to the tutorial's ribbon. I noticed that if I call up an xlsx by double clicking on a file in Windows Explorer that PowerPivot is not enabled in the ribbon. If you call up a file from within Excel 2010 everything works as advertised.Regards, TT  
Tom Johnson

Open Flash Chart - Home - 0 views

  •  
    Hello, this is the Open Flash Chart project. Note: "Open Flash Chart 2" is LGPL. OK, Open Flash Chart 1.x was great and it works like a dream. But I made some little mistakes which over time grew and anyoyed me and made the source code weird. So I decided it was time to re-jigger the code and make it pretty again. The big change is moving the data format to JSON. This has made a big difference and has allowed some pretty cool new features. While I was hacking away at the source code I moved it all to Actionscript 3, and used Adobe Flex to compile it. This means everything is open source. If you want to make changes to the charts all you need is laid out in these instructions. Just because there is a new version doesn't make V 1.x obsolete. You can use both versions at the same time so leave your current working code in V 1.x and make all the new charts using which ever version you find easier to use. Why is V2 better? Well it uses JSON as the file format and this means you can do cool stuff like Grant Slender has: http://code.google.com/p/ofcgwt/ If you like Open Flash Chart and want to see it continue, please help Donate some money :-) Blog about it (promotion takes up about a third of my time) Write a cool library Really. You can make a massive difference to the project! Need help choosing reseller hosting for your charts? Make sure you read reliable web hosting reviews. Why choose Open Flash Chart? This is a little gentle propaganda for the project. Like all opinions, disregard it and make up your own mind. Edge cases such as tooltips encourage user interactivity and data exploration what happens to the tooltip when two points are in the same position? you can re-size the charts missing data save the chart as an image You can highlight or emphasize one (or many) points PC Pro loves open flash chart. Server Side Helper Libraries PHP, Perl, Python, Ruby, .NET, Google Web Toolkit and JAVA. Libraries. Next: Che
1 - 20 of 31 Next ›
Showing 20 items per page