Home/ science/ Group items tagged data

Janos Haits

Named Data Networking (NDN) - A Future Internet Architecture

  •  
    'The Named Data Networking (NDN) project aims to develop a new Internet architecture that can capitalize on strengths - and address weaknesses - of the Internet's current host-based, point-to-point communication architecture in order to naturally accommodate emerging patterns of communication. By naming data instead of their locations, NDN transforms data into a first-class entity. The current Internet secures the data container. NDN secures the contents, a design choice that decouples trust in data from trust in hosts, enabling several radically scalable communication mechanisms such as automatic caching to optimize bandwidth.'
Janos Haits

LAWA | Longitudinal Analytics of Web Archive Data

  •  
    LAWA will federate distributed FIRE facilities with the rich Web repository of the European Archive, to create a Virtual Web Observatory and use Web data analytics as a use case study to validate our design. The outcome of our work will enable Internet-scale analysis of data, and bring the content aspect of the Internet on the roadmap of Future Internet Research. In four work packages we will extend the open-source Hadoop software by novel methods for wide-area data access, distributed storage and indexing, scalable data aggregation and data analysis along the time dimension, and automatic classification of Web contents.
Janos Haits

TELEIOS | Virtual Observatory Infrastructure for Earth Observation Data

  •  
    Earth observation data have increased considerably over the last decades, with satellite sensors collecting and transmitting back to Earth several terabytes of data per day. This data acquisition rate is a major challenge to existing data management, exploitation and dissemination approaches used by various agencies such as ESA, NASA and European national space agencies. To make the available petabytes of EO data easily accessible by an even larger group of end user applications, TELEIOS will design and implement a Virtual Earth Observatory by building on the following state-of-the-art technologies:
Janos Haits

data.nature.com

  •  
    data.nature.com - the NPG Linked Data Platform. The Linked Data Platform provides access to datasets from NPG published as linked data and made available through SPARQL services. The data are queryable interactively through a form interface and remotely through a service endpoint.
Janos Haits

Narrative Science | We Transform Data Into Stories and Insight

  •  
    Artificial Intelligence. Human Insight. Real Results. There is no shortage of data; in fact, just about every company we talk to is drowning in data. As the volume of data continues to rise exponentially, companies need a better way to use, monetize and understand the data they already have. Narrative Science helps companies leverage their data by creating easy-to-use, consistent narrative reporting, automatically, through our proprietary artificial intelligence technology platform.
Janos Haits

Kaggle, we're making data science a sport

  •  
    Kaggle is a platform for data prediction competitions that allows organizations to post their data and have it scrutinized by the world's best data scientists. In exchange for a prize, winning competitors provide the algorithms that beat all other methods of solving a data crunching problem. Most data problems can be framed as a competition.
Janos Haits

PlanetData

  •  
    "PlanetData aims to establish a sustainable European community of researchers that supports organizations in exposing their data in new and useful ways. The ability to effectively and efficiently make sense out of the enormous amounts of data continuously published online, including data streams, (micro)blog posts, digital archives, eScience resources, public sector data sets, and the Linked Open Data Cloud, is a crucial ingredient for Europe's transition to a knowledge society. It allows businesses, governm"
Janos Haits

LOD2 - Creating Knowledge out of Interlinked Data

  •  
    The Linked Data paradigm has therefore evolved from a practical research idea into a very promising candidate for addressing one of the biggest challenges in the area of intelligent information management: the exploitation of the Web as a platform for data and information integration in addition to document search. To translate this initial success into a world-scale disruptive reality, encompassing the Web 2.0 world and enterprise data alike, the following research challenges need to be addressed: improve coherence and quality of data published on the Web, ...
Janos Haits

data.nasa.gov

  •  
    The Open Data project is part of the NASA Open Government Initiative, and is intended to improve access to NASA data. This data catalog is a continually-growing listing of publicly available NASA datasets.
Janos Haits

The Open Data Handbook

  •  
    This handbook introduces you to the legal, social and technical aspects of open data. It can be used by anyone but is especially useful for those working with government data. It discusses the why, what and how of open data: why to go open, what open is, and how to do open.
Janos Haits

e-LICO Front Page | Data Mining Portal

  •  
    e-LICO: An e-Laboratory for Interdisciplinary Collaborative Research in Data Mining and Data-Intensive Science
Janos Haits

Home | Open Data Portal

  •  
    "The European Union Open Data Portal (EU ODP) gives you access to open data published by EU institutions and bodies. All the data you can find via this catalogue are free to use and reuse for commercial or non-commercial purposes."
Janos Haits

CHB

  •  
    Come work with us: Interested in working with researchers from different disciplines within the Harvard, MIT and Broad community and a unique opportunity to participate in world-class research to make an impact on human health? Come work with us! We are looking for a computational biologist to handle data from a wide variety of experimental methods, focusing on next-gen sequencing technologies. SCDE is live: The Stem Cell Discovery Engine (SCDE) is an integrated platform that allows users to consistently describe, share and compare cancer and tissue stem cell data. It is made up of an online database of curated experiments coupled to a customized instance of the Galaxy analysis engine with tools for gene list manipulation and molecular profile comparisons. The SCDE currently contains more than 50 stem cell-related experiments. Each has been manually curated and encoded using the ISA-Tab standard to ensure the quality of the data and its annotation. The Center for Health Bioinformatics at the Harvard School of Public Health provides consults to researchers for the management, integration and contextual analysis of biological high-throughput data. We are a member of the Center for Stem Cell Bioinformatics, the Environmental Statistics and Bioinformatics Core at the Harvard NIEHS Center for Environmental Health and the Genetics & Bioinformatics Consulting group for Harvard Catalyst and work closely with our colleagues in the Department of Biostatistics and the Program in Quantitative Genomics to act as a single point of contact for computational biology,
Janos Haits

lobid.org

  •  
    lobid.org is the North Rhine-Westphalian Library Service Center's (hbz) Linked Open Data service. The acronym 'lobid' stands for "Linking Open Bibliographic Data". We support the process of creating Linked Open Bibliographic Data out of existing libraries and other associated data.
Janos Haits

WorldCat knowledge base

  •  
    "The WorldCat knowledge base combines data about your library's electronic resources and linking features that enable access to the content and help you manage the workflows associated with these materials. Unlike data in a traditional knowledge base, WorldCat knowledge base data is not tied to a particular application. Knowledge base data is added and maintained in a single place for use with a growing number of OCLC and non-OCLC services."
Janos Haits

World and regional statistics, national data, maps, rankings

  •  
    World Data Atlas. World and regional statistics, national data, maps, rankings. 370M+ time series, 960+ topics, 900+ sources. Take a look at data coverage matrix by country or topic to see the full picture!
thinkahol *

Plastic computer memory device uses spin of electrons to read and write data | KurzweilAI

  •  
    "Researchers at Ohio State University have demonstrated the first plastic computer memory device that utilizes the spin of electrons to read and write data. An alternative to traditional microelectronics, the 'spintronics' device could store more data in less space, process data faster, and consume less power."
Skeptical Debunker

We're so good at medical studies that most of them are wrong

  • Statistical validation of results, as Shaffer described it, simply involves testing the null hypothesis: that the pattern you detect in your data occurs at random. If you can reject the null hypothesis (and science and medicine have settled on rejecting it when there's only a five percent or less chance that it occurred at random), then you accept that your actual finding is significant. The problem now is that we're rapidly expanding our ability to do tests. Various speakers pointed to data sources as diverse as gene expression chips and the Sloan Digital Sky Survey, which provide tens of thousands of individual data points to analyze. At the same time, the growth of computing power has meant that we can ask many questions of these large data sets at once, and each one of these tests increases the prospects that an error will occur in a study; as Shaffer put it, "every decision increases your error prospects." She pointed out that dividing data into subgroups, which can often identify susceptible subpopulations, is also a decision, and increases the chances of a spurious error. Smaller populations are also more prone to random associations. In the end, Young noted, by the time you reach 61 tests, there's a 95 percent chance that you'll get a significant result at random. And, let's face it, researchers want to see a significant result, so there's a strong, unintentional bias towards trying different tests until something pops out. Young went on to describe a study, published in JAMA, that was a multiple testing train wreck: exposures to 275 chemicals were considered, 32 health outcomes were tracked, and 10 demographic variables were used as controls. That was about 8,800 different tests, and as many as 9 million ways of looking at the data once the demographics were considered.
  •  
    It's possible to get the mental equivalent of whiplash from the latest medical findings, as risk factors are identified one year and exonerated the next. According to a panel at the American Association for the Advancement of Science, this isn't a failure of medical research; it's a failure of statistics, and one that is becoming more common in fields ranging from genomics to astronomy. The problem is that our statistical tools for evaluating the probability of error haven't kept pace with our own successes, in the form of our ability to obtain massive data sets and perform multiple tests on them. Even given a low tolerance for error, the sheer number of tests performed ensures that some of them will produce erroneous results at random.
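
The arithmetic behind the figures quoted above can be checked with a short sketch. It assumes the tests are statistically independent and each uses the five percent threshold, which is a simplification the excerpt does not state. The function and variable names are illustrative, and reading "9 million ways" as each of the 10 demographic controls being either included or left out is only one plausible interpretation.

```python
def familywise_error_rate(num_tests: int, alpha: float = 0.05) -> float:
    """Chance of at least one false positive across num_tests tests.

    Assumes the tests are independent and each rejects the null
    hypothesis at significance level alpha.
    """
    return 1.0 - (1.0 - alpha) ** num_tests


# Chance of at least one spurious "significant" result after 61 tests.
chance_61 = familywise_error_rate(61)
print(f"61 tests at alpha=0.05: {chance_61:.1%} chance of a false positive")

# Counts from the JAMA study described in the excerpt.
CHEMICALS, OUTCOMES, CONTROLS = 275, 32, 10

# One test per (chemical, health outcome) pair.
pairwise_tests = CHEMICALS * OUTCOMES

# Assumption: each demographic control is either used or not,
# giving 2**10 control subsets per pairwise test.
analysis_variants = pairwise_tests * 2 ** CONTROLS

print(f"pairwise tests: {pairwise_tests:,}")
print(f"analysis variants: {analysis_variants:,}")
```

Under these assumptions the 61-test figure comes out just above 95 percent, and the two counts match the excerpt's "about 8,800" tests and "as many as 9 million" ways of looking at the data.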
Janos Haits

World and regional statistics, national data, maps, rankings

  •  
    World Data Atlas. World and regional statistics, national data, maps, rankings. 370M+ time series, 960+ topics, 900+ sources. Take a look at data coverage matrix by country or topic to see the full picture!
Janos Haits

Academic Torrents

  •  
    "Currently making 1.67TB of research data available. Sharing data is hard. Emails have size limits, and setting up servers is too much work. We've designed a distributed system for sharing enormous datasets - for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds."