Skip to main content

Home/ INF 6107 Web social/ Group items tagged extracting

Rss Feed Group items tagged

Harry Sahyoun

Collective Knowledge Systems: Where the Social Web meets the Semantic Web - 1 views

  • Collective Knowledge Systems: Where the Social Web meets the Semantic Web
  • What can happen if we combine the best ideas from the Social Web and Semantic Web?
  • The Vision of Collective Intelligence
  • ...20 more annotations...
  • The Social Web is represented by a class of web sites and applications in which user participation is the primary driver of value.
  • Collective intelligence is a grand vision, one to which I subscribe.  However, I would call the current state of the Social Web something else: collected intelligence.   That is, the value of these user contributions is in their being collected together and aggregated into community- or domain-specific sites
  • The grand challenge is to boost the collective IQ of organizations and of society
  • With the rise of the Social Web, we now have millions of humans offering their knowledge online, which means that the information is stored, searchable, and easily shared.  The challenge for the next generation of the Social and Semantic Webs is to find the right match between what is put online and methods for doing useful reasoning with the data.  True collective intelligence can emerge if the data collected from all those people is aggregated and recombined to create new knowledge and new ways of learning that individual humans cannot do by themselves.
  • Technology can augment the discovery and creation of knowledge. For instance, some drug discovery approaches embody a system for learning from models and data that are extracted from published papers and associated datasets.  By assembling large databases of known entities relevant to human biology, researchers can run computations that generate and test hypotheses about possible new therapeutic agents.
  • The first approach is to expose the structured data that already underlies the unstructured web pages.  An obvious technique is for the site builder, who is generating unstructured web pages from a database, to expose the structured data in those pages using standard formats.
  • the second approach, to extract structured data from unstructured user contributions [2] [28] [39] .  It is possible to do a reasonable job at identifying people, companies, and other entities with proper names, products, instances of relations you are interested in (e.g., person joining a company) [1] [7] , or instances of questions being asked [24] . There also techniques for pulling out candidates to use as classes and relations, although these are a bit noisier than the directed pattern matching algorithms [8] [23]  [31] [32] [36] [38] [42]
  • Tomorrow, the web will be understood as an active human-computer system, and we will learn by telling it what we are interested in, asking it what we collectively know, and using it to apply our collective knowledge to address our collective needs.
  • The other major area where Semantic Web can help achieve the vision of collective intelligence is in the area of interoperability.  If the world's knowledge is to be found on the Web, then we should be able to use it to answer questions, retrieve facts, solve problems, and explore possibilities. 
  • In a sense, the TagCommons project is attempting to create a platform for interoperability of social web data on the Semantic Web that is akin to the "mash-up" ecology that is celebrated in Web 2.0.
  • An example of how a system might apply some of these ideas is RealTravel.  RealTravel is an example of "Web 2.0 for travel".  It attracts travelers to share their experiences: sharing their itineraries, stories, photographs, where they stayed, what they did, and their recommendations for fellow travelers.  Writers think of RealTravel as a great platform to share their experiences -- a blog site that caters to this domain.  People who are planning travel use the site as a source of information to research their trip,
  • The collection of tags for a site is called the folksonomy, which is useful data about collective interests.
  • like many Web 2.0 sites, combines these structured dimensions to order the unstructured content.  For example, one can find all the travel blogs about diving, sorted by rating.  In fact, the site combines all of the structured dimensions into a matrix, which offers the user a way to "pivot browse" along any dimension from any point in the matrix.
  • This paper argues that the Social Web and the Semantic Web should be combined, and that collective knowledge systems are the "killer applications" of this integration.  The keys to getting the most from collective knowledge systems, toward true collective intelligence, are tightly integrating user-contributed content and machine-gathered data, and harvesting the knowledge from this combination of unstructured and structured information.
  • Structured and unstructured, formal and informal -- these are not new dimensions.  They are typically considered poles of a continuum.
  • We are beginning to see companies launching services under the banner of Web 3.0 [25] that aim explicitly at collective intelligence.  For instance, MetaWeb [35] is collecting a commons of integrated, structured data in a social web manner, and Radar Networks [25] is applying semantic web technologies to enrich the applications and data of the social web.
  • The third approach is to capture structured data on the way into the system.  The straightforward technique is to give users tools for structuring their data, such as ways of adding structured fields and making class hierarchies.
    • Harry Sahyoun
       
      Folksonomies_Semantic_Collectivities Web2_To_Web3
    • Harry Sahyoun
       
      3-étoiles
    • Harry Sahyoun
       
      Activité-A
  •  
    Technology can augment the discovery and creation of knowledge. For instance, some drug discovery approaches embody a system for learning from models and data that are extracted from published papers and associated datasets. By assembling large databases of known entities relevant to human biology, researchers can run computations that generate and test hypotheses about possible new therapeutic agents
dumontjose

Analytics and Predictive Models for Social Media - 0 views

  • Analytics & Predictive Models for Social Media
  • Part 1: Information flow in social media (slides) Collecting social media data Extracting and tracking the flow of relevant information Correcting for the effects of missing and incomplete data Predicting and modeling the flow of information Identifying networks of information flow Part 2: Rich user interactions (slides) Predicting and recommending links in network Modeling tie strenght Modeling trust and distrust, frieds and foes How users evaluate one another and the social media content
  •  
    Tutoriel intéressant de l'université Stanford sur les modèles prédictifs pour les médias sociaux
Harry Sahyoun

Folksonomies et communautés de partage de signets Vers de nouvelles st... - 1 views

  • Les folksonomies peuvent constituer une alternative aux moteurs de recherche en permettant la construction de parcours et la mise en réseau d'informations mais aussi de personnes.
  • Pour beaucoup d’usagers, la recherche d’information est devenue synonyme de moteur de recherche voire de  « googlisation ». Cependant il existe désormais des alternatives à ce quasi monopole via notamment les folksonomies, mot composé par Thomas Vander Wal à partir de folk et de taxonomy et qui définit la possibilité offerte à l’usager d’ajouter des mots-clés à des ressources. Leurs stratégies de recherche diffèrent de la traditionnelle médiation des moteurs. Nous nous sommes interrogés sur la pertinence des folksonomies et leur intérêt réel dans le cas de la recherche d’information. En effet, la démarche « folksonomique » diffère de la simple requête et suppose d’autres habiletés. Cette analyse s’inscrit dans nos recherches sur les habiletés informationnelles ( information literacy ) Nos travaux sur ces nouveaux modes de partage et recherche d’informations s’appuient sur de nombreux tests des différentes plateformes permettant l’intégration de mots-clés qualifiés de tags. Nous nous sommes tout particulièrement ici appuyés sur les systèmes de partages de signets parfois appelés « marque-pages sociaux », voire signets sociaux (social bookmarks). Nous avons pu d’ailleurs constater que les différents sites d’intégration de favoris ont veillé à l’interopérabilité de leur système le plus souvent en utilisant le format xml. Une possibilité qui n’était pas offerte sur tous les sites il y a encore un an
  • Faut-il pour autant voir dans ces systèmes un concurrent  potentiel des moteurs ? Nous songeons plutôt à les considérer comme des alternatives au sens de cheminements de recherche différents qui nécessitent une construction et non une logique de push
  • ...10 more annotations...
  • Les moteurs de recherche manquent souvent de pertinence du fait qu’ils reposent sur un tri effectué par un robot. Les folksonomies reposent quant à elles sur une médiation humaine, il est vrai imparfaite.
  • Il est possible d’identifier des « folksonomistes » que l’usager perçoit comme référence ce qui permet facilement ainsi de réaliser de la veille collaborative.
  •  " Combien vous vous trompez, mortels, en voyant dans ce trompeur édifice une tromperie qui veut vous égarer (…) Même si les chemins sont parfois parsemés d’embûches et semblent constituer de mauvaises directions, les folksonomies sont plus fidèles au cheminement hypertextuel. Cependant elles nécessitent un apprentissage voire une tag literacy
  • Les communautés virtuelles pour ne pas les nommer « collèges invisibles » devenant ainsi le socle des folskonomies
  • Trailfire[8]qui permet à l’usager de créer des parcours de sites web avec annotations inclues sur la page. Chaque parcours recevant un tag. Les possibilités pédagogiques sont ici assez évidentes puisque cette technologie permet l’insertion de billets explicatifs ou de commentaires à n’importe quel endroit de la page.
  • Les folksonomies sont parfois critiquées du fait que tous les folksonomistes ne sont pas tous des « gentlemen »
  • la participation à des groupes thématiques de veille ou tout au moins à une volonté de mettre ses découvertes à disposition des autres.
    • Harry Sahyoun
       
      folksonomies_Désintermédiarisation_alternative_moteurs_Par_personnes
    • Harry Sahyoun
       
      2-étoiles
    • Harry Sahyoun
       
      Activité-A
  •  
    Technology can augment the discovery and creation of knowledge. For instance, some drug discovery approaches embody a system for learning from models and data that are extracted from published papers and associated datasets. By assembling large databases of known entities relevant to human biology, researchers can run computations that generate and test hypotheses about possible new therapeutic agents
Ramzi Sleilaty

Please respect FT.com's ts&cs and copyright policy which allow you to: share links; cop... - 1 views

  •  
    The message for China from Tahrir Square
1 - 5 of 5
Showing 20 items per page