Skip to main content

Home/ Groups/ Apache Hadoop
francis hou

PoweredBy - Hadoop Wiki - 2 views

  • We use Hadoop to process clickstream and demographic data in order to create web analytic reports.
  • Our cluster runs across Amazon's EC2 webservice
  • We use Hadoop to develop MapReduce algorithms: Information retrieval and analytics Machine generated content - documents, text, audio, & video Natural Language Processing Project portfolio includes: * Natural Language Processing Mobile Social Network Hacking Web Crawlers/Page scrapping Text to Speech Machine generated Audio & Video with remuxing Automatic PDF creation & IR
  • ...1 more annotation...
  • We use Hadoop to process web clickstream, marketing, CRM, & email data in order to create multi-channel analytic reports. Our cluster runs on Amazon's EC2 webservice and makes use of Python for most of our codebase.
  •  
    了解各球產業及公司如何應用HADOOP. This page documents an alphabetical list of institutions that are using Hadoop for educational or production uses. Companies that offer services on or based around Hadoop are listed in Distributions and Commercial Support. Please include details about your cluster hardware and size. Entries without this may be mistaken for spam references and deleted.
francis hou

Hadoop and MapReduce: Big Data Analytics - 3 views

  •  
    Gartner的HADOOP報告, 提及未來HADOOP的應用及如何建構整個BIG DATA系統。
francis hou

ManyEyes - 2 views

  •  
    可參考未來設計資料圖像。 IBM BigSheets builds on the spreadsheet interaction model to support data gathering, exploration, and processing. Various visualization tools are supported, including IBM Many Eyes, which supports a number of visualizations including network diagrams, scatter plots and matrix charts to see relationships among data points; bar chart, histograms and bubble charts to compare sets of values; and word trees and tag clouds to analyze text.
francis hou

基于 Apache Mahout 构建社会化推荐引擎 - 3 views

  •  
    推荐引擎利用特殊的信息过滤(IF,Information Filtering)技术,将不同的内容(例如电影、音乐、书籍、新闻、图片、网页等)推荐给可能感兴趣的用户。通常情况下,推荐引擎的实现是通过将用户的个人喜好与特定的参考特征进行比较,并试图预测用户对一些未评分项目的喜好程度。参考特征的选取可能是从项目本身的信息中提取的,或是基于用户所在的社会或社团环境.......
1 - 5 of 5
Showing 20 items per page