
Home/ Groups/ Public Service Internet

[1607.06520] Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Emb... - 0 views

  •  
    The blind application of machine learning runs the risk of amplifying biases present in data. Such a danger is facing us with word embedding, a popular framework to represent text data as vectors which has been used in many machine learning and natural language processing tasks. We show that even word embeddings trained on Google News articles exhibit female/male gender stereotypes to a disturbing extent. This raises concerns because their widespread use, as we describe, often tends to amplify these biases. Geometrically, gender bias is first shown to be captured by a direction in the word embedding. Second, gender neutral words are shown to be linearly separable from gender definition words in the word embedding. Using these properties, we provide a methodology for modifying an embedding to remove gender stereotypes, such as the association between the words receptionist and female, while maintaining desired associations such as between the words queen and female. We define metrics to quantify both direct and indirect gender biases in embeddings, and develop algorithms to "debias" the embedding. Using crowd-worker evaluation as well as standard benchmarks, we empirically demonstrate that our algorithms significantly reduce gender bias in embeddings while preserving its useful properties such as the ability to cluster related concepts and to solve analogy tasks. The resulting embeddings can be used in applications without amplifying gender bias.
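The geometric idea in the abstract above (bias captured by a direction, then removed from gender-neutral words) can be illustrated with a small sketch. This is not the paper's implementation: the 3-dimensional vectors are invented toy values, the direction is estimated from a single she/he pair as a simplifying assumption, and the helper names `gender_direction` and `neutralize` are hypothetical.

```python
import math

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]

def gender_direction(she, he):
    """Unit vector along the difference of one definitional pair (assumption: one pair only)."""
    return normalize([a - b for a, b in zip(she, he)])

def neutralize(word, direction):
    """Remove the component of `word` along `direction`, then renormalize."""
    proj = dot(word, direction)
    return normalize([w - proj * d for w, d in zip(word, direction)])

# Toy 3-d vectors, invented for illustration only.
she = normalize([1.0, 0.2, 0.1])
he = normalize([-1.0, 0.2, 0.1])
receptionist = normalize([0.4, 0.9, 0.1])

g = gender_direction(she, he)
debiased = neutralize(receptionist, g)

bias_before = dot(receptionist, g)  # nonzero: the word leans along the direction
bias_after = dot(debiased, g)       # approximately zero after projection
```

Words that are gendered by definition, such as queen, would simply be left out of the `neutralize` step so that their association is preserved.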

Facebook's Secret Censorship Rules Protect White Men… - ProPublica - 0 views

  •  
    "A trove of internal documents sheds light on the algorithms that Facebook's censors use to differentiate between hate speech and legitimate political expression."

Google 'fixed' its racist algorithm by removing gorillas from its image-labeling tech -... - 0 views

  •  
    "Nearly three years after the company was called out, it hasn't gone beyond a quick workaround"

TwArχiv - 0 views

  •  
    Twitter archives are a rich source of data for doing research into numerous things: Learning about social media and interaction networks, gaining insights into movement patterns based on geolocations and even doing sentiment analysis based on the tweets. And the best part of it: Unless you have a protected Twitter account this data is already public. So why not share it? The TwArχiv takes in your Twitter archive and generates interesting visualizations from your own tweets, including tweet volume over time and your interaction/movement patterns.

Home - Open Humans - 0 views

  •  
    Open Humans is a platform that allows you to upload, connect, and privately store your personal data - such as genetic, activity, or social media data. Once you've added data, you can donate it: you might choose to share some publicly, and you can join and contribute to diverse research projects. Thus, we turn the traditional research pipeline on its head: you are at the center and in control of when you share your data. We want to empower you to explore your data

Mozilla IoT - Gateway - 0 views

  •  
    Build your own web of things gateway

The House That Spied on Me - 0 views

  •  
    "In December, I converted my one-bedroom apartment in San Francisco into a "smart home." I connected as many of my appliances and belongings as I could to the internet: an Amazon Echo, my lights, my coffee maker, my baby monitor, my kid's toys, my vacuum, my TV, my toothbrush, a photo frame, a sex toy, and even my bed."

Data Sense - 0 views

  •  
    "Data Sense is a research experiment at Intel Labs. We wanted to see if it is possible to make data more accessible to those of us without stats degrees. To test out some ideas, we built this tool. "

SoLiD - Read Write Web Community Group - 0 views

  •  
    "SoLiD is a proposed set of conventions for building decentralized social applications on the Linked Data stack. SoLiD is modular and extensible. It relies as much as possible on existing W3C standards. SoLiD applications are somewhat like multiuser applications where instances talk to each other through a shared filesystem, and the Web is that filesystem. "

Tomorrow's BBC will be fitted to your personality | openDemocracy - 1 views

  •  
    "The BBC is doing cutting-edge research into Visual Perceptive Media, virtual reality and facial coding technologies. But do we want our shows to be tailored to our age, gender, and tastes? And what happens to all that data?"

pickhardt/betty: Friendly English-like interface for your command line. Don't remember ... - 0 views

  •  
    Betty is a friendly English-like interface for your command line.

Mycroft - Open Source Voice Assistant - Mycroft - 0 views

  •  
    Open source, privacy-aware alternative to Google Home & Amazon Alexa IoT devices, and much more

Probing the Dark Side of Google's Ad-Targeting System - MIT Technology Review - 0 views

  •  
    Researchers say Google's ad-targeting system sometimes makes troubling decisions based on data about gender and other personal characteristics.

AI Principles - Future of Life Institute - 0 views

  •  
    These principles were developed in conjunction with the 2017 Asilomar conference

South Park trolled Amazon Echo owners in the best way possible | TechCrunch - 0 views

  •  
    "This is one of the minor dangers of owning an Amazon Echo. Most anyone can activate the device by just saying the wake word, including Cartman. And in the season opener of South Park, that's exactly what happened. If someone watched that episode of South Park in the same room as an Echo, their Amazon shopping list was filled with random, gross items. This example shows the potential danger of having a voice-activated shopping assistant. It's easy to imagine a potential rogue advertisement, online or elsewhere, that could, in theory, say the right words to order a particular product - like a South Park box set."

Patient Home Monitoring Service Leaks Private Medical Data O - 0 views

  •  
    Kromtech Security Researchers have discovered another publicly accessible Amazon S3 repository. This time it contained medical data in 316,363 PDF reports in the form of weekly blood test results. Many of these were multiple reports on individual patients. It appears that each patient had weekly test results totaling around 20 files each. That would still be an estimated 150,000+ people affected by the leak.

Troy Hunt: What Would It Look Like If We Put Warnings on IoT Devices Like We Do Cigaret... - 0 views

  •  
    A couple of years ago, I was heavily involved in analysing and reporting on the massive VTech hack, the one where millions of records were exposed including kids' names, genders, ages, photos and the relationship to parents' records which included their home address. Part of this data was collected via an IoT device called the InnoTab which is a wifi connected tablet designed for young kids; think Fisher Price designing an iPad... then totally screwing up the security.

Sex toy company admits its vibrators 'secretly recorded intimate sessions' | The Indepe... - 1 views

  •  
    Sex toy company admits its vibrators secretly recorded intimate sessions

Google admits it tracked user location data even when the setting was turned off - The ... - 0 views

  •  
    "Android phones gather your location data and send it to Google, even if you've turned off location services and don't have a SIM card"