A Dataset is a Worldview - Towards Data Science - 0 views
-
Simon Knight on 07 Apr 20But because a machine learning model learns the boundaries of its world from its input data, just three people informed how any model using that dataset would interpret if 'childbirth' was emotional. This led to a perspective that has informed all of my work since: a dataset is a worldview. It encompasses the worldview of the people who scrape and collect the data, whether they're researchers, artists, or companies. It encompasses the worldview of the labelers, whether they labeled the data manually, unknowingly, or through a third party service like Mechanical Turk, which comes with its own demographic biases. It encompasses the worldview of the inherent taxonomies created by the organizers, which in many cases are corporations whose motives are directly incompatible with a high quality of life.