“Although data scientists can gain great insights from large data sets — and can ultimately use these insights to tackle major challenges — accomplishing this is much easier said than done. Many such efforts are stymied from the outset, as privacy concerns make it difficult for scientists to access the data they would like to work with. In a paper presented at the IEEE International Conference on Data Science and Advanced Analytics, members of the Data to AI Lab at the MIT Laboratory for Information and Decision Systems (LIDS) Kalyan Veeramachaneni, principal research scientist in LIDS and the Institute for Data, Systems, and Society (IDSS) and co-authors Neha Patki and Roy Wedge describe a machine learning system that automatically creates synthetic data — with the goal of enabling data science efforts that, due to a lack of access to real data, may have otherwise not left the ground. While the use of authentic data can cause significant privacy concerns, this synthetic data is completely different from that produced by real users — but can still be used to develop and test data science algorithms and models.”
Related Content
Related Posts:
- Cobalt-free batteries could power cars of the future
- Researchers 3D print components for a portable mass spectrometer
- A blueprint for making quantum computers easier to program
- MIT researchers discover “neutronic molecules”
- MIT scientists tune the entanglement structure in an array of qubits
- New software enables blind and low-vision users to create interactive, accessible charts
- Researchers 3D print key components for a point-of-care mass spectrometer
- Self-powered sensor automatically harvests magnetic energy
- This 3D printer can figure out how to print with an unknown material
- With inspiration from “Tetris,” MIT researchers develop a better radiation detector