Removing barriers to data innovation
Empowering people and organizations to share and use data more effectively
Sharing data can help address some of society’s biggest challenges and help individuals and organizations be more innovative, efficient, and productive. Today, however, it is often difficult to determine how best to share data from both a legal and technical perspective. To make this easier, we’re sharing a set of draft agreements, each designed to address a specific data sharing challenge. We look forward to receiving feedback and taking additional steps to help individuals and organizations share data with confidence.
- LinkedIn Articles: Read our brief articles on the importance of responsible data sharing
- Backgrounder & FAQ: Read more about our perspective on data sharing licensing and governance
- Data Use Agreements: Access the data use agreements
- Data Sharing Projects: Explore projects using open data and data sharing models for societal benefit and business innovation
- Other Resources: Find links to resources that support open data and data sharing
LinkedIn Articles
Enabling data use through the power of community
For those of you who have been following my posts over the last several months, you will hopefully see the outlines of a multi-pronged effort to bring more attention to the importance of open data and to improve the tools for advancing AI research. Today, I’d like to focus attention on the area of data use and governance.
Getting to Common Terms for Data Sharing
Sharing data between organizations can help address some of society’s biggest challenges. There is no question that working with data in any fashion requires careful consideration of privacy and security risks, but there are clear benefits to be gained when done appropriately.
Data Use Agreements
Open Use of Data Agreement (O-UDA)
README | Annotated Agreement | Agreement
Open Use of Data Agreement GitHub repo
Computational Use of Data Agreement (C-UDA)
README | Annotated Agreement | Agreement
Computational Use of Data Agreement GitHub repo
Data Use Agreement for Open AI Model Development (DUA-OAI)
README | Annotated Agreement | Agreement
Data Use Agreement for Data Commons (DUA-DC)
README | Annotated Agreement | Agreement
Send your feedback to datainno@microsoft.com.
Data Sharing Projects
LinkedIn - World Bank Partnership
LinkedIn and the World Bank partnered to build a digital dashboard that visualizes LinkedIn’s labor market data for over 100 World Bank member countries.
Answer ALS
Answer ALS is the single largest collaborative ALS research project, working to build a comprehensive collection of data on people living with the disease to help researchers discover a cure.
Hosting the World’s Leading Environmental Data Sets
In April 2019, Microsoft committed to hosting the world’s leading environmental data sets on Azure to accelerate the work of grantees and researchers around the world.
Open Data Initiative with Adobe, Microsoft and SAP
The Open Data Initiative provides a platform for a single, comprehensive view of business data for analysis.
Other Resources
Find links to resources that support open data and data sharing.
Licensing, Governance, and Legal Tools
- Creative Commons licenses
- The Linux Foundation’s Community Data License Agreement licenses
- Open Data Commons licenses
- DataSAR: Data Sharing Agreement Repository
- The OBO Foundry’s Data Use Ontology
- Governance Pathways Data Use Agreement Library
Open Data Repositories
- Microsoft Azure Open Datasets
- Microsoft Research Open Data
- Awesome Public Datasets
- BotSpeak Database
- Bureau of Labor Statistics
- CodeSearchNet Corpus
- Data.gov
- Harvard Dataverse
- HealthData.gov
- Open Data on GitHub
- Open Images
- Open ML
- Public Data Sets for Testing and Prototyping on Azure
- Quora list of datasets
- Stanford Large Network Dataset Collection
- Store-in-a-Box Solution Accelerator with product image collection
- Towards AI’s Public Datasets for Machine Learning
- Udacity Self-Driving Car Dataset
- University of California-Irvine Machine Learning Repository
- USAFacts
- Wide World Importers Sample Database
Recent News
- “Open government data – more critical than ever.” Andersen, Erich. Microsoft On the Issues, November 14, 2019.
- “Cancer researchers embrace AI to accelerate development of precision medicine.” Roach, John. Microsoft AI Blog, October 27, 2019.
- “New AI neural network approach detects heart failure from a single heartbeat with 100% accuracy.” University of Surrey. Medical Xpress, September 10, 2019.
- “Microsoft invests $1 billion in OpenAI, vows to build AI tech platform of ‘unprecedented scale’.” Bishop, Todd. GeekWire, July 22, 2019.
- “The University Health Network leverages Microsoft Azure and AI to bridge the gap between cancer research and treatment.” Peesker, Kevin. Microsoft News Center Canada, July 9, 2019.
- “DoD’s Joint AI Center to open-source natural disaster satellite imagery data set.” Johnson, Khari. VentureBeat, June 23, 2019.
- “Opinion: For a Longer, Healthier Life, Share Your Data.” Miner, Luke. New York Times, May 22, 2019.
- “Rebuilding an icon: a call for the sharing of data to help restore Notre Dame.” Dedezade, Esat. Microsoft News Centre Europe, May 20, 2019.
- “Technology can help doctors spend more time with patients, Health Secretary told, as UK embraces AI in healthcare.” Dedezade, Esat. Microsoft News Centre UK, May 7, 2019.