United States of America¶
- https://en.wikipedia.org/wiki/Federal_government_of_the_United_States
- https://en.wikipedia.org/wiki/List_of_federal_agencies_in_the_United_States
Legislative Branch¶
Senate¶
House of Representatives¶
Executive Branch¶
President¶
Vice President¶
Chief Information Officer¶
see: https://wrdrd.github.io/docs/consulting/knowledge-engineering.html
Chief Data Scientist¶
see:
Chief Technology Officer¶
see: https://wrdrd.github.io/docs/consulting/information-systems.html
General Services Administration¶
18F¶
- Src: https://open-source-program.18f.gov/
- Web: https://github.com/18F/open-source-program
- https://open-source-program.18f.gov/pages/resources/
- 18F’s Open Source Policy: https://github.com/18F/open-source-policy/blob/master/policy.md
- 18F Open Source Style Guide: https://open-source-guide.18f.gov/
See:
- US Digital Service (USDS)
US Web Design Standards¶
Judicial Branch¶
Supreme Court¶
Court of Appeals¶
District Court¶
United States¶
Cool Projects¶
US Digital Service¶
US Digital Service (USDS) is a part of the Executive Branch (President) which provides information technology consultation services for the federal government.
See: * US Digital Services Playbook * 18F
US Digital Services Playbook¶
- Understand what people need
- Address the whole experience, from start to finish
- Make it simple and intuitive
- Build the service using agile and iterative practices
- Structure budgets and contracts to support delivery
- Assign one leader and hold that person accountable
- Bring in experienced teams
- Choose a modern technology stack
- Deploy in a flexible hosting environment
- Automate testing and deployments
- Manage security and privacy through reusable processes
- Use data to drive decisions
- Default to open
Data¶
US Digital Registry¶
U.S. Digital Registry is a database of official US federal government social media accounts.
USAspending.gov¶
USAspending.gov is the publicly accessible, searchable website mandated by the Federal Funding Accountability and Transparency Act of 2006 to give the American public access to information on how their tax dollars are spent.
Performance.gov¶
- https://www.performance.gov/clear_goals
- https://www.performance.gov/agencies
- https://www.performance.gov/node/3406 Climate Change
- https://www.performance.gov/node/3404 STEM Education
- https://www.performance.gov/content/increase-nation%E2%80%99s-data-science-capacity Data Science
- https://www.performance.gov/federalprograminventory
- https://www.performance.gov/api
Data.gov¶
- http://www.data.gov/food
- http://www.data.gov/business
- http://www.data.gov/climate
- http://www.data.gov/consumer
- http://www.data.gov/ecosystems
- http://www.data.gov/education
- http://www.data.gov/energy
- http://www.data.gov/finance
- http://www.data.gov/health
- http://www.data.gov/local
- http://www.data.gov/manufacturing
- http://www.data.gov/ocean
- http://www.data.gov/safety
- http://www.data.gov/research
API.data.gov¶
API.data.gov is an API for US federal government APIs.
Project Open Data¶
- https://project-open-data.cio.gov/v1.1/schema/ (JSON (-> JSON-LD -> RDF))
- https://project-open-data.cio.gov/v1.1/metadata-resources/#field-mappings
- POD, CKAN, DCAT RDF, http://schema.org RDF
ckanext-datajson¶
ckanext-datajson
is a CKAN extension to generate
Project Open Data
Data.gov JSON and JSON-LD from CKAN.
/data.json
/data.jsonld
seeAlso:
CKAN:
REST API Standards¶
DICE¶
Disconnected Interactive Content Explorer (DICE) is an app for iOS, Android, and Windows that allows users to load interactive content generated in HTML, CSS, and Javascript to a mobile device so the device can display interactive content without a network connection.
Apache Accumulo¶
- “Rya” (RDF Linked Data)
Library of Congress¶
- http://catalog.loc.gov/
- http://id.loc.gov/download/ (RDF Linked Data)
5 ★ Linked Open Data¶
- http://www.w3.org/TR/ld-glossary/#x5-star-linked-open-data
- http://www.w3.org/TR/ld-glossary/#linked-data-principles
- http://www.w3.org/TR/ld-glossary/#comma-separated-values-csv (CSV)
- http://www.w3.org/TR/ld-glossary/#rdf (RDF)
- http://www.w3.org/TR/ld-glossary/#rdfa (RDFa)
- http://www.w3.org/TR/ld-glossary/#json (JSON)
- http://www.w3.org/TR/ld-glossary/#json-ld (JSON-LD)
Health¶
Precision Medicine Initiative¶
- More and better treatments for cancer
- Creation of a voluntary national research cohort
- Commitment to protecting privacy
- Regulatory modernization
- Public-private partnerships
#precisionmedicine
Personal Health Agenda¶
Goals¶
Maximize utility of health data (through network effects)
Ask and answer questions to improve outcomes
Support case-history-aware epidemiological meta-analyses [“controlling for factors”, “systematic review”] (PRISMA, QUOROM)
Support “Meta-research” #metaresearch [“workflow”, “process knowledge”]
PLoS meta-research collection: http://collections.plos.org/meta-research
- Twitter: https://twitter.com/PLOS
- Twitter: https://twitter.com/PLOSbiology
“Meta-Research: Broadening the Scope of PLOS Biology” (2016) http://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002334
PloS
#TenSimpleRules
- Homepage: http://collections.plos.org/ten-simple-rules
- Hashtag:
#TenSimpleRules
- Twitter: https://twitter.com/hashtag/TenSimpleRules
- https://wrdrd.github.io/docs/consulting/data-science#ten-simple-rules
#LinkedReproducibility
Objectives¶
- Link available resources together with common identifiers and vocabulary
[“URIs”, “Linked Data”, “Semantic Web”, “RDF”, “RDFa”]
- http://www.w3.org/wiki/HCLSIG/LODD/Data
#LinkedReproducibility
- Add RDFa tags to existing HTML templates [“Linked Data”, “MedicalEntity”]
- Lookup relevant datasets
- Lookup relevant datasets across languages and coding systems
- Encourage sharing of non-PHI data
- Encourage sharing of non-PHI data with units
- Lookup and automatedly analyze relevant datasets (before reading an abstract or a conclusion) with a number of models and random seeds [“blind statistical analysis”, “masked statistical analysis”]
- Develop an RDFS vocabulary for describing study controls and protocols with URIs [“Study Protocol”]
- MedicalStudy, MedicalObservationalStudy, MedicalTrial
- RCT?
- Which groups were masked? (single, double, triple is not sufficient)
- [...]
- Develop RDFS vocabulary predicates for linking between
similar and concurring / discordant reproductions of studies
[“Reproducibility”, “Repeatability”]
- [ ]
- Develop a platform for collaborative systematic review
- Linked Data (RDF)
- OpenAnnotation
- Structured Reviews
#LinkedReproducibility
Healthcare.gov (HHS CMMS)¶
- [ ] TODO: create RDFa vocabulary for health plans
- [ ] TODO: add RDFa to individual plan pages
- [ ] TODO: search engine to index RDFa vocabulary
- [ ] TODO: encourage carriers to add RDFa to describe their servcies
Medline Plus (NIH NLM)¶
- Health Info: http://www.nlm.nih.gov/medlineplus/healthtopics.html
- Drug Info: http://www.nlm.nih.gov/medlineplus/druginformation.html
- http://www.nlm.nih.gov/medlineplus/videosandcooltools.html
- TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
- parse_medline.py: https://github.com/westurner/healthref/blob/gh-pages/parse_medline.py
- [ ] see the schema.org types listed under OpenFDA (FDA)
PubMed (NIH NLM NCBI)¶
MeshRDF¶
https://en.wikipedia.org/wiki/List_of_MeSH_codes
-
“A medical code for the entity, taken from a controlled vocabulary or ontology such as ICD-9, DiseasesDB, MeSH, SNOMED-CT, RxNorm, etc.”
-
PubMed and Schema.org RDF¶
- http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3745940/ (RDF Linked Data)
- TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
ClinicalTrials.gov (NIH NLM)¶
- http://linkedct.org/ (RDF linked data (delayed; third-party))
- TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
UMLS (NIH NLM)¶
- https://github.com/ncbo/umls2rdf (RDF Linked Data)
- TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
National Cancer Institute (NIH NCI)¶
LexEVS (NIH NCI)¶
- https://wiki.nci.nih.gov/display/LexEVS/LexEVS+6.x+OWL+Export+Guide (RDF Linked Data)
- TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
OpenFDA (FDA)¶
https://open.fda.gov/api/reference/ (JSON Data)
- [ ] TODO: http://json-ld.org/
@context
-> RDF Linked Data
- [ ] TODO: http://json-ld.org/
[ ] TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
- [ ] http://schema.org/MedicalWebPage
- [ ] http://schema.org/MedicalDevice
- [ ] http://schema.org/MedicalTherapy
- [ ] http://schema.org/MedicalCondition
- [ ] http://schema.org/MedicalIndication
- [ ] http://schema.org/MedicalContraindication
- [ ] http://schema.org/adverseOutcome
- [ ] http://schema.org/seriousAdverseOutcome
- [ ] http://schema.org/MedicalSignOrSymptom
[x] BLD: Dockerfiles for testing
[ ] ENH: Adverse Event Count / Prescription Count Heatmap
HealthData.gov (HHS)¶
- http://www.healthdata.gov/dataset/search
- http://hub.healthdata.gov/
- http://hub.healthdata.gov/data.json
- [ ] TODO: http://schema.org/docs/meddocs.html (RDFa Linked Data)
- seeAlso: Data.gov
GNUHealth¶
GNU Health is a Free Health and Hospital Information System
- Electronic Health Record
- Hospital Information System
- Health Information System
.
- Open Source
- Written in Python
- PostgreSQL Database
- Desktop Client: Windows / OSX / Linux (GTK)
- Android Client
- Web Interface
EPA¶
- http://www.epa.gov/datafinder/
- http://www.epa.gov/airdata/
- http://aqsdr1.epa.gov/aqsweb/aqstmp/airdata/download_files.html (CSV Data)
- TODO: http://www.w3.org/TR/csv2rdf/ + Metadata -> RDF Linked Data
- http://aqsdr1.epa.gov/aqsweb/aqstmp/airdata/download_files.html (CSV Data)
MyPlate (USDA)¶
Science & Technology¶
OSTP¶
Office of Science and Technology Policy
OpenStack¶
OpenStack is an Open Source cloud infrastructure platform started as a public-private partnership between NASA and Rackspace.