Date Range

2019-2025

Last Updated

Source

Various. See Methods and Notes.

Download

[.csv]

This dataset provides a historical list (2019-2025) of apps and software applications featured across public app galleries from several commercial and non-commercial sources, including EHR vendors (see “Methods and Notes” for source details.) We developed an automated, programmed process that gathers data from these sources, documenting the date of each pull and relevant details about the apps and software featured at that source at the time of the pull.  The individual snapshots of these apps and software include the application name, developer name, description of the application, and categories used to sort the application by the source. The dataset includes snapshots beginning in late 2019 and through August 1, 2025 (the most recent data available.) As detailed in “Methods and Notes” some light data clean-up was conducted, but largely the information included in the dataset reflect the information provided publicly by these sources without changes or revisions.

See “Methods and Notes” for further information about the methodology used to assemble this data, as well as information on changes to data sources and coding over time.

 

Data Field Data Description Additional Information
source Data source for the app or software and its related details. See "Notes" for more information on the data sources.
appName App or software application name as displayed at the listed "source" and at the time of the data pull. See "date_accessed".
appName_clean Clean name of app or software application name as displayed at the listed "source" and at the time of the data pull. See "date_accessed". See "Notes" for more information on cleaning process.
devName Developer name of app or software application name as displayed at the listed "source" and at the time of the data pull. See "date_accessed".
devName_clean Clean developer name of app or software application name as displayed at the listed "source" and at the time of the data pull. See "date_accessed". See "Notes" for more information on cleaning process.
description Text description of app or software application name as displayed at the listed "source" and at the time of the data pull. See "date_accessed".
categories Categories associated with the app or software as discovered at the listed "source" and at the time of the data pull. See "date_accessed".
users Users associated with the app or software as discovered at the listed "source" and at the time of the data pull. See "date_accessed".
website Website associated with the app or software as discovered at the listed "source" and at the time of the data pull. See "date_accessed".
date_accessed The date the information about the app or software application was pulled from the listed "source".
year "Year" of data associated with the data pull. This can be used to sort listings and is not often a perfect match to "date_accessed" as the data was accessed in January 2021, but we consider the data as of "2020".

Data sources

Commercial sources

Allscripts (now Veradigm): Appears as “allscripts” in the source field. Information on the apps and software that integrate with Allscripts/Veradigm from 2019-2022 discovered here: https://expo.allscripts.com/Developers, and from 2023-2025 discovered here: https://expo.veradigm.com/Developers.

Athenahealth: Appears as “athena” in the source field. Information on apps and software that integrate with athenahealth from 2019-2025 discovered here: https://marketplace.athenahealth.com/.

Cerner: Appears as “cerner” in the source field. Information on the apps and software that integrate with Cerner from 2019-2022 discovered here: https://code.cerner.com/apps. No data after 2022. More recent data is discoverable here: https://cloudmarketplace.oracle.com/marketplace/product/health, but the site does not facilitate programmed data pulls.

Epic: Appears as “epic” in the source field. Information on the apps and software that integrate with Epic from 2019-2023 discovered here: https://apporchard.epic.com/, and from 2024-2025 discovered here: https://vendorservices.epic.com.

eMDs: Appears as “emds” in the source field. Information on the apps and software that integrate with eMDs from 2021-2025 discovered here: https://marketplace.emds.com/. Data collection for this source began in 2021.

Greenway: Appears as “greenway” in the source field. Information on the apps and software that integrate with Greenway from 2021-2025 discovered here: https://www.greenwayhealth.com/marketplace. Data collection for this source began in 2021.

NextGen: Appears as “nextgen” in the source field. Information on the apps and software that integrate with NextGen from 2021-2025 discovered here: https://www.nextgen.com/marketplace. Data collection for this source began in 2021.

Non-commercial sources

1up Health: Appears as “1uphealth” in the source field. Information on apps and software searchable in the 1up Health App Gallery for years 2021-2022 discovered here: https://1up.health/health-apps/gallery. No other years of data available.

Bluebutton: Appears as “bluebutton” in the source field. Information on apps and software searchable in the Blue Button App Gallery for years 2021-2025 discovered here: https://api.bluebutton.cms.gov/. Data collection for this source began in 2021.

CARIN: Appears as “carin” in the source field. Information on apps and software searchable in the CARIN Alliance App Gallery for years 2021-2025 discovered here: https://www.myhealthapplication.com/#apps. Data collection for this source began in 2021.

SMART: Appears as “smart” in the source field. Information on apps and software searchable in the SMART App Gallery discovered here: https://apps.smarthealthit.org/apps.

Construction of the file

R statistical software was used to program and automate the data pulls and web scraping of the public websites, as provided above for the specific years. The specific R packages used were: (1) rvest, (2) httr, and (3) rjson. The program either queries publicly available APIs or scrapes the hypertext markup language (HTML) of the website for this data. The data are then structured in a uniform format, aggregating data across all sources. Data collection for this study began in December 2019 and is current through August 2025.

Minor adjustments were made to programmatic scripts over time to respond to changes to the layout or design of some of the public websites over time.

Some minor data cleaning was completed to remove characters and reformat to facilitate data ingestion and analysis.