Skip to content

Pillbox data downloads

David Hale edited this page Aug 9, 2013 · 13 revisions

Pillbox is a resource of the U.S. National Library of Medicine, part of the National Institutes of Health, U.S. Department of Health and Human Services. Pillbox is a United States government resource. This document will be moved to an official Pillbox repo in August 2013.

Overview

Pillbox is a database of human prescription, over-the-counter, homeopathic, and veterinary oral solid dosage medications (pills) marketed in the United States of America. This data set contains information about pills such as how they look, their ingredients, and other criteria. This data can be used to identify unknown pills based on their physical appearance. It also answers questions like “What pills contain acetaminophen?” or “What pills contain lactose as an inactive ingredient?” The data contain unique identifiers, such as the RXCUI and FDA product code.

WARNINGREAD BEFORE YOU DOWNLOAD

Pillbox’s data is created by combining drug information resources from the Food and Drug Administration (FDA) and National Library of Medicine (NLM) at the National Institutes of Health. This information has been reformatted to make it easier to work with but has not been verified by FDA or NLM. The information available for download may not be the labeling on currently distributed products or identical to the labeling that is approved. NLM makes no warranty that the data is error free.

Corrections to data

Physical appearance data (imprint, color, shape, size, score) for records which have an image have been checked against that images. In cases where the physical characteristics data has been found to be in error, data have been changed to the correct value. A table of changes can be downloaded in the Support data section.

Disclaimer

The pill images and accompanying data available here were obtained from products acquired from a licensed pharmacy or the product manufacturer. Manufacturers may alter the appearance (e.g., shape, color, size, markings) of medications over time.

The same medication may have been issued with a different appearance and/or different accompanying data before or after the date NLM acquired it. NLM would like to hear about any changes in medication appearance or possible errors in accompanying information. Please contact [email protected] if you notice any discrepancies in the information provided here.

Reference in this Web site to any specific commercial product, process, service, manufacturer, or company does not constitute its endorsement or recommendation by the U.S. government or the U.S. Department of Health and Human Services or any of its agencies.

Neither the U.S. government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal responsibility for the accuracy, completeness, or usefulness of any information disclosed.

Terms of Use

Use of this data is subject to the Terms of Service displayed on the Pillbox API/Data page.

Pill images

Pillbox currently contains 2,159 pill images. While all of images provided by Pillbox were created through a National Library of Medicine/Food and Drug Administration partnership, most are not yet part of the FDA Structured Product Labels (drug labels) and have not yet been verified by the manufacturer/labeler.

Images are not yet available as a single download but will be sometime in August 2013.

Data downloads

Master data

pillbox_[date] Master Pillbox data table. This is the tables that powers Pillbox. Currently, this table also includes legacy fields, which are unused. See pillbox schema for more information.

(tab-separated) pillbox_20130808.tab – MD5 checksum efa8cf4c2ad691f1f1073faa093b046c
(xml) pillbox_20130808.xml":http://pillbox.nlm.nih.gov/data/pillbox_20130808.xml – MD5 checksum 3685cffda156e04aab1aa7e60f9fd3c4

Support data

trade_dress_change_log_[date] This table lists every change made in the physical appearance data in pillbox_[date] based on a review of the pill images currently available. See trade_dress_change_log schema for more information.

(tab-separated) trade_dress_change_log_20130808.tab – MD5 checksum 8c4670f88d5ed47b567404105d200eac
(xml) trade_dress_change_log_20130808.xml – MD5 checksum 21af8d17f08055e99247979c7f297239

Lookup tables which contain the FDA codes for color, shape, and DEA schedule

(tab-separated)
SPL_color_lookup.tab – MD5 checksum 0fb6c979d8d534fae99efa4cead136ae
SPL_DEA_lookup.tab – MD5 checksum b65547eca5fbc206063411d994379bd1
SPL_shape_lookup.tab – MD5 checksum 26ac9ccc4af858e2d1ee221ba46ec04a

(xml)
xml files containing the FDA codes and values for color, shape, and DEA schedule are included in the FDA Terminology Validation Files download.

Download all data as MySQL

pillbox_full_[date] is a single .sql file containing the master Pillbox data table, trade dress change log, and lookup tables for color, shape, and DEA schedule

(sql) pillbox_full_20130808.sql – MD5 checksum f466340b17f34b7b7040ca93036c41e7