tableone: An open source Python package for producing summary statistics for research papers
Author(s)
Pollard, Tom Joseph; Johnson, Alistair Edward William; Raffa, Jesse D; Mark, Roger G
DownloadPublished version (375.5Kb)
Terms of use
Metadata
Show full item recordAbstract
Objectives:In quantitative research, understanding basic parameters of the study population is key for interpre-tation of the results. As a result, it is typical for the first table (“Table 1”) of a research paper to include summarystatistics for the study data. Our objectives are 2-fold. First, we seek to provide a simple, reproducible methodfor providing summary statistics for research papers in the Python programming language. Second, we seek touse the package to improve the quality of summary statistics reported in research papers.Materials and Methods:Thetableonepackage is developed following good practice guidelines for scientificcomputing and all code is made available under a permissive MIT License. A testing framework runs on a con-tinuous integration server, helping to maintain code stability. Issues are tracked openly and public contributionsare encouraged.Results:Thetableonesoftware package automatically compiles summary statistics into publishable formatssuch as CSV, HTML, and LaTeX. An executable Jupyter Notebook demonstrates application of the package to asubset of data from the MIMIC-III database. Tests such as Tukey’s rule for outlier detection and Hartigan’s DipTest for modality are computed to highlight potential issues in summarizing the data.Discussion and Conclusion:We present open source software for researchers to facilitate carrying out repro-ducible studies in Python, an increasingly popular language in scientific research. The toolkit is intended to ma-ture over time with community feedback and input. Development of a common tool for summarizing data mayhelp to promote good practice when used as a supplement to existing guidelines and recommendations. Weencourage use of tableone alongside other methods of descriptive statistics and, in particular, visualization toensure appropriate data handling. We also suggest seeking guidance from a statistician when usingtableonefor a research study, especially prior to submitting the study for publication.
Date issued
2018-05Department
Massachusetts Institute of Technology. Institute for Medical Engineering & Science; Harvard--MIT Program in Health Sciences and Technology. Laboratory for Computational PhysiologyJournal
JAMIA open
Publisher
Oxford University Press (OUP)
Citation
Pollard, Tom J. et al. “tableone: An open source Python package for producing summary statistics for research papers.” JAMIA open, vol. 1, no. 1, 2018, pp. 26-31 © 2018 The Author(s)
Version: Final published version
ISSN
2574-2531