DSpace@MIT (MIT Libraries)


tableone: An open source Python package for producing summary statistics for research papers

Author(s)
Pollard, Tom Joseph; Johnson, Alistair Edward William; Raffa, Jesse D; Mark, Roger G
Download: Published version (375.5Kb)
Terms of use
Creative Commons Attribution 4.0 International license https://creativecommons.org/licenses/by/4.0/
Abstract
Objectives: In quantitative research, understanding basic parameters of the study population is key for interpretation of the results. As a result, it is typical for the first table (“Table 1”) of a research paper to include summary statistics for the study data. Our objectives are 2-fold. First, we seek to provide a simple, reproducible method for providing summary statistics for research papers in the Python programming language. Second, we seek to use the package to improve the quality of summary statistics reported in research papers.

Materials and Methods: The tableone package is developed following good practice guidelines for scientific computing and all code is made available under a permissive MIT License. A testing framework runs on a continuous integration server, helping to maintain code stability. Issues are tracked openly and public contributions are encouraged.

Results: The tableone software package automatically compiles summary statistics into publishable formats such as CSV, HTML, and LaTeX. An executable Jupyter Notebook demonstrates application of the package to a subset of data from the MIMIC-III database. Tests such as Tukey’s rule for outlier detection and Hartigan’s Dip Test for modality are computed to highlight potential issues in summarizing the data.

Discussion and Conclusion: We present open source software for researchers to facilitate carrying out reproducible studies in Python, an increasingly popular language in scientific research. The toolkit is intended to mature over time with community feedback and input. Development of a common tool for summarizing data may help to promote good practice when used as a supplement to existing guidelines and recommendations. We encourage use of tableone alongside other methods of descriptive statistics and, in particular, visualization to ensure appropriate data handling. We also suggest seeking guidance from a statistician when using tableone for a research study, especially prior to submitting the study for publication.
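The kind of check the abstract describes can be illustrated with a minimal sketch that uses only the Python standard library. It is not tableone's own API or implementation. The function names (`tukey_outliers`, `summarize`, `to_csv`), the conventional 1.5 multiplier, the quartile method, and the sample values are all assumptions made for illustration. The sketch builds one summary row for a continuous variable, counts values flagged by Tukey's rule, and writes the row as CSV.

```python
import csv
import io
import statistics


def tukey_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


def summarize(name, values):
    """Build one summary row: n, mean (SD), median [Q1, Q3], outlier count."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation
    return {
        "variable": name,
        "n": len(values),
        "mean_sd": f"{mean:.1f} ({sd:.1f})",
        "median_iqr": f"{median:.1f} [{q1:.1f}, {q3:.1f}]",
        "tukey_outliers": len(tukey_outliers(values)),
    }


def to_csv(rows):
    """Serialize a list of summary rows (dicts with identical keys) as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0]))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up illustrative values, not taken from MIMIC-III.
ages = [54, 61, 58, 63, 59, 57, 60, 62, 56, 120]
rows = [summarize("age", ages)]
print(to_csv(rows))
```

In this made-up sample, the value 120 falls outside the Tukey fences, so the row reports one flagged value. A non-zero count of this kind suggests that the mean and standard deviation alone may summarize the variable poorly.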
Date issued
2018-05
URI
https://hdl.handle.net/1721.1/126562
Department
Institute for Medical Engineering and Science; Harvard-MIT Program in Health Sciences and Technology. Laboratory for Computational Physiology
Journal
JAMIA Open
Publisher
Oxford University Press (OUP)
Citation
Pollard, Tom J., et al. “tableone: An open source Python package for producing summary statistics for research papers.” JAMIA Open, vol. 1, no. 1, 2018, pp. 26-31. © 2018 The Author(s).
Version: Final published version
ISSN
2574-2531

Collections
  • MIT Open Access Articles
