Show simple item record

dc.contributor.authorPollard, Tom Joseph
dc.contributor.authorJohnson, Alistair Edward William
dc.contributor.authorRaffa, Jesse D
dc.contributor.authorMark, Roger G
dc.date.accessioned2020-08-13T16:28:26Z
dc.date.available2020-08-13T16:28:26Z
dc.date.issued2018-05
dc.date.submitted2018-03
dc.identifier.issn2574-2531
dc.identifier.urihttps://hdl.handle.net/1721.1/126562
dc.description.abstractObjectives:In quantitative research, understanding basic parameters of the study population is key for interpre-tation of the results. As a result, it is typical for the first table (“Table 1”) of a research paper to include summarystatistics for the study data. Our objectives are 2-fold. First, we seek to provide a simple, reproducible methodfor providing summary statistics for research papers in the Python programming language. Second, we seek touse the package to improve the quality of summary statistics reported in research papers.Materials and Methods:Thetableonepackage is developed following good practice guidelines for scientificcomputing and all code is made available under a permissive MIT License. A testing framework runs on a con-tinuous integration server, helping to maintain code stability. Issues are tracked openly and public contributionsare encouraged.Results:Thetableonesoftware package automatically compiles summary statistics into publishable formatssuch as CSV, HTML, and LaTeX. An executable Jupyter Notebook demonstrates application of the package to asubset of data from the MIMIC-III database. Tests such as Tukey’s rule for outlier detection and Hartigan’s DipTest for modality are computed to highlight potential issues in summarizing the data.Discussion and Conclusion:We present open source software for researchers to facilitate carrying out repro-ducible studies in Python, an increasingly popular language in scientific research. The toolkit is intended to ma-ture over time with community feedback and input. Development of a common tool for summarizing data mayhelp to promote good practice when used as a supplement to existing guidelines and recommendations. Weencourage use of tableone alongside other methods of descriptive statistics and, in particular, visualization toensure appropriate data handling. We also suggest seeking guidance from a statistician when usingtableonefor a research study, especially prior to submitting the study for publication.en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (Grant NIH-R01-EB017205)en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (Grant NIH-R01-EB001659)en_US
dc.language.isoen
dc.publisherOxford University Press (OUP)en_US
dc.relation.isversionof10.1093/JAMIAOPEN/OOY012en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceOxford University Pressen_US
dc.titletableone: An open source Python package for producing summary statistics for research papersen_US
dc.typeArticleen_US
dc.identifier.citationPollard, Tom J. et al. “tableone: An open source Python package for producing summary statistics for research papers.” JAMIA open, vol. 1, no. 1, 2018, pp. 26-31 © 2018 The Author(s)en_US
dc.contributor.departmentInstitute for Medical Engineering and Scienceen_US
dc.contributor.departmentHarvard--MIT Program in Health Sciences and Technology. Laboratory for Computational Physiologyen_US
dc.relation.journalJAMIA openen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2019-10-09T15:44:36Z
dspace.date.submission2019-10-09T15:44:38Z
mit.journal.volume1en_US
mit.journal.issue1en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record