Show simple item record

dc.contributor.advisorJames Glass and Hao Tang.en_US
dc.contributor.authorOseni-Adegbite, Adedotun J.en_US
dc.contributor.otherMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2020-09-15T21:59:07Z
dc.date.available2020-09-15T21:59:07Z
dc.date.copyright2020en_US
dc.date.issued2020en_US
dc.identifier.urihttps://hdl.handle.net/1721.1/127463
dc.descriptionThesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, May, 2020en_US
dc.descriptionCataloged from the official PDF of thesis.en_US
dc.descriptionIncludes bibliographical references (pages 81-84).en_US
dc.description.abstractGiven the large amount of time workers are spending in meetings, having statistics to drive more effective meetings is desirable. Various workplaces have distinct types of meetings and workers present. So the more agnostic to the content and people present within the meeting, the more meeting scenarios these statistics can be applied. We propose a system that provides these statistics in the form of a summary of who is speaking within the meeting and at what times they are speaking whilst respecting the participants' privacy. The system aims to run completely online and locally. Therefore, no audio needs to be stored or transmitted on the device running the system. This is accomplished by displaying where speech originates in the room and also labeling the speaker. Time stamp labels are provided for all occurrences of a speaker's speech thus allowing a breakdown of how each speaker contributed to the meeting. We have created a dataset of emulated meeting-like scenario recordings to run experiments on. In an offline scenario, this system was able to achieve a DER of 27.8% with no overlap in the speech, 44.3% with small amounts of overlap, and 50.0% with large amounts of overlap. When run online, DERs of 16.9%, 37.2%, and 45.6% were achieved in situations of no overlap, small overlap, and large amounts of overlap respectively.en_US
dc.description.statementofresponsibilityby Adedotun J Oseni-Adegbite.en_US
dc.format.extent84 pagesen_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsMIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titleSpeaker diarization in a meeting scenarioen_US
dc.typeThesisen_US
dc.description.degreeM. Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.oclc1192966863en_US
dc.description.collectionM.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienceen_US
dspace.imported2020-09-15T21:59:07Zen_US
mit.thesis.degreeMasteren_US
mit.thesis.departmentEECSen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record