FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
Author(s)
Kao, Sheng-Chun; Subramanian, Suvinay; Agrawal, Gaurav; Yazdanbakhsh, Amir; Krishna, Tushar
Download3575693.3575747.pdf (4.126Mb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordDate issued
2023-01-27Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryPublisher
ACM|Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2
Citation
Kao, Sheng-Chun, Subramanian, Suvinay, Agrawal, Gaurav, Yazdanbakhsh, Amir and Krishna, Tushar. 2023. "FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks."
Version: Final published version
ISBN
978-1-4503-9916-6
Collections
The following license files are associated with this item: