Now showing items 1-20 of 28

    • A 1020-Node Modular Microphone Array and Beamformer for Intelligent Computing Spaces 

      Weinstein, Eugene; Steele, Kenneth; Agarwal, Anant; Glass, James (2004-04-14)
      Ubiquitous computing environments are characterized by an unboundedamount of noise and crosstalk. In these environments, traditionalmethods of sound capture are insufficient, and array microphones areneeded in order to ...
    • A Quantitative Comparison of Reconfigurable, Tiled, and Conventional Architectures on Bit-level Computation 

      Wentzlaff, David; Agarwal, Anant (2004-04-13)
      General purpose computing architectures are being called on to work on amore diverse application mix every day. This has been fueled by the needfor reduced time to market and economies of scale that are the hallmarksof ...
    • A Stream Algorithm for the SVD 

      Strumpen, Volker; Hoffmann, Henry; Agarwal, Anant (2003-10-22)
      We present a stream algorithm for the Singular-Value Decomposition (SVD) of anM X N matrix A. Our algorithm trades speed of numerical convergence for parallelism,and derives from a one-sided, cyclic-by-rows Hestenes SVD. ...
    • Application Heartbeats for Software Performance and Health 

      Miller, Jason; Agarwal, Anant; Santambrogio, Marco; Eastep, Jonathan; Hoffmann, Henry (2009-08-07)
      Adaptive, or self-aware, computing has been proposed as one method to help application programmers confront the growing complexity of multicore software development. However, existing approaches to adaptive systems are ...
    • ATAC: A Manycore Processor with On-Chip Optical Network 

      Liu, Jifeng; Psota, James; Beckmann, Nathan; Miller, Jason; Michel, Jurgen; e.a. (2009-05-05)
      Ever since industry has turned to parallelism instead of frequency scaling to improve processor performance, multicore processors have continued to scale to larger and larger numbers of cores. Some believe that multicores ...
    • The Case for a Factored Operating System (fos) 

      Agarwal, Anant; Wentzlaff, David (2008-10-08)
      The next decade will afford us computer chips with 1,000 - 10,000 cores on a single piece of silicon. Contemporary operating systems have been designed to operate on a single core or small number of cores and hence are not ...
    • A Comparison of Autonomic Decision Making Techniques 

      Maggio, Martina; Hoffmann, Henry; Santambrogio, Marco D.; Agarwal, Anant; Leva, Alberto (2011-04-01)
      Autonomic computing systems are capable of adapting their behavior and resources thousands of times a second to automatically decide the best way to accomplish a given goal despite changing environmental conditions and ...
    • Core Count vs Cache Size for Manycore Architectures in the Cloud 

      Agarwal, Anant; Miller, Jason; Beckmann, Nathan; Wentzlaff, David (2010-02-11)
      The number of cores which fit on a single chip is growing at an exponential rate while off-chip main memory bandwidth is growing at a linear rate at best. This core count to off-chip bandwidth disparity causes per-core ...
    • DSENT - A Tool Connecting Emerging Photonics with Electronics for Opto-Electronic Networks-on-Chip Modeling 

      Sun, Chen; Chen, Chia-Hsin Owen; Kurian, George; Wei, Lan; Miller, Jason; e.a. (2012-02-08)
      With the advent of many-core chips that place substantial demand on the NoC, photonics has been investigated as a promising alternative to electrical NoCs. While numerous opto-electronic NoCs have been proposed, their ...
    • Efficient Cache Coherence on Manycore Optical Networks 

      Psota, James; Agarwal, Anant; Miller, Jason; Beckmann, Nathan; Kurian, George (2010-02-11)
      Ever since industry has turned to parallelism instead of frequency scaling to improve processor performance, multicore processors have continued to scale to larger and larger numbers of cores. Some believe that multicores ...
    • Energy Scalability of On-Chip Interconnection Networks in Multicore Architectures 

      Agarwal, Anant; Psota, James; Eastep, Jonathan; Konstantakopoulos, Theodoros (2008-11-11)
      On-chip interconnection networks (OCNs) such as point-to-point networks and buses form the communication backbone in systems-on-a-chip, multicore processors, and tiled processors. OCNs can consume significant portions of ...
    • Fleets: Scalable Services in a Factored Operating System 

      Wentzlaff, David; Gruenwald, Charles, III; Beckmann, Nathan; Belay, Adam; Kasture, Harshad; e.a. (2011-03-09)
      Current monolithic operating systems are designed for uniprocessor systems, and their architecture reflects this. The rise of multicore and cloud computing is drastically changing the tradeoffs in operating system design. ...
    • Graphite: A Distributed Parallel Simulator for Multicores 

      Beckmann, Nathan; Eastep, Jonathan; Gruenwald, Charles, III; Kurian, George; Kasture, Harshad; e.a. (2009-11-09)
      This paper introduces the open-source Graphite distributed parallel multicore simulator infrastructure. Graphite is designed from the ground up for exploration of future multicore processors containing dozens, hundreds, ...
    • Multicore Performance Optimization Using Partner Cores 

      Lau, Eric; Miller, Jason E; Choi, Inseok; Yeung, Donald; Amarasinghe, Saman; e.a. (2011-03-25)
      As the push for parallelism continues to increase the number of cores on a chip, and add to the complexity of system design, the task of optimizing performance at the application level becomes nearly impossible for the ...
    • An Operating System for Multicore and Clouds: Mechanisms and Implementation 

      Modzelewski, Kevin; Miller, Jason; Belay, Adam; Beckmann, Nathan; Gruenwald, Charles, III; e.a. (2010-02-08)
      Cloud computers and multicore processors are two emerging classes of computational hardware that have the potential to provide unprecedented compute capacity to the average user. In order for the user to effectively harness ...
    • Partitioning Strategies for Concurrent Programming 

      Devadas, Srinivas; Agarwal, Anant; Hoffmann, Henry (2009-06-16)
      This work presents four partitioning strategies, or patterns, useful for decomposing a serial application into multiple concurrently executing parts. These partitioning strategies augment the commonly used task and data ...
    • PIKA: A Network Service for Multikernel Operating Systems 

      Beckmann, Nathan Z.; Gruenwald III, Charles; Johnson, Christopher R.; Kasture, Harshad; Sironi, Filippo; e.a. (2014-01-28)
      PIKA is a network stack designed for multikernel operating systems that target potential future architectures lacking cache-coherent shared memory but supporting message passing. PIKA splits the network stack into several ...
    • Power-Aware Computing with Dynamic Knobs 

      Misailovic, Sasa; Agarwal, Anant; Carbin, Michael; Sidiroglou, Stelios; Hoffmann, Henry; e.a. (2010-05-14)
      We present PowerDial, a system for dynamically adapting application behavior to execute successfully in the face of load and power fluctuations. PowerDial transforms static configuration parameters into dynamic knobs that ...
    • Remote Store Programming: Mechanisms and Performance 

      Wentzlaff, David; Agarwal, Anant; Hoffmann, Henry (2009-05-05)
      This paper presents remote store programming (RSP). This paradigm combines usability and efficiency through the exploitation of a simple hardware mechanism, the remote store, which can easily be added to existing ...
    • Scalar Operand Networks: Design, Implementation, and Analysis 

      Taylor, Michael Bedford; Lee, Walter; Amarasinghe, Saman; Agarwal, Anant (2004-06-08)
      The bypass paths and multiported register files in microprocessors serve as an implicit interconnect tocommunicate operand values among pipeline stages and multiple ALUs. Previous superscalar designs implementedthis ...