Senior Staff Engineer | MangoBoost
Contact: rachata[dot]ausavarungnirun[at]mangoboost[dot]io

I am a senior staff engineer at MangoBoost. Prior to this, I was an assistant professor at the Sirindhorn International Thai-German Graduate School (TGGS), where I led the Architecture Research Group. My research interests span multiple areas, including GPU architectures, management of GPUs in the cloud, virtual memory, heterogeneous CPU-GPU architectures, memory subsystems, persistent memory, accelerator designs for bioinformatics applications, and networks-on-chip.

I received my Ph.D. from Carnegie Mellon University under the guidance of Professor Onur Mutlu. I also received my M.S. in Electrical and Computer Engineering, B.S. in Computer Science, and B.S. in Electrical and Computer Engineering from Carnegie Mellon University.

News


  • March 2024: A paper on GC and swap co-design is accepted at ASPLOS 2024.
  • July 2023: A paper on virtual memory design is accepted at MICRO 2023.
  • February 2023: A paper on virtual memory design for processing-in-memory is accepted at DAC 2023.
  • August 2022: A paper on collaborative process and memory resource management is accepted at EuroSys 2023.
  • July 2022: A paper on in-RRAM graph acceleration is accepted at ICCAD 2022.
  • April 2022: A paper on techniques to harvest neighbor GPUs' memory is accepted at USENIX ATC 2022.
  • December 2021: A paper on cache file management for mobile phones is accepted at FAST 2022.
  • November 2021: A paper on an in-storage accelerator for genome sequence analysis is accepted at ASPLOS 2022.
  • July 2021: A paper on GPU co-scheduling is accepted at ICCAD 2021.
  • July 2021: A paper on a set ISA for graph mining is accepted at MICRO 2021.
  • June 2021: Faculty mentor at the 3rd Undergrad Architecture Mentoring (uArch) Workshop.
  • April 2021: Invited virtual talk titled "Utilizing System and Hardware to Accelerate HPC" at ANSCES 2021.
  • February 2021: Invited virtual talk titled "High-performance Virtual Memory Design for Modern Architectures" at AMD.
  • July 2020: Our paper on a technique to maintain variable-granularity metadata in large pages is accepted at PACT 2020.
  • July 2020: A paper on an energy-efficient and high-performance accelerator design for DNA sequence alignment is accepted at MICRO 2020.
  • April 2020: A paper on how to efficiently manage reclaim schemes in a mobile system is accepted at USENIX ATC 2020.
  • March 2020: A paper on how to improve the design of virtual memory is accepted at ISCA 2020.
  • November 2019: Invited talk in Changsha and Wuhan on GPU Virtual Memory.
  • September 2019: Invited talk at UNIST on GPU memory hierarchy design.
  • July 2019: A paper on a scheme to improve reliability of a hybrid DRAM-NVM memory is accepted at MICRO 2019.
  • May 2019: Invited talks titled "High-performance Memory Hierarchy Design for Throughput Processors" at the National University of Singapore, Nanyang Technological University, KAIST, Seoul National University, National Taiwan University, City University of Hong Kong, Hong Kong University of Science and Technology, Kyushu University, Shanghai Jiao Tong University, Peking University, the Institute of Computing Technology at the Chinese Academy of Sciences, Tsinghua University, and National Chiao Tung University.
  • March 2019: A paper on a near-data accelerator is accepted at ISCA 2019.

Selected Publications


  • Jiacheng Huang, Yunmo Zhang, Junqiao Qiu, Yu Liang, Rachata Ausavarungnirun, Qingan Li, Chun Jason Xue
    "More Apps, Faster Hot-Launch on Mobile Devices via Fore/Background-aware GC-Swap Co-design"
    Proceedings of the ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2024), San Diego, CA, USA, April 2024.
  • Amel Fatima, Sihang Liu, Korakit Seemakhupt, Rachata Ausavarungnirun, Samira Khan
    "vPIM: Efficient Virtual Address Translation for Scalable Processing in-Memory Architectures"
    Proceedings of the 60th Design Automation Conference (DAC 2023), San Francisco, CA, USA, July 2023.
  • Changlong Li, Yu Liang, Rachata Ausavarungnirun, Zongwei Zhu, Liang Shi, Chun Jason Xue
    "ICE: Collaborating Memory and Process Management for User Experience on Resource-limited Mobile Devices"
    Proceedings of the 18th European Conference on Computer Systems (EuroSys 2023), Rome, Italy, May 2023.
  • Xing Li, Rachata Ausavarungnirun, Xiao Liu, Xueyuan Liu, Xuan Zhang, Heng Lu, Zhuoran Song, Naifeng Jing, Xiaoyao Liang
    "Gzippo: Highly-compact Processing-In-Memory Graph Accelerator Alleviating Sparsity and Redundancy"
    Proceedings of the 41st ACM/IEEE International Conference on Computer-Aided Design (ICCAD 2022), San Diego, CA, USA, October 2022.
  • Sangjin Choi, Taeksoo Kim, Jinwoo Jeong, Rachata Ausavarungnirun, Myeongjae Jeon, Youngjin Kwon, Jeongseob Ahn
    "Memory Harvesting in Multi-GPU Systems with Hierarchical Unified Virtual Memory"
    Proceedings of the 2022 USENIX Annual Technical Conference (USENIX ATC 2022), Carlsbad, CA, July 2022.
  • Nika Mansouri Ghiasi, Jisung Park, Harun Mustafa, Jeremie Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, Rachata Ausavarungnirun, Nandita Vijaykumar, Mohammed Alser, and Onur Mutlu
    "GenStore: A High-Performance and Energy-Efficient In-Storage Computing System for Genome Sequence Analysis"
    Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2022), Lausanne, Switzerland, February 2022.
  • Yu Liang, Riwei Pan, Tianyu Ren, Yufei Cui, Rachata Ausavarungnirun, Xianzhang Chen, Changlong Li, Tei-Wei Kuo, Chun Jason Xue
    "CacheSifter: Sifting Cache Files for Boosted Mobile Performance and Lifetime"
    Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST 2022), Santa Clara, CA, USA, February 2022.
  • Xuanyi Li, Chen Li, Yang Guo, Rachata Ausavarungnirun
    "Improving Inter-kernel Data Reuse With CTA-Page Coordination in GPGPU"
    Proceedings of the 40th IEEE/ACM International Conference On Computer-Aided Design (ICCAD 2021), Virtual Event, November 2021.
  • Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez Luna, Jakub Golinowski, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Nils Blach, Marek Konieczny, Onur Mutlu, Torsten Hoefler
    "SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems"
    Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO 2021), Virtual Event, October 2021.
  • Rachata Ausavarungnirun, Timothy Merrifield, Jayneel Gandhi, Christopher J. Rossbach
    "PRISM: Architectural Support for Variable-granularity Memory Metadata"
    Proceedings of the 29th International Conference on Parallel Architectures and Compilation Techniques (PACT 2020), Atlanta, Georgia, USA, October 2020.
  • Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Lavanya Subramanian, Can Firtina, Jeremie Kim, Rachata Ausavarungnirun, Mohammed Alser, Anant Nori, Juan Gómez Luna, Amirali Boroumand, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu.
    "GenASM: A Low-Power, Memory-Efficient Approximate String Matching Acceleration Framework for Genome Sequence Analysis"
    To appear in the Proceedings of the 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO 2020), Athens, Greece, October 2020.
  • Yu Liang, Jinheng Li, Xianzhang Chen, Rachata Ausavarungnirun, Riwei Pan, Tei-Wei Kuo, Chun Jason Xue
    "Differentiating Cache Files for Fine-grain Management to Improve Mobile Performance and Lifetime"
    Proceedings of the 12th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 2020), Boston, MA, USA, July 2020.
  • Yu Liang, Jinheng Li, Rachata Ausavarungnirun, Riwei Pan, Liang Shi, Tei-Wei Kuo, Chun Jason Xue
    "Acclaim: Adaptive Memory Reclaim to Improve User Experience in Android Systems"
    Proceedings of the 2020 USENIX Annual Technical Conference (USENIX ATC 2020), Boston, MA, USA, July 2020.
  • Nastaran Hajinazar, Pratyush Patel, Minesh Patel, Konstantinos Kanellopoulos, Saugata Ghose, Rachata Ausavarungnirun, Geraldo Francisco de Oliveira Junior, Jonathan Appavoo, Vivek Seshadri, Onur Mutlu
    "The Virtual Block Interface (VBI): A Flexible Alternative to Conventional Virtual Memory Frameworks"
    Proceedings of the 47th IEEE/ACM International Symposium on Computer Architecture (ISCA 2020), Valencia, Spain, June 2020.
  • Xiao Liu, David Roberts, Rachata Ausavarungnirun, Onur Mutlu, Jishen Zhao
    "Binary Star: Coordinated Reliability in Heterogeneous Memory Systems for High Performance and Scalability"
    Proceedings of the 52nd IEEE/ACM International Symposium on Microarchitecture (MICRO 2019), Columbus, OH, October 2019.
  • Amirali Boroumand, Saugata Ghose, Minesh Patel, Rachata Ausavarungnirun, Hasan Hassan, Brandon Lucia, Kevin Hsieh, Nastaran Hajinazar, Krishna T. Malladi, Hongzhong Zheng, Onur Mutlu
    "CoNDA: Enabling Efficient Near-Data Accelerator Communication by Optimizing Data Movement"
    Proceedings of the 46th International Symposium on Computer Architecture (ISCA 2019), Phoenix, AZ, June 2019.
  • Chen Li, Rachata Ausavarungnirun, Christopher J. Rossbach, Youtao Zhang, Onur Mutlu, Yang Guo, Jun Yang
    "A Framework for Memory Oversubscription Management in Graphics Processing Units"
    Proceedings of the 24th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2019), Providence, RI, April 2019.
  • Mohammad Sadrosadati, Seyed Borna Ehsani, Hajar Falahati, Rachata Ausavarungnirun, Arash Tavakkol, Mojtaba Abaee, Lois Orosa, Yaohua Wang, Hamid Sarbazi-Azad, and Onur Mutlu.
    "ITAP: Idle-Time-Aware Power Management for GPU Execution Units"
    ACM Transactions on Architecture and Code Optimization (ACM TACO), December 2018.
  • Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach, and Onur Mutlu,
    "Mosaic: Enabling Application-Transparent Support for Multiple Page Sizes in Throughput Processors"
    In ACM SIGOPS Operating Systems Review - Special Topics, Vol. 52, Issue 1, July 2018.
  • Rachata Ausavarungnirun, Vance Miller, Joshua Landgraf, Saugata Ghose, Jayneel Gandhi, Adwait Jog, Christopher J. Rossbach, and Onur Mutlu,
    "MASK: Redesigning the GPU Memory Hierarchy to Support Multi-Application Concurrency"
    Proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2018), Williamsburg, VA, March 2018.
  • Amirali Boroumand, Saugata Ghose, Youngsok Kim, Rachata Ausavarungnirun, Eric Shiu, Rahul Thakur, Daehyun Kim, Aki Kuusela, Allan Knies, Parthasarathy Ranganathan, Onur Mutlu,
    "Google Workloads for Consumer Devices: Mitigating Data Movement Bottlenecks"
    Proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2018), Williamsburg, VA, March 2018.
  • Maciej Besta, Syed Minhaj Hassan, Sudhakar Yalamanchili, Rachata Ausavarungnirun, Onur Mutlu, Torsten Hoefler,
    "Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability"
    Proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2018), Williamsburg, VA, March 2018.
  • Mohammad Sadrosadati, Amirhossein Mirhosseini, Seyed Borna Ehsani, Hamid Sarbazi-Azad, Mario Drumond, Babak Falsafi, Rachata Ausavarungnirun, Onur Mutlu,
    "LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching"
    Proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2018), Williamsburg, VA, March 2018.
  • Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach, and Onur Mutlu,
    "Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes"
    Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2017), Boston, MA, October 2017.
  • Donghyuk Lee, Samira Khan, Lavanya Subramanian, Saugata Ghose, Rachata Ausavarungnirun, Gennady Pekhimenko, Vivek Seshadri, and Onur Mutlu,
    "Design-Induced Latency Variation in Modern DRAM Chips: Characterization, Analysis, and Latency Reduction Mechanisms"
    Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS 2017), Urbana-Champaign, IL, June 2017.
  • Onur Kayiran, Adwait Jog, Ashutosh Pattnaik, Rachata Ausavarungnirun, Xulong Tang, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, Chita R. Das,
    "Fine-grain Datapath Management for Scale-up GPUs"
    Proceedings of the 25th International Conference on Parallel Architectures and Compilation Techniques (PACT 2016), Haifa, Israel, September 2016.
  • Yang Li, Di Wang, Saugata Ghose, Jie Liu, Sriram Govindan, Sean James, Eric Peterson, John Siegler, Rachata Ausavarungnirun, Onur Mutlu,
    "SizeCap: Efficiently Handling Power Surges in Fuel Cell Powered Data Centers"
    Proceedings of the 22nd International Symposium on High-Performance Computer Architecture (HPCA 2016), Barcelona, Spain, March 2016.
  • Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Chang, Greg Nazario, Reetuparna Das, Gabriel Loh, Onur Mutlu
    "A Case for Hierarchical Rings with Deflection Routing: An Energy-Efficient On-Chip Communication Substrate"
    Parallel Computing (PARCO), May 2016, pp. 29-45. arXiv.org version, February 2016.
  • Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita Das, Mahmut Kandemir, Todd C. Mowry, Onur Mutlu,
    "A Framework for Accelerating Bottlenecks in GPU Execution with Assist Warps"
    Invited Book Chapter in Advances in GPU Research and Practice, Elsevier, to be published in 2016. arXiv.org version, February 2016.
  • Donghyuk Lee, Lavanya Subramanian, Rachata Ausavarungnirun, Jongmoo Choi, Onur Mutlu,
    "Decoupled Direct Memory Access: Isolating CPU and IO Traffic by Leveraging a Dual-Port DRAM"
    Proceedings of the 24th International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), San Francisco, October 2015.
  • Rachata Ausavarungnirun, Saugata Ghose, Onur Kayiran, Gabriel H. Loh, Chita R. Das, Mahmut T. Kandemir, Onur Mutlu,
    "Exploiting Inter-Warp Heterogeneity to Improve GPGPU Performance"
    Proceedings of the 24th International Conference on Parallel Architectures and Compilation Techniques (PACT 2015), San Francisco, October 2015.
  • Mohammad Fattah, Antti Airola, Rachata Ausavarungnirun, Nima Mirzaei, Pasi Liljeberg, Juha Plosila, Siamak Mohammadi, Tapio Pahikkala, Onur Mutlu and Hannu Tenhunen,
    "A Low-Overhead, Fully-Distributed, Guaranteed-Delivery Routing Algorithm for Faulty Network-on-Chips"
    Proceedings of the 9th ACM/IEEE International Symposium on Networks on Chip (NOCS 2015), Vancouver, Canada, September 2015.
  • Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Onur Mutlu, Chita Das, Mahmut Kandemir, and Todd C. Mowry,
    "A Case for Core-Assisted Bottleneck Acceleration in GPUs: Enabling Efficient Data Compression"
    Proceedings of the 42nd International Symposium on Computer Architecture (ISCA 2015), Portland, OR, June 2015.
  • Onur Kayiran, Nachiappan CN, Adwait Jog, Rachata Ausavarungnirun, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, Chita R. Das,
    "Managing GPU Concurrency in Heterogeneous Architectures"
    Proceedings of the 47th International Symposium on Microarchitecture (MICRO 2014), Cambridge, UK, December 2014.
  • Rachata Ausavarungnirun, Chris Fallin, Xiangyao Yu, Kevin Chang, Greg Nazario, Reetuparna Das, Gabriel H. Loh, Onur Mutlu,
    "Design and Evaluation of Hierarchical Rings with Deflection Routing"
    Proceedings of the 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2014), Paris, France, October 2014.
  • Chris Fallin, Greg Nazario, Xiangyao Yu, Kevin Chang, Rachata Ausavarungnirun, and Onur Mutlu,
    "Bufferless and Minimally-Buffered Deflection Routing"
    Invited Book Chapter in Routing Algorithms in Networks-on-Chip, pp. 241-275, Springer, 2014.
  • Vivek Seshadri, Yoongu Kim, Chris Fallin, Donghyuk Lee, Rachata Ausavarungnirun, Gennady Pekhimenko, Yixin Luo, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, and Todd C. Mowry,
    "RowClone: Fast and Energy-Efficient In-DRAM Bulk Data Copy and Initialization"
    Proceedings of the 46th International Symposium on Microarchitecture (MICRO 2013), Davis, CA, December 2013.
  • Reetuparna Das, Rachata Ausavarungnirun, Onur Mutlu, Akhilesh Kumar, Mani Azimi.
    "Application-to-Core Mapping Policies to Reduce Memory System Interference in Multi-Core Systems"
    Proceedings of the 19th International Symposium on High-Performance Computer Architecture (HPCA 2013), Shenzhen, China, February 2013.
  • Kevin Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu.
    "HAT: Heterogeneous Adaptive Throttling for On-Chip Networks".
    Proceedings of the 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2012), New York, NY, October 2012.
  • HanBin Yoon, Justin Meza, Rachata Ausavarungnirun, Rachael A. Harding, Onur Mutlu.
    "Row Buffer Locality Aware Caching Policies for Hybrid Memories".
    Proceedings of the 30th IEEE International Conference on Computer Design (ICCD 2012), Montreal, Quebec, Canada, September 2012.
    Best Paper Award (in Computer Systems and Applications track).
  • Reetuparna Das, Rachata Ausavarungnirun, Onur Mutlu, Akhilesh Kumar, Mani Azimi
    "Application-to-Core Mapping Policies to Reduce Memory Interference in Multi-Core Systems".
    Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques (PACT 2012), Poster Session, Minneapolis, MN, September 2012.
  • Rachata Ausavarungnirun, Kevin Chang, Lavanya Subramanian, Gabriel H. Loh, Onur Mutlu.
    "Staged Memory Scheduling: Achieving High Performance and Scalability in Heterogeneous Systems".
    Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), Portland, OR, June 2012.
  • Chris Fallin, Greg Nazario, Xiangyao Yu, Kevin Chang, Rachata Ausavarungnirun, Onur Mutlu.
    "MinBD: Minimally-Buffered Deflection Routing for Energy-Efficient Interconnect".
    Proceedings of the 6th ACM/IEEE International Symposium on Networks-on-Chip (NOCS 2012), Lyngby, Denmark, May 2012.
    One of the five papers nominated for the Best Paper Award by the Program Committee.
  • Kevin Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu.
    "Adaptive Cluster Throttling: Improving High-Load Performance in Bufferless On-Chip Networks".
    SAFARI Technical Report No. 2011-006. September 6, 2011.
Teaching and Academic Services


  • Teaching:
    • Advanced Computer Architecture (at TGGS)
    • Hardware and System Software Architecture (at TGGS)
    • Graduate Programming Language and Compiler (at TGGS)
    • Graduate Computer Architecture (at VISTEC)
    • Computer Architecture (at Mahidol University International College)
    • Introduction to System Skill (at Mahidol University International College)
    • Principle of Computer System and Architecture (at Mahidol University International College)
    • Functional and Parallel Computing (at Mahidol University International College)
    • Organization of Programming Language (at Mahidol University International College)
    • During my PhD at CMU, I was a TA for 18-447, Introduction to Computer Architecture (Spring 2014 and Spring 2015)
  • Program Committee: ICS 2024, ASPLOS 2024, HPCA 2024, DAC 2024, HPCA 2023, IISWC 2023, MICRO 2023, IISWC 2022, ISPASS 2022, MICRO 2021, ISPASS 2021, MICRO 2020, GPGPU 2020, GPGPU 2019
  • External Review Committee: ISCA 2024, ASPLOS 2023, MICRO 2022, ISCA 2022, ISCA 2021, HPCA 2021, ISCA 2020, ISCA 2019
  • Finance Chair: ISPASS 2022
Others


    Aside from my interest in architecture research, I have been a photo enthusiast for years. Below are some pictures from my portfolio.
    All images © 2006-2017 Rachata Ausavarungnirun

    Jardins du château de Versailles by Rachata Ausavarungnirun on 500px.com

    Louvre by Rachata Ausavarungnirun on 500px.com

    Wilderness by Rachata Ausavarungnirun on 500px.com

    Bison at Wind Cave Natl. Park by Rachata Ausavarungnirun on 500px.com

    Cucumber Fall, Ohiopyle by Rachata Ausavarungnirun on 500px.com

    Kanarra Falls by Rachata Ausavarungnirun on 500px.com

    Antelope Canyon by Rachata Ausavarungnirun on 500px.com