Publications


The materials presented on this page is to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Underlined names are students advised by me.    Italicized names are UCR students.

HPCA'23

KRISP: Enabling Kernel-wise Right-sizing for Spatial Partitioned GPU Inference Servers
Marcus Chow, Ali Jahanshahi, Daniel Wong
In Proceedings of the 29th IEEE International Symposium on High Performance Computer Architecture (HPCA), 2023. To appear. (Acceptance Rate: 25.0%)
 

 

ACM TACO'22

PowerMorph: QoS-Aware Server Power Reshaping for Data Center Regulation Service
Ali Jahanshahi, Nanpeng Yu, Daniel Wong
ACM Transactions on Architecture and Code Optimization. Volume 19, Issue 3, September 2022
 

 

SC'21

MAPA: Multi-Accelerator Pattern Allocation Policy for Multi-Tenant GPU Servers
Kiran Ranganath, Joshua D. Suetterlein, Joseph Manzano, Shuaiwen Leon Song, Daniel Wong
In Proceedings of the 2021 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021 (Acceptance Rate: 26.8%).
 

 

LCPC'21

LC-MEMENTO: A Memory Model for Accelerated Architectures
Kiran Ranganath, Jesun Firoz, Joshua Suetterlein, Joseph Manzano, Andres Marquez, Mark Raugas, Daniel Wong
In the 34th International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2021.
 

 

NAS'21

LocalityGuru: A PTX Analyzer for Extracting Thread Block-level Locality in GPGPUs
Devashree Tripathy, Amirali Abdolrashidi, Quan Fan, Daniel Wong, Manoranjan Satpathy
In Proceedings of the 15th International Conference on Networking, Architecture, and Storage (NAS), 2021.
 

 

NAS'21

ICAP: Designing Inrush Current Aware Power Gating Switch for GPGPU
Hadi Zamani Sabzi, Devashree Tripathy, Ali Jahanshahi, Daniel Wong
In Proceedings of the 15th International Conference on Networking, Architecture, and Storage (NAS), 2021.
 

 

ISCA'21

BlockMaestro: Enabling Programmer-Transparent Task-based Execution in GPU Systems
Amirali Abdolrashidi, Hodjat Asghari Esfeden, Ali Jahanshahi, Kaustubh Singh, Nael Abu-Ghazaleh, Daniel Wong
In Proceedings of the 48th ACM/IEEE International Symposium on Computer Architecture (ISCA), 2021 (Acceptance Rate: 18.7%).
 

 

ACM TACO'21

PAVER: Locality Graph-based Thread Block Scheduling for GPUs
Devashree Tripathy, Amirali Abdolrashidi, Laxmi N. Bhuyan, Liang Zhou, Daniel Wong
ACM Transactions on Architecture and Code Optimization, Volume 18, Issue 3, June 2021.
 

 

MICRO'20

BOW: Breathing Operand Windows to Exploit Bypassing in GPUs
Hodjat Asghari Esfeden, Amirali Abdolrashidi, Shafiur Rahman, Daniel Wong, Nael Abu-Ghazaleh
In Proceedings of the 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020 (Acceptance Rate: 19.4%)
 

 

IEEE Computer Architecture Letters (CAL'20)

GPU-NEST: Characterizing Energy Efficiency of Multi-GPU Inference Servers
Ali Jahanshahi, Hadi Zamani Sabzi, Chester Lau, Daniel Wong
Computer Architecture Letters, 2020
 

 

FCCM'20

High-Performance Parallel Radix Sort on FPGA
Bashar Romanous, Mohammadreza Rezvani, Junjie Huang, Daniel Wong, Evangelos E. Papalexakis, Vassilis J. Tsotras, and Walid Najjar
In Proceedings of the 28th IEEE International Symposium On Field-Programmable Custom Computing Machines (FCCM), 2020 (Poster)

 

ASPLOS'19

CORF: Coalescing Operand Register File for GPUs
Hodjat Asghari Esfeden, Farzad Khorasani, Hyeran Jeon, Daniel Wong, Nael Abu-Ghazaleh
In Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2019 (Acceptance Rate: 21.1%)
 Lightning Talk

 

HPCA'19

μDPM: Dynamic Power Management for the Microsecond Era
Chih-Hsun Chou, Laxmi N. Bhuyan, Daniel Wong
In Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture (HPCA), 2019 (Acceptance Rate: 19.7%)
 

 

IEEE Computer Architecture Letters (CAL'19)

Locality-aware GPU Register File
Hyeran Jeon, Hodjat Asghari Esfeden, Nael Abu-Ghazaleh, Daniel Wong
Computer Architecture Letters, 2019
 

 

IEEE Computer Architecture Letters (CAL'19)

Speeding up Collective Communications Through Inter-GPU Re-routing
Kiran Ranganath, Amirali Abdolrashidi, Shuaiwen Leon Song, Daniel Wong
Computer Architecture Letters, 2019
 

 

SMACD'19

Long-Term Reliability Management For Multitasking GPGPUs
Zeyu Sun, Taeyoung Kim, Marcus Chow, Shaoyi Peng, Han Zhou, Hyoseung Kim, Daniel Wong, Sheldon Tan
In Proceedings of the 2019 International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD), 2019

 

Applied Energy'19

Frequency Regulation Service Provision in Data Center
Wei Wang, Amirali Abdolrashidi, Nanpeng Yu, Daniel Wong
Applied Energy, Volume 251, October 2019 (IF: 8.4)
 ScienceDirect

 

ISLPED'18

Load-Triggered Warp Approximation on GPU
Zhenhong Liu, Daniel Wong, Nam Sung Kim
In Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), 2018 (Acceptance Rate: 23.3%)
 

 

IPDPS'18

Joint Server and Network Energy Saving in Data Centers for Latency-Sensitive Applications
Liang Zhou, Chih-Hsun Chou, Laxmi N. Bhuyan, K. K. Ramakrishnan, Daniel Wong
In Proceedings of the 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2018 (Acceptance Rate: 24.5%)
 

 

MICRO'17

WIREFRAME: Supporting Data-dependent Parallelism through Dependency Graph Execution in GPUs
AmirAli Abdolrashidi, Devashree Tripathy, Mehmet Esat Belviranli, Laxmi N. Bhuyan, Daniel Wong
In Proceedings of the 50th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2017 (Acceptance Rate: 18.6%)
 

 

SBAC-PAD'16

STOMP: Statistical Techniques for Optimizing and Modeling Performance of Blocked Sparse Matrix Vector Multiplication
Steena Monteiro, Forrest Iandola, Daniel Wong
In Proceedings of the 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2016.
 

 

ISLPED'16

DynSleep: Fine-grained Power Management for a Latency-Critical Data Center Application
Chih-Hsun Chou, Daniel Wong, Laxmi N. Bhuyan
In Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED), 2016 (Acceptance Rate: 23%).
 

 

ICS'16

Origami: Folding Warps for Energy Efficient GPUs
Mohammad Abdel-Majeed, Daniel Wong, Justin Kuang, Murali Annavaram
In Proceedings of the ACM International Conference on Supercomputing (ICS), 2016 (Acceptance Rate: 24%).
 

 

DAC'16

Invited - Cross-layer modeling and optimization for electromigration induced reliability
Taeyoung Kim, Zeyu Sun, Chase Cook, Hengyang Zhao, Ruiwen Li, Daniel Wong, Sheldon X.-D. Tan
In Proceedings of the IEEE/ACM Design Automation Conference (DAC), 2016.
 

 

ISCA'16

Peak Efficiency Aware Scheduling for Highly Energy Proportional Servers
Daniel Wong
In Proceedings of the 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA), 2016 (Acceptance Rate: 19.5%).
 

 

HPCA'16

Approximating Warps with Intra-warp Operand Value Similarity
Daniel Wong, Nam Sung Kim, Murali Annavaram
In Proceedings of the 22nd IEEE International Symposium on High Performance Computer Architecture (HPCA), 2016 (Acceptance Rate: 22%)
 

 

IISWC'15

A Retrospective Look Back on the Road Towards Energy Proportionality
Daniel Wong, Julia Chen, Murali Annavaram.
In Proceedings of the 2015 IEEE International Symposium on Workload Characterization (IISWC), 2015
Short paper with poster presentation
 

 

HPCA'14

Implications of High Energy Proportional Servers on Cluster-wide Energy Proportionality
Daniel Wong, Murali Annavaram.
In Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture (HPCA), 2014 (Acceptance Rate: 25.6%)
  

 

MICRO'13

Warped Gates: Gating Aware Scheduling and Power Gating for GPGPUs
Mohammad Abdel-Majeed* and Daniel Wong*, Murali Annavaram.
In Proceedings of the 46th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2013 (Acceptance Rate: 16.3%)
* Authors contributed equally
   

 

IEEE MICRO
Top Picks'13

Scaling the Energy Proportionality Wall with KnightShift
Daniel Wong, Murali Annavaram
In IEEE Micro’s “Top Picks from the Computer Architecture Conferences of 2012″ Issue, May/June 2013
 IEEExplore

MICRO'12

KnightShift: Scaling the Energy Proportionality Wall through Server-level Heterogeneity
Daniel Wong, Murali Annavaram.
In Proceedings of the 45th IEEE/ACM International Symposium on Microarchitecture (MICRO), 2012 (Acceptance Rate: 17.5%)
 
Selected as 1 of 11 IEEE Micro Top Pick in Computer Architecture 2013 

 

WEED'12

Evaluating A Prototype KnightShift-enabled Server
Daniel Wong, Murali Annavaram.
4th Workshop on Energy Efficient Design (WEED) held in conjunction with ISCA, 2012
  

 

MICRO'10

Adaptive and Speculative Slack Simulations of CMPs on CMPs
Jianwei Chen, Lakshmi Kumar Dabbiru, Daniel Wong, Murali Annavaram, Michel Dubois.
In Proceedings of the 2010 International Symposium on Microarchitecture (MICRO), 2010 (Acceptance Rate: 17.4%)
 

 

FDG'10

Implementing Games on Pinball Machines
Daniel Wong, Darren Earl, Fred Zyda, Ryan Zink, Sven Koenig, Allen Pan, Selby Shlosberg, Jaspreet Singh and Nathan Sturtevant.
Proceedings of Foundations of Digital Games (FDG), 2010 (Acceptance Rate: 34%)
  

 

AAAI'10

Teaching Robotics and Computer Science with Pinball Machines
Daniel Wong, Darren Earl, Fred Zyda and Sven Koenig.
Proceedings of the AAAI-10 Spring Symposium on Educational Robotics and Beyond: Design and Evaluation, pages 37-42, 2010.
 

 

Non-Referred Publications 

Joseph Bungo, Daniel Wong, Bringing GPU Accelerated Computing and Deep Learning to the Classroom, Journal of Computational Science Education (JOCSE), Volume 12, Issue 2. Presented in Seventh SC Workshop on Best Practices for HPC Training and Education (BPHTE), 2020. 

Daniel Wong, S. Lloyd, M. Gokhale, A Memory-mapped Approach to Checkpointing. Technical Report LLNL-TR-635611, Lawrence Livermore National Laboratory (LLNL), Livermore, CA, 2013. 

I. Karlin, A. Bhatele, B. Chamberlain, J. Cohen, Z. Devito, M. Gokhale, R. Haque, R. Hornung, J. Keasler, D. Laney, E. Luke, S. Lloyd, J. McGraw, R. Neely, D. Richards, M. Schulz, C.H. Still, F. Wang, Daniel Wong, LULESH Programming Model and Performance Ports Overview. Technical Report LLNL-TR-608824, Lawrence Livermore National Laboratory (LLNL), Livermore, CA, 2012. 

Daniel Wong, Murali Annavaram, Scalable System-level Active Low Power Mode with Bounded Latency. Technical Report CENG-2012-5, Department of Electrical Engineering, University of Southern California, Los Angeles (California), 2012. 

Daniel Wong, Murali Annavaram, Enhancing Server Energy Efficiency by Shifting Light Burden to an Assistant. 2nd Annual Ming Hsiegh Department of Electrical Engineering Research Festival, 2012. Honorable Mention Poster Award Also presented at Sixth USC-Tsinghua Symposium on Green Technology and Energy Informatics 

Daniel Wong, R. Zink and S. Koenig, Teaching Artificial Intelligence and Robotics via Games [Poster Abstract], Proceedings of the AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI), 2010 

Daniel Wong, M. Gokhale, Real-World Performance of Document-Similarity Web Attack Classifier In Embedded Hardware. LLNL Summer Intern Poster Symposium, 2010. 

John O’Hollaren, Vairavan Laxman, Noah Olsman, Michael Benzimra, Daniel Wong, and Nielson Bernardo. SeaBee III. Technical report, University of Southern California Competition Robotics (USCR), University of Southern California, 2010. 

Daniel Wong, D. Earl, F. Zyda and S. Koenig. Programming Pinball Machines for Fun and Education. Technical Report 08-901, Department of Computer Science, University of Southern California, Los Angeles (California), 2008. 

Press 

GPU Computing 101: Why University Educators Are Pulling NVIDIA Teaching Kits into Their Classrooms, Nvidia, https://blogs.nvidia.com/blog/2019/05/23/nvidia-teaching-kits/, 2019

Interview, Nvidia's Turing Chip Opens Door to New Virtual Reality Realm, ECT News Network, https://www.ectnews.com/story/85506.html, 2018

Daniel Wong, S. Koenig, PinHorse: Teaching Old Pinball Machines New Tricks, www.pinballnews.com, 2009