OSDI 2024 论文分类总结

OSDI 2024 论文分类总结

数据来源: OSDI 2024 会议官网

技术领域分类

技术领域分布

数据加载中...

技术领域论文数量

技术领域 论文数量 占比

作者来源分析

国家和机构分布图

数据加载中...

国家和机构分布表

国家 论文数量 占比 机构

关键词词云

论文标题关键词分布

论文来源机构分布

知识图谱

0%

论文列表

Memory Management

Sabre: Hardware-Accelerated Snapshot Compression for Serverless MicroVMs
Nikita Lazarev, Varun Gohil, MIT (CSAIL); James Tsai, Andy Anderson, Bhushan Chitlur (Intel Labs); Zhiru Zhang (Cornell University); Christina Delimitrou, MIT (CSAIL)
Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration
Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao (The University of Texas at Arlington); Yifan Yuan, Ren Wang (Intel Labs)
Managing Memory Tiers with CXL in Virtualized Environments
Yuhong Zhong, Columbia University (Microsoft Azure); Daniel S. Berger, Microsoft Azure (University of Washington); Carl Waldspurger (Carl Waldspurger Consulting); Ryan Wee (Columbia University); Ishwar Agarwal, Rajat Agarwal, Frank Hady, Karthik Kumar (Intel); Mark D. Hill (University of Wisconsin–Madison); Mosharaf Chowdhury (University of Michigan); Asaf Cidon (Columbia University)
Harvesting Memory-bound CPU Stall Cycles in Software with MSH
Zhihong Luo, Sam Son, Sylvia Ratnasamy (UC Berkeley); Scott Shenker (UC Berkeley & ICSI)
A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications
Lei Chen (University of Chinese Academy of Sciences); Shi Liu (UCLA); Chenxi Wang (University of Chinese Academy of Sciences); Haoran Ma, Yifan Qiao (UCLA); Zhe Wang, Chenggang Wu (University of Chinese Academy of Sciences); Youyou Lu (Tsinghua University); Xiaobing Feng, Huimin Cui (University of Chinese Academy of Sciences); Shan Lu (Microsoft Research); Harry Xu (UCLA)
DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency
Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu (UCLA); Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Alibaba Group, Yiying Zhang (UCSD); Miryung Kim, Harry Xu (UCLA)

Low-Latency LLM Serving

Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Amey Agrawal (Georgia Institute of Technology); Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav Gulavani (Microsoft Research India); Alexey Tumanov (Georgia Institute of Technology); Ramachandran Ramjee (Microsoft Research India)
ServerlessLLM: Low-Latency Serverless Inference for Large Language Models
Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete (University of Edinburgh); Dmitrii Ustiugov (NTU Singapore); Yuvraj Patel, Luo Mai (University of Edinburgh)
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management
Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim (Seoul National University)
Llumnix: Dynamic Scheduling for Large Language Model Serving
Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li, Wei Lin (Alibaba Group)
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
Yinmin Zhong, Shengyu Liu (Peking University); Junda Chen (UC San Diego); Jianbo Hu, Peking University, Yibo Zhu (StepFun); Xuanzhe Liu, Xin Jin (Peking University); Hao Zhang (UC San Diego)

Distributed Systems

ACCL+: an FPGA-Based Collective Engine for Distributed Applications
Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Systems Group (ETH Zurich); Tristan Laan (University of Amsterdam); Lucian Petrica, Michaela Blott (AMD Research); Gustavo Alonso, Systems Group (ETH Zurich)
Beaver: Practical Partial Snapshots for Distributed Cloud Services
Liangcheng Yu (University of Pennsylvania); Xiao Zhang (Shanghai Jiao Tong University); Haoran Zhang (University of Pennsylvania); John Sonchack (Princeton University); Dan Ports (Microsoft / University of Washington); Vincent Liu (University of Pennsylvania)
Fast and Scalable In-network Lock Management Using Lock Fission
Hanze Zhang, Institute of Parallel, Distributed Systems, SEIEE (Shanghai Jiao Tong University); Shanghai AI
Chop Chop: Byzantine Atomic Broadcast to the Network Limit
Martina Camaioni, Rachid Guerraoui, Matteo Monti, Pierre-Louis Roman, Manuel Vidigueira, Gauthier Voron (EPFL)

Deep Learning

Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning
Yi Zhai (University of Science and Technology of China); Sijia Yang, (Huawei Technologies); Keyu Pan (ByteDance Ltd.); Renwei Zhang, (Huawei Technologies); Shuo Liu (University of Science and Technology of China); Chao Liu, Zichun Ye, (Huawei Technologies); Jianmin Ji (University of Science and Technology of China); Jie Zhao (Hunan University); Yu Zhang, Yanyong Zhang (University of Science and Technology of China)
Tensor Transformation
Lei Wang (University of Chinese Academy of Sciences & Microsoft Research); Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue (Microsoft Research); Yining Shi (Peking University & Microsoft Research); Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang (Microsoft Research)
Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents
Qizheng Zhang (Stanford University); Ali Imran (Purdue University); Enkeleda Bardhi, Sapienza University of Rome, Tushar Swamy, Nathan Zhang (Stanford University); Muhammad Shahbaz (Purdue University and University of Michigan); Kunle Olukotun (Stanford University)
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications
Yuhan Liu (University of Chicago); Chengcheng Wan (East China Normal University); Kuntai Du, Henry Hoffmann, Junchen Jiang (University of Chicago); Shan Lu (University of Chicago and Microsoft Research); Michael Maire (University of Chicago)

Operating Systems

SquirrelFS: using the Rust compiler to check file-system crash consistency
Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram (University of Texas at Austin)
High-throughput and Flexible Host Networking for Accelerated Computing
Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao (Stanford University); Qizhe Cai, Saksham Agarwal (Cornell University); Jacob Adelmann, David Ahern, Carlo Contavalli, Michael Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton (Enfabrica); Rachit Agarwal (Cornell University); Shrijeet Mukherjee (Enfabrica); Christos Kozyrakis (Stanford University)
IntOS: Persistent Embedded Operating System and Language Support for Multi-threaded Intermittent Computing
Yilun Wu (Stony Brook University); Byounguk Min (Purdue University); Mohannad Ismail, Wenjie Xiong (Virginia Tech); Changhee Jung (Purdue University); Dongyoon Lee (Stony Brook University)
Data-flow Availability: Achieving Timing Assurance in Autonomous Systems
Ao Li, Ning Zhang (Washington University in St. Louis)
Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel
Haibo Chen (Huawei Central Software Institute and Shanghai Jiao Tong University); Xie Miao, Ning Jia, Nan Wang, Yu Li, Nian Liu, Yutao Liu, Fei Wang, Qiang Huang, Kun Li, Hongyang Yang, Hui Wang, Jie Yin, Yu Peng, Fengwei Xu (Huawei Central Software Institute)

Cloud Computing

When will my ML Job finish? Toward providing Completion Time Estimates through Predictability-Centric Scheduling
Abdullah Bin Faisal, Noah Martin, Hafiz Mohsin Bashir, Swaminathan Lamelas, Fahad R. Dogar (Tufts University)
Optimizing Resource Allocation in Hyperscale Datacenters: Scalability, Usability, and Experiences
Neeraj Kumar, Pol Mauri Ruiz, Vijay Menon, Igor Kabiljo, Mayank Pundir, Andrew Newell, Daniel Lee, Liyuan Wang, Chunqiang Tang (Meta Platforms)
ServiceLab: Preventing Tiny Performance Regressions at Hyperscale through Pre-Production Testing
Mike Chow (Meta Platforms); Yang Wang (Meta Platforms and The Ohio State University); William Wang, Ayichew Hailu, Rohan Bopardikar, Bin Zhang, Jialiang Qu, David Meisner, Santosh Sonawane, Yunqi Zhang, Rodrigo Paim, Mack Ward, Ivor Huang, Matt McNally, Daniel Hodges, Zoltan Farkas, Caner Gocmen, Elvis Huang, Chunqiang Tang (Meta Platforms)
MAST: Global Scheduling of ML Training across Geo-Distributed Datacenters at Hyperscale
Arnab Choudhury (Meta Platforms); Yang Wang (Meta Platforms and The Ohio State University); Tuomas Pelkonen (Meta Platforms); Kutta Srinivasan (LinkedIn); Abha Jain, Shenghao Lin, Delia David, Siavash Soleimanifard, Michael Chen, Abhishek Yadav, Ritesh Tijoriwala, Denis Samoylov, Chunqiang Tang (Meta Platforms)

Formal Verification

Automatically Reasoning About How Systems Code Uses the CPU Cache
Rishabh Iyer, Katerina Argyraki, George Candea (EPFL)
VeriSMo: A Verified Security Module for Confidential VMs
Ziqiao Zhou (Microsoft Research); Anjali (University of Wisconsin-Madison); Weiteng Chen, Microsoft Research, Sishuai Gong (Purdue University); Chris Hawblitzel, Weidong Cui (Microsoft Research)
Validating the eBPF Verifier via State Embedding
Hao Sun, Zhendong Su (ETH Zurich)
Using Dynamically Layered Definite Releases for Verifying the RefFS File System
Mo Zou, Dong Du, Mingkai Dong, Institute of Parallel, Distributed Systems, SEIEE (Shanghai Jiao Tong)
Anvil: Verifying Liveness of Cluster Management Controllers
Xudong Sun, Wenjie Ma, Jiawei Tyler Gu, Zicheng Ma (University of Illinois Urbana-Champaign); Tej Chajed (University of Wisconsin-Madison); Jon Howell, Andrea Lattuada, Oded Padon (VMware Research); Lalith Suresh (Feldera); Adriana Szekeres (VMware Research); Tianyin Xu (University of Illinois Urbana-Champaign)

Cloud Security

DSig: Breaking the Barrier of Signatures in Data Centers
Marcos K. Aguilera (VMware Research Group); Clément Burgelin, Rachid Guerraoui, Antoine Murat (EPFL); Athanasios Xygkis (Oracle Labs); Igor Zablotchi (Mysten Labs)
Ransom Access Memories: Achieving Practical Ransomware Protection in Cloud with DeftPunk
Zhongyu Wang, Yaheng Song, Erci Xu, Haonan Wu, Guangxun Tong, Shizhuo Sun, Haoran Li, Jincheng Liu, Lijun Ding, Rong Liu, Jiaji Zhu, Jiesheng Wu (Alibaba Group)
Secret Key Recovery in a Global-Scale End-to-End Encryption System
Graeme Connell (Signal Messenger); Vivian Fang (UC Berkeley); Rolfe Schmidt (Signal Messenger); Emma Dauterman, Raluca Ada Popa (UC Berkeley)
Flock: A Framework for Deploying On-Demand Distributed Trust
Darya Kaviani, Sijun Tan (UC Berkeley); Pravein Govindan Kannan (IBM Research); Raluca Ada Popa (UC Berkeley)

Data Management

FairyWren: A Sustainable Cache for Emerging Write-Read-Erase Flash Interfaces
Sara McAllister, Yucong “Sherry” Wang (Carnegie Mellon University); Benjamin Berg, UNC Chapel Hill, Daniel S. Berger (Microsoft Azure and University of Washington); George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger (Carnegie Mellon University)
Massively Parallel Multi-Versioned Transaction Processing
Shujian Qian, Ashvin Goel (University of Toronto)
Burstable Cloud Block Storage with Data Processing Units
Junyi Shu, School of Computer Science (Peking University and Alibaba Cloud); Kun Qian, Ennan Zhai (Alibaba Cloud); Xuanzhe Liu, Xin Jin, School of Computer Science (Peking University)
Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory
Ming Zhang, Yu Hua, Zhijun Yang, Wuhan National Laboratory for Optoelectronics, School of Computer (Huazhong University of Science and Technology)

Analysis of Correctness

Detecting Logic Bugs in Database Engines via Equivalent Expression Transformation
Zu-Ming Jiang, Zhendong Su (ETH Zurich)
Inductive Invariants That Spark Joy: Using Invariant Taxonomies to Streamline Distributed Protocol Proofs
Tony Nuda Zhang (University of Michigan); Travis Hance (Carnegie Mellon University); Manos Kapritsos (University of Michigan); Tej Chajed (University of Wisconsin–Madison); Bryan Parno (Carnegie Mellon University)
Performance Interfaces for Hardware Accelerators
Jiacheng Ma, Rishabh Iyer, Sahand Kashani, Mahyar Emami, Thomas Bourgeat, George Candea (EPFL)
IronSpec: Increasing the Reliability of Formal Specifications
Eli Goldweber, Weixin Yu, Seyed Armin Vakil Ghahani, Manos Kapritsos (University of Michigan)
Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples
Minwoo Ahn, Jeongmin Han (Sungkyunkwan University); Youngjin Kwon (Korea Advanced Institute of Science and Technology (KAIST)); Jinkyu Jeong (Yonsei University)

ML Scheduling

Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Chaofan Lin (Shanghai Jiao Tong University); Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang (Microsoft Research); Chen Chen (Shanghai Jiao Tong University); Lili Qiu (Microsoft Research)
Usher: Holistic Interference Avoidance for Resource Optimized ML Inference
Sudipta Saha Shubha, Haiying Shen (University of Virginia); Anand Iyer (Georgia Institute of Technology)
Fairness in Serving Large Language Models
Ying Sheng (UC Berkeley and Stanford University); Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, UC Berkeley, Danyang Zhuo (Duke University); Joseph E. Gonzalez, Ion Stoica (UC Berkeley)
MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures
Donglin Zhuang (The University of Sydney); Zhen Zheng (Alibaba Group); Haojun Xia, The University of Sydney, Xiafei Qiu, Junjie Bai, Wei Lin (Alibaba Group); Shuaiwen Leon Song (The University of Sydney)