Papers
Here are sample papers for you to read and present. You are welcome to present a paper that is not on the list, but please email me first so we can discuss its suitability.
Hardware Consistency and Coherence
Spandex: A Flexible Interface for Efficient Heterogeneous Coherence
Architecture and Design of AlphaServer GS320
DeNovo: Rethinking the Memory Hierarchy for Disciplined Parallelism
Efficient sequential consistency via conflict ordering
Tardis: Time Traveling Coherence Algorithm for Distributed Shared Memory
Non-Speculative Load-Load Reordering in TSO
Cohort: Software-Oriented Acceleration for Heterogeneous SoCs
Specifying Memory Models
x86-TSO: A Rigorous and Usable Programmer’s Model for x86 Multiprocessors
GPU Concurrency: Weak Behaviours and Programming Assumptions
A Formal Analysis of the NVIDIA PTX Memory Consistency Model
The semantics of shared memory in Intel CPU/FPGA systems
Compound Memory Models
A Promising Semantics for Relaxed-Memory Concurrency
Validating Memory Consistency and Coherence
CCICheck: Using µhb Graphs to Verify the Coherence-Consistency Interface
GPUHarbor: Testing GPU Memory Consistency at Large
TLA+ Verification of Cache-Coherence Protocols
Synthesizing Coherence Protocols
Teapot: Language Support for Writing Memory Coherence Protocols
TRANSIT: Specifying Protocols with Concolic Snippets
HeteroGen: Automatic Synthesis of Heterogeneous Cache Coherence Protocols
Memory Persistency
Memory Persistency
Delegated persist ordering
Better I/O Through Byte-Addressable, Persistent Memory
Efficient Persist Barriers for Multicores
ATOM: Atomic Durability in Non-volatile Memory through Hardware Logging
GPM: Leveraging Persistent Memory from a GPU
Scoped Buffered Persistency Model for GPUs
Persistent Processor Architecture
Distributed Data Persistency
Key-Value Store Consistency
FaRM: Fast Remote Memory
Hermes: a Fast, Fault-Tolerant and Linearizable Replication Protocol
Kite: Efficient and Available Release Consistency for the Datacenter
Microsecond Consensus for Microsecond Applications
CliqueMap: Productionizing an RMA-Based Distributed Caching System
Epoxy: ACID Transactions Across Diverse Data Stores
Durable Functions: Semantics for Stateful Serverless
SwiSh: Distributed Shared State Abstractions for Programmable Switches
μManycore: A Cloud-Native CPU for Tail at Scale
Pond: CXL-Based Memory Pooling Systems for Cloud Platforms