Papers

Here are sample papers for you to read and present. You are welcome to present a paper that is not on the list, but please email me first to discuss its suitability.

Hardware Consistency and Coherence

Spandex: A Flexible Interface for Efficient Heterogeneous Coherence

Architecture and Design of AlphaServer GS320

Efficient GPU synchronization without scopes: saying no to complex consistency models

DeNovo: Rethinking the Memory Hierarchy for Disciplined Parallelism

Efficient sequential consistency via conflict ordering

Tardis: Time Traveling Coherence Algorithm for Distributed Shared Memory

Non-Speculative Load-Load Reordering in TSO

Cohort: Software-Oriented Acceleration for Heterogeneous SoCs

Exploring Memory Consistency for Massively-Threaded Throughput-Oriented Processors

Specifying Memory Models

Herding Cats: Modelling, Simulation, Testing, and Data Mining for Weak Memory

x86-TSO: A Rigorous and Usable Programmer’s Model for x86 Multiprocessors

GPU Concurrency: Weak Behaviours and Programming Assumptions

A Formal Analysis of the NVIDIA PTX Memory Consistency Model

The semantics of shared memory in Intel CPU/FPGA systems

Compound Memory Models

A Promising Semantics for Relaxed-Memory Concurrency

Validating Memory Consistency and Coherence

PipeCheck: Specifying and Verifying Microarchitectural Enforcement of Memory Consistency Models

CCICheck: Using µhb Graphs to Verify the Coherence-Consistency Interface

PerpLE: Improving the Speed and Effectiveness of Memory Consistency Testing

GPUHarbor: Testing GPU Memory Consistency at Large

MC Mutants: Evaluating and Improving Testing for Memory Consistency Specifications

Shared Memory Consistency Protocol Verification Against Weak Memory Models: Refinement via Model-Checking

Verification of a lazy cache coherence protocol against a weak memory model

TLA+ Verification of Cache-Coherence Protocols

Synthesizing Coherence Protocols

Teapot: Language Support for Writing Memory Coherence Protocols

TRANSIT: Specifying Protocols with Concolic Snippets

ProtoGen: Automatically Generating Directory Cache Coherence Protocols from Atomic Specifications

HeteroGen: Automatic Synthesis of Heterogeneous Cache Coherence Protocols

Memory Persistency

Memory Persistency

Delegated persist ordering

Better I/O Through Byte-Addressable, Persistent Memory

Efficient Persist Barriers for Multicores

ATOM: Atomic Durability in Non-volatile Memory through Hardware Logging

GPM: Leveraging Persistent Memory from a GPU

Scoped Buffered Persistency Model for GPUs

Persistent Processor Architecture

Distributed Data Persistency

Key-Value-Stores Consistency

FaRM: Fast Remote Memory

Hermes: a Fast, Fault-Tolerant and Linearizable Replication Protocol

Kite: Efficient and Available Release Consistency for the Datacenter

Microsecond Consensus for Microsecond Applications

Odyssey: The Impact of Modern Hardware on Strongly-Consistent Replication Protocols

CliqueMap: Productionizing an RMA-Based Distributed Caching System

Epoxy: ACID Transactions Across Diverse Data Stores

PolarDB-SCC: A Cloud-Native Database Ensuring Low Latency for Strongly Consistent Reads

Durable Functions: Semantics for Stateful Serverless

SwiSh: Distributed Shared State Abstractions for Programmable Switches

Amazon DynamoDB: A Scalable, Predictably Performant, and Fully Managed NoSQL Database Service

μManycore: A Cloud-Native CPU for Tail at Scale

Pond: CXL-Based Memory Pooling Systems for Cloud Platforms