StanfordLegion · lightsighter · Jan 18, 2023 · Jan 19, 2023 · Jan 19, 2023 · Jan 19, 2023
diff --git a/_config.yml b/_config.yml
@@ -1,11 +1,11 @@
 title:            Legion Programming System
-tagline:          A data-centric approach to parallel programming 
+tagline:          High Productivity High Performance Computing
 description:      Home page for the Legion parallel programming system
 
 # Owner/author information
 owner:
   name:           Legion
-  bio:            A Data-Centric Parallel Programming System
+  bio:            High Productivity High Performance Computing
   email:          #[email protected]
   # Social networking links are used in author-bio sidebar. Update and remove as you like.
   twitter:        

diff --git a/images/analogy.png b/images/analogy.png
diff --git a/images/hphpc.png b/images/hphpc.png
diff --git a/index.md b/index.md
@@ -1,22 +1,84 @@
 ---
-layout: page 
+layout: page
 ---
 
-Legion is a data-centric parallel programming system for
-writing portable high performance programs targeted at
-distributed heterogeneous architectures.  Legion presents
-abstractions which allow programmers to describe properties
-of program data (e.g. independence, locality).  By making the
-Legion programming system aware of the structure of
-program data, it can automate many of the tedious tasks
-programmers currently face, including correctly extracting
-task- and data-level parallelism and moving data around
-complex memory hierarchies.  A novel mapping interface
-provides explicit programmer controlled placement of data 
-in the memory hierarchy and assignment of tasks to processors 
-in a way that is orthogonal to correctness, thereby enabling 
-easy porting and tuning of Legion applications to new 
-architectures.
+## Legion: High-Productivity High-Performance Computing ##
+
+The vast majority of all programs are sequential. Programmers are inherently
+productive when developing sequential code because they can construct more
+powerful programs simply by composing functionality from one or more software modules (e.g. libraries) 
+in serial without worrying about parallelism, data coherence, or synchronization. 
+The productivity engendered by this facet of sequential programming is vital to the 
+success of many popular software ecosystems such as Python, R, and MATLAB.
+However, the implementations of these environments struggle to achieve high performance 
+on parallel and distributed hardware without resorting to explicit parallelism. 
+Ideally users want to write programs in a high
+productivity sequential programming model and have those programs automatically executed with high performance on 
+parallel hardware. Achieving this end requires the development of a nuanced programming model and
+sophisticated programming systems capable of analyzing and transforming sequential programs into parallel programs.
+
+![High Productivity High Performance Computing](images/hphpc.png)
+
+Fortunately, there already exist
+[many](https://en.wikipedia.org/wiki/Tomasulo%27s_algorithm) 
+[well](https://en.wikipedia.org/wiki/Very_long_instruction_word) 
+[known](https://en.wikipedia.org/wiki/Register_renaming) 
+[techniques](https://en.wikipedia.org/wiki/Speculative_execution) 
+[for](https://en.wikipedia.org/wiki/Instruction_pipelining)
+[implicitly](https://en.wikipedia.org/wiki/Superscalar_processor)
+[parallelizing](https://en.wikipedia.org/wiki/Out-of-order_execution) 
+sequential programs to target parallel hardware. 
+However, in most systems these algorithms are currently only deployed to exploit
+fine-grained instruction-level parallelism. The primary thesis of the Legion project is 
+that these same techniques can and should be deployed hierarchically at coarser granularities 
+in software to leverage modern parallel hardware (multi-core CPUs, GPUs, supercomputers, etc.)
+without compromising the productivity of developing sequential programs.
+
+The basis for this thesis rests upon the fundamental observation that implicitly mapping 
+sequential programs onto parallel hardware looks similar at many different scales.
+At the finest granularity, hardware or compilers can extract parallelism from a stream of 
+instructions by analyzing register usage and mapping independent 
+instructions onto parallel hardware units. The same principles apply when extracting parallelism
+from a stream of demarcated functions called *tasks* operating on *logical regions* of data to map
+onto the parallel execution units inside of a workstation or a supercomputer 
+for different granularities of tasks and regions. (Legion derives its name from the concatenation
+of the words in 'logical region'.)
+
+![Implicit Parallelism Analogy](images/analogy.png)
+
+This analogy forms the basis of the Legion project, and its two primary software
+artifacts can be understood as direct analogs to existing systems. The Legion
+runtime endeavors to be a full reimplementation of a pipelined out-of-order superscalar processor
+in software for dynamically exploiting task-parallelism from a stream of tasks
+generated by the execution of a sequential program. Similarly, the Regent compiler
+strives to be an optimizing compiler, performing static analyses and transformations
+of programs at the coarser granularity of tasks before mapping them onto the Legion runtime.
+Armed with these systems that automatically parallelize and distribute sequential programs,
+we aim to facilitate the creation of high productivity high performance computing ecosystems
+so that all users can leverage modern massively parallel machines. 
+
+#### The Key Ideas ####
+
+In order to realize the above vision, the Legion project has developed several novel technologies:
+
+* A [dynamic data model](/pdfs/oopsla2013.pdf) that is flexible enough for tasks to 
+  dynamically specify arbitrary working set regions and the effects they will have on those regions.
+* The ability to dynamically compute [mathematical relationships](/pdfs/dpl2016.pdf) between regions,
+  and an [auto-parallelizing framework](/pdfs/parallelizer2019.pdf) for synthesizing them.
+* A [dynamic dependence and distributed coherence analysis](/pdfs/visibility2023.pdf) based
+  on algorithms and data structures from ray tracing to handle arbitrary aliasing of regions.
+* A technique called *control replication* that decouples 
+  task creation from execution to avoid sequential bottlenecks
+  (with both [static](/pdfs/cr2017.pdf) and [dynamic](/pdfs/dcr2021.pdf) incarnations).
+* A [scale-free programming model](/pdfs/idx2021.pdf) that encourages the development
+  of programs that can be ported to run on machines of different sizes without modification.
+* A *mapping interface* for [decoupling correctness from performance](/pdfs/sc2012.pdf})
+  and thereby guaranteeing performance portability of programs.
+
+Many of these ideas are intertwined and resonate with each other in the design
+and we encourage you to explore them further.
+
+#### Get Started ####
 
 To learn more about Legion you can:
 
@@ -25,7 +87,7 @@ To learn more about Legion you can:
  * Download our [publications]({{ "/publications/" | relative_url }})
  * Ask questions on our [mailing list]({{ "/community/" | relative_url }})
 
-#### About Legion ####
+#### Acknowledgments ####
 
 Legion is developed as an open source project, with major
 contributions from [LANL](https://www.lanl.gov/),