Break out semantics to library #1619

disconcision · 2025-04-17T23:49:01Z

This is a speculative breaking semantics out of core. Wanted to see how clean it was. Answer is: better than I thought, but not perfect.

Semantics contains what you'd expect, I think. Borderline cases are things like Operators which are maybe more aware of the surface syntax than is strictly seemly.

Haz3lcore is a bit more fraught. Currently it contains:

tylrCore proper (Editor.re and dependencies)
maketerm + expToSeg (Interface between tylr and semantics)
TyDi (most logic could live in Semantics, but not sure if appropriate. I feel @7h3kk1d's Introduce is in a similar boat)
projectors

@Negabinary at this point I'm wondering if our first-order move here for splices is proceed with this PR, removing haz3lcore entirely and just moving all contents to haz3lweb, most but not all to a new tylr subfolder, but leaving the resolving the tylr/projectors API to future work.

TODO:

rm unnecessary deps from semantics

7h3kk1d · 2025-04-18T00:37:43Z

I would like for introduce to live on the ast but there's some issues with cursor position for empty containers. To where if you get the ID from an expression in the AST and move to that in the zipper the cursor is outside of it. i.e on empty lists the cursor ends up outside the list. The only way I can see to address it is by having a path/zipper on the AST and a translation back to the tylr zipper.

Sidenote: i think if we remove the is dependency for this the property tests will be a bit faster and debuggers will work.

disconcision · 2025-04-18T01:13:22Z

yeah i have the same issue with TyDi. there might be a slightly more lightweight option than full paths, e.g. term->term transformations plus an additional value representing an existing structural movement action to take after replacement, e.g. whether the caret should advance to the first hole after replacement.

7h3kk1d · 2025-04-18T01:16:31Z

e.g. whether the caret should advance to the first hole after replacement.

That's what I had done for introduce but Cyrus didn't like introduce on lists being singleton lists and otherwise there's no hole. Either way you have the same issue on empty strings. Maybe go to empty lists and string should go inside but that also seems sketchy. It's currently a flag to go left once

disconcision · 2025-04-18T02:36:19Z

I guess you could still abstract this; identify a more general notion of 'go to next point of interest', in the sense of per form defined ~ canonical cursor positions to effectuate structured transformations (eg middle of token for [|] to do [] => [?|]). (for tydi I went worse to tho, I just did all syntax construction as strings).

codecov · 2025-04-18T03:11:11Z

Codecov Report

Attention: Patch coverage is 6.31579% with 89 lines in your changes missing coverage. Please review.

Project coverage is 46.18%. Comparing base (7997def) to head (30f562d).

Files with missing lines	Patch %	Lines
src/semantics/term/Sort.re	0.00%	23 Missing ⚠️
src/haz3lcore/zipper/action/Select.re	0.00%	11 Missing ⚠️
src/haz3lcore/TyDi/TyDi.re	0.00%	8 Missing ⚠️
src/semantics/statics/Ctx.re	0.00%	7 Missing ⚠️
src/haz3lcore/lang/MakeTerm.re	0.00%	5 Missing ⚠️
src/haz3lcore/TyDi/TyDiForms.re	0.00%	4 Missing ⚠️
src/haz3lcore/TyDi/TyDiSuggestion.re	0.00%	4 Missing ⚠️
src/haz3lcore/TyDi/TyDiCtx.re	0.00%	3 Missing ⚠️
src/haz3lcore/projectors/ProjectorBase.re	0.00%	3 Missing ⚠️
src/haz3lcore/zipper/action/Indicated.re	0.00%	3 Missing ⚠️
... and 10 more

Additional details and impacted files

@@                 Coverage Diff                 @@
##           projector-gadts    #1619      +/-   ##
===================================================
+ Coverage            46.05%   46.18%   +0.13%     
===================================================
  Files                  126      126              
  Lines                14046    13991      -55     
===================================================
- Hits                  6469     6462       -7     
+ Misses                7577     7529      -48

Files with missing lines	Coverage Δ
src/haz3lcore/derived/Measured.re	`47.54% <ø> (ø)`
src/haz3lcore/derived/TermMap.re	`100.00% <100.00%> (ø)`
src/haz3lcore/derived/TermRanges.re	`93.75% <ø> (ø)`
src/haz3lcore/derived/TileMap.re	`100.00% <ø> (ø)`
src/haz3lcore/lang/Precedence.re	`92.00% <ø> (-2.00%)`	⬇️
src/haz3lcore/pretty/ExpToSegment.re	`75.57% <ø> (ø)`
src/haz3lcore/projectors/ProjectorInfo.re	`18.18% <ø> (ø)`
src/haz3lcore/projectors/ProjectorInit.re	`11.11% <ø> (ø)`
...c/haz3lcore/projectors/implementations/CardProj.re	`0.35% <ø> (ø)`
...z3lcore/projectors/implementations/CheckboxProj.re	`0.00% <ø> (ø)`
... and 68 more

... and 1 file with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

7h3kk1d · 2025-04-18T13:14:10Z

Yeah, I'd have to think more about how it would look. Maybe there's only a single position for most forms.

7h3kk1d

This all seemed surprisingly unproblematic. Not sure how it fits into the longer term projector work and I would like to see the semantics and AST be split out as much as possible in the future but this seems like a positive improvement.

7h3kk1d · 2025-04-18T13:16:35Z

src/haz3lcore/lang/Sort.re

-  | Typ => "type"
-  | Rul => "rule"
-  | Exp => "expression";
+include Semantics.Sort;


Are we planning on these diverging or is this just just an alias?

just an alias, as Sort is used hundreds of times across tylr. not sure what best practices here should be; i definitely do this reluctantly. I only aliased Sort and Id in this manner

7h3kk1d · 2025-04-18T13:17:19Z

src/haz3lcore/projectors/ProjectorBase.re

+  statics: option(Semantics.Statics.Info.t),
  /* Dynamic information about the syntax including
   * live values of the syntax. Dynamics may be
   * disabled by the user; this case (None) must be
   * handled by projector authors */
-  dynamics: option(Dynamics.Info.t),
+  dynamics: option(Semantics.Dynamics.Info.t),


You shouldn't need to namespace this given the open above.

7h3kk1d · 2025-04-18T13:24:43Z

src/haz3lmenhir/AST.re

@@ -183,7 +183,8 @@ let gen_constructor_ident: QCheck.Gen.t(string) =
    let* leading = char_range('A', 'Z');
    let+ tail = string_size(~gen=char_range('a', 'z'), int_range(1, 4));
    let ident = String.make(1, leading) ++ tail;
-    if (List.exists(a => a == ident, Haz3lcore.Form.base_typs)) {
+    //TODO(andrew): copied list of base types below to remove Form dep...
+    if (List.exists(a => a == ident, ["String", "Int", "Float", "Bool"])) {


I think we should reconsider this as well since (from what I recall) those are excluded as constructor names in parsing and I think that should happen in statics if anything. This doesn't happen for other non-base types.

how would you handle this for the moment? remove this and leave it to statics to decide?
(seems reasonable to defer as much as possible to statics; the proximate situation is going to be that all of tylr, semantics, and the parser all implicitly depend on a shared but non-materialized language definition. i don't think we have an easy path for actually abstracting this is the implementation. this particular one (base types) already has multiple divergences in the implementation (i counted three different sets of base types used in different places))

This needs to be consistent with the tylr version. Im fine with it being either hard coded here or a list defined in semantics somewhere. If we define it in semantics can't all 3 libraries use the same defn?

7h3kk1d · 2025-04-18T13:31:06Z

src/haz3lcore/Editor.re

-    settings.flip_animations && Action.should_animate(a)
-      ? Animation.request([Animation.Actions.move("caret")]) : ();


I know this is happening on perform now, but can I get some context.

the animation system I created requires (a) capturing DOM state before an action is triggered, (b) storing that state so that it is persisted into the next MVU loop, and (c) capturing DOM state after the action has been formed, after the DOM is regenerated, but before draw occurs. (a) was done here; now i've moved it slightly so that Animation can live in Web. (b) happens in Main.re.

The reason why it was here to begin with (aside from the lazy fact that core has web dep due to projectors), is that in general we want to trigger animations based on the particular action taken, possibly with explicit dependence on both the before and after state of the model. I'm continually torn on whether effects like this should be handled as part of an entirely separate switch; i like the separation of concerns, but also think that it's not terrible to be forced to think about the effect when you change an action. I had started moving in the separation direction, but ended up punting, forgetting about it, and leaving it in an intermediate state. I think now that the effect should probably be abstracted to the Action.should_animate function, which should take the before and after models as well as the action, and produce an effect.

7h3kk1d · 2025-04-18T13:39:34Z

src/haz3lweb/exercises/Exercise.re

@@ -87,7 +87,7 @@ type pos =
 type spec = p(Zipper.t);


This file or at least locally below might benefit from opening Semantics

7h3kk1d · 2025-04-18T13:50:49Z

src/semantics/term/Grammar.re

My biggest question with the PR is the what the medium term plan for semantics is. Naming-wise it's a bit confusing semantics defines the AST of the language. I would expect it to just be typechecking, elaboration, dynamics. But that's a huge change.

we could put term files in a separate target, but not clear to me what we gain from that at the moment. i look at this change as a first-order movement to obtain some basic directional separation between hazel-the-language and hazel-the-editor, motivated partially by thinking through clarifying projector versus tylr dependencies. alternatively, we could call this library language.

more broadly, i'm trying to iteratively clarify the overall structure of the codebase. it's not obvious to be the current web/core division makes sense, even in principle. increasingly i feel like we're going to want multiple components which can interact with the 'language server', and theses components may have tightly integrated backends and frontends, where the backend refers to some edit action calculi specific to how their UI is surfaced to the user. these components might reasonably decide to divide internally into front and back build targets, particularly if they want to support multiple frontends, but this feels different to be than enforcing a front-back distinction on the codebase entire. but the big incoming question i think is how grove fits in

I think language ic clearer but I might be being pedantic.

we could put term files in a separate target, but not clear to me what we gain from that at the moment.

I don't think it's tenable/necessary know but long term I think the menhir parser, and tylr could only depend on the AST and not the semantics. It's also beneficial from an enforced separation of concerns standpoint.

it's not obvious to be the current web/core division makes sense, even in principle. increasingly i feel like we're going to want multiple components which can interact with the 'language server', and theses components may have tightly integrated backends and frontends, where the backend refers to some edit action calculi specific to how their UI is surfaced to the user. these components might reasonably decide to divide internally into front and back build targets, particularly if they want to support multiple frontends, but this feels different to be than enforcing a front-back distinction on the codebase entire. but the big incoming question i think is how grove fits in

I agree there's multiple concerns here. My main medium term priority is to have a build of tylr/semantics/syntax minus projectors so that we can run the non-web parts of hazel without the js of ocaml requirement. I don't think it requires fully separating the frontend/backend divide. For any components where we can inject their dependencies so that they can be tested or run on the backend.

i went through and update dependencies somewhat; a lot of the tests depend purely on semantics now. probably breaking makterm and expToSeg out would do most of the rest?

7h3kk1d · 2025-04-18T13:51:40Z

src/semantics/term/Sort.re

+let consistent = (s, s') =>
+  switch (s, s') {
+  | (Any, _)
+  | (_, Any) => true
+  | _ => s == s'
+  };


Can we just add deriving eq to t and make consistent equal to equals?

7h3kk1d · 2025-04-18T13:53:36Z

src/semantics/term/Sort.re

+let to_string =
+  fun
+  | Any => "Any"
+  | Pat => "Pat"
+  | TPat => "TPat"
+  | Typ => "Typ"
+  | Rul => "Rul"
+  | Exp => "Exp";


We can derive this using ppx_variants_conv.

7h3kk1d · 2025-04-18T13:55:18Z

test/Test_Menhir.re

-open Haz3lcore;
+open Semantics;


Is there anything clashing if we open both?

7h3kk1d · 2025-04-18T13:57:30Z

src/haz3lcore/pretty/ExpToSegment.re

+//TODO(andrew): ...
+module Secondary2 = Secondary;
+open Semantics;
+module Secondary = Secondary2;


Is this a intermediate step while you're seeing if you can remove Semantics.Secondary? Otherwise I don't see why they can't be aliases.

break out semantics to library

543d234

disconcision requested review from 7h3kk1d and Negabinary April 17, 2025 23:49

rm unused/redundant build libs

30f562d

7h3kk1d reviewed Apr 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Break out semantics to library #1619

Break out semantics to library #1619

disconcision commented Apr 17, 2025 •

edited

Loading

7h3kk1d commented Apr 18, 2025

disconcision commented Apr 18, 2025 •

edited

Loading

7h3kk1d commented Apr 18, 2025 •

edited

Loading

disconcision commented Apr 18, 2025

codecov bot commented Apr 18, 2025

7h3kk1d commented Apr 18, 2025

7h3kk1d left a comment

7h3kk1d Apr 18, 2025

disconcision Apr 18, 2025

7h3kk1d Apr 18, 2025

7h3kk1d Apr 18, 2025

disconcision Apr 18, 2025

7h3kk1d Apr 18, 2025

7h3kk1d Apr 18, 2025

disconcision Apr 18, 2025

7h3kk1d Apr 18, 2025

7h3kk1d Apr 18, 2025

disconcision Apr 18, 2025

7h3kk1d Apr 18, 2025

disconcision Apr 18, 2025

7h3kk1d Apr 18, 2025 •

edited

Loading

7h3kk1d Apr 18, 2025

7h3kk1d Apr 18, 2025

7h3kk1d Apr 18, 2025

		settings.flip_animations && Action.should_animate(a)
		? Animation.request([Animation.Actions.move("caret")]) : ();

Break out semantics to library #1619

Are you sure you want to change the base?

Break out semantics to library #1619

Conversation

disconcision commented Apr 17, 2025 • edited Loading

7h3kk1d commented Apr 18, 2025

disconcision commented Apr 18, 2025 • edited Loading

7h3kk1d commented Apr 18, 2025 • edited Loading

disconcision commented Apr 18, 2025

codecov bot commented Apr 18, 2025

Codecov Report

7h3kk1d commented Apr 18, 2025

7h3kk1d left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

7h3kk1d Apr 18, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

disconcision commented Apr 17, 2025 •

edited

Loading

disconcision commented Apr 18, 2025 •

edited

Loading

7h3kk1d commented Apr 18, 2025 •

edited

Loading

7h3kk1d Apr 18, 2025 •

edited

Loading