Check set.mm LaTeX in CI pipeline #3100

david-a-wheeler · 2023-03-17T17:09:46Z

Generate a list of symbols in TeX format, then try to generate a PDF. This will check if every symbol is valid TeX.

I don't expect this specific commit to succeed, but hopefully it'll help us see what's left to do.

Generate a list of symbols in TeX format, then try to generate a PDF. This will check if every symbol is valid TeX. I don't expect this specific commit to succeed, but hopefully it'll help us see what's left to do. Signed-off-by: David A. Wheeler <[email protected]>

Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler · 2023-03-17T17:23:56Z

Currently this fails on:

! LaTeX Error: File `phonetic.sty' not found.

Is there an Ubuntu package with phonetic.sty? Maybe adding this would help: texlive-fonts-extra

david-a-wheeler · 2023-03-17T17:26:08Z

@GinoGiotto @benjub @wlammen - eventually I'd like the CI pipeline to call the script & see if the LaTeX works. Then we'll be immediately notified of future problems when they happen. Suggestions on how to "make this work" are welcome.

david-a-wheeler · 2023-03-17T17:27:11Z

This is related to #3099

tirix · 2023-03-17T23:55:39Z

Did you check out those Github actions @david-a-wheeler ?

benjub · 2023-03-18T12:03:02Z

Is there an Ubuntu package with phonetic.sty? Maybe adding this would help: texlive-fonts-extra

Looks like it:

$ apt-file search phonetic.sty
texlive-fonts-extra: /usr/share/texlive/texmf-dist/tex/latex/phonetic/phonetic.sty
texlive-fonts-extra-doc: /usr/share/doc/texlive-doc/fonts/phonetic/209/phonetic.sty.gz

Signed-off-by: David A. Wheeler <[email protected]>

benjub · 2023-03-18T18:21:30Z

you can also apt get texlive-science:

$ apt-file search accents.sty
latexml: /usr/share/perl5/LaTeXML/Package/accents.sty.ltxml
texlive-latex-recommended: /usr/share/texlive/texmf-dist/tex/latex/lwarp/lwarp-accents.sty
texlive-science: /usr/share/texlive/texmf-dist/tex/latex/accents/accents.sty

Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler · 2023-03-19T18:45:12Z

@benjub :
you can also apt get texlive-science:

Thanks! That's better than trying to download it multiple different ways (with different versions). I think we're homing into a useful LaTeX test. Once we test every proposed change, we'll be much more likely to notice problems :-).

Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler · 2023-03-19T18:50:12Z

The latest revision hasn't finished running, but I can already tell you where it's going to fail:

LaTeX Warning: Command \L invalid in math mode on input line 982.

Can our TeX-ers fix that?

benjub · 2023-03-19T21:05:50Z

The latest revision hasn't finished running, but I can already tell you where it's going to fail:

LaTeX Warning: Command \L invalid in math mode on input line 982.

Can our TeX-ers fix that?

Hard to tell without seeing the file that caused the warning. If this is because of the latexdef of @tirix's df-rag, then probably \L should be changed to \llcorner. (There are more symbols from other packages, but as I wrote in ~conventions, we prefer to use symbols from core packages even if they look slightly less nice.)

david-a-wheeler · 2023-03-19T21:47:52Z

The problematic generated line is:

\verb!raG! & $\L\mathrm{G}$ \\

So yes, we need to change its latex representation at line 436180:

htmldef "raG" as "&#8735;G"; /* &angrt; fails validator so use its code */
  althtmldef "raG" as "&#8735;G"; /* &angrt; fails validator so use its code */
  latexdef "raG" as "\L\mathrm{G}";

I'll create a separate pull request to do that. Then we may be able to generate all TeX symbols without problems... and keep them that way.

@benjub

Currently we have only one identified problem in the generated TeX table: `\L` isn't allowed in math mode. Switching to `\\corner` should fix this (per @benjub). This should fix the remaining problem in creating the LaTeX CI pipeline check in #3100. Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler · 2023-03-19T21:51:34Z

See #3104 which should fix the remaining known problem.

@benjub

Currently we have only one identified problem in the generated TeX table: `\L` isn't allowed in math mode. Switching to `\\corner` should fix this (per @benjub). This should fix the remaining problem in creating the LaTeX CI pipeline check in #3100. Signed-off-by: David A. Wheeler <[email protected]>

.github/workflows/verifiers.yml

benjub · 2023-03-19T22:57:44Z

By the way: why not do this in a separate job of the .yml ?

We really depend on LaTeX, not just TeX, so make that clear in the comments about verification. Signed-off-by: David A. Wheeler <[email protected]>

Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler · 2023-03-19T23:36:33Z

By the way: why not do this in a separate job of the .yml ?

We could, but we rebuild metamath-exe from source code. If we did this in a separate job we'd need to rebuild metamath-exe twice. I thought it'd be simpler to do it in a single job.

benjub · 2023-03-19T23:47:13Z

We could, but we rebuild metamath-exe from source code. If we did this in a separate job we'd need to rebuild metamath-exe twice. I thought it'd be simpler to do it in a single job.

Isn't the LaTeX part independent of metamath.c ? I don't see where it is used in the part you added ?

david-a-wheeler · 2023-03-20T00:33:00Z

Isn't the LaTeX part independent of metamath.c ? I don't see where it is used in the part you added ?

If this is the only LaTeX test, I agree. But I'm expecting to use metamath to generate some LaTeX proofs and test this too (not done yet). So it makes sense to pit them here.

GinoGiotto · 2023-03-20T18:56:27Z

I'm expecting to use metamath to generate some LaTeX proofs and test this too (not done yet)

If it's possible I would suggest to randomize the process (check different proofs each time), to increase the probability of spotting bugs in the future.

david-a-wheeler · 2023-03-20T19:15:44Z

@GinoGiotto - it's worthy of discussion. The risk is that the tests become "flapping" tests, that is, appear to randomly fail. If you rerun a test & it succeeds, it won't be obvious there's a problem.

We could pick statement number 1 + ( (hash of .mm file) mod (number of statements) ). Then every change to the database would pick a specific statement to test, but it'd be the same one until the next change.

GinoGiotto · 2023-03-20T21:19:35Z

The risk is that the tests become "flapping" tests, that is, appear to randomly fail. If you rerun a test & it succeeds, it won't be obvious there's a problem.

One approach could be to design a process where, in case of failure, the system repeats the same tests as before, and only resumes random testing once the problem is solved. But to be honest I don't know how it would be possible to implement such a mechanism.

jkingdon · 2023-03-23T05:55:30Z

If it's possible I would suggest to randomize the process (check different proofs each time), to increase the probability of spotting bugs in the future.

Although fuzz testing or generative testing[1] or the like can be helpful,[2] I don't think it is good idea to be including tests containing randomness in our github checks, for the reasons @david-a-wheeler gives. It just becomes too hard to know whether a test failure was caused by a particular commit or pull request that way.

[1] see for example https://medium.com/geckoboard-under-the-hood/how-generative-testing-changed-the-way-we-qa-geckoboard-b4a48a193449

[2] well at least in the abstract, I'm not sure I have more answers than anyone else about what to advise someone who is thinking of writing such tests, in terms of how to run these tests (manual? automated? under what circumstances?)

GinoGiotto · 2023-03-23T13:06:32Z

A different approach could be to simply test very long proofs. They are usually more likely to contain bugs (e.g. this bug was very common among long proofs metamath/metamath-exe#129). The disadvantage is that testing could be quite slow tho.

david-a-wheeler · 2023-03-23T19:33:18Z

Fuzzing is awesome for security vulnerability detection, but I don't think that's the goal here. I think we want predictable tests and ones that don't take too long within our CiCD process. Ipdflatex is slow. still think this compromise would do it:

We could pick statement number 1 + ( (hash of .mm file) mod (number of statements) ). Then every change to the database would pick a specific statement to test, but it'd be the same one until the next change.

We could pick several instead of just one. The point is that it would pick "randomly" for a particular version of set.mm, but it'd be predictable for any particular version. That would ensure that re-running the tests for a particular version of set.mm would run the same tests.

We could pick separately create a set of "long" ones & pick just one.

We could separately have a process that runs PDF generation on all proofs, but I'd expect that to take 24hours or so. Something you'd want to do separately.

icecream17 · 2023-03-30T12:11:19Z

If all proofs were checked, maybe another solution would be to just pick the latest statement. That would be more likely to correlate with any pr's changes.

david-a-wheeler · 2023-03-30T12:18:48Z

If all proofs were checked, maybe anothet r solution would be to just pick the latest statement. That would be more likely to correlate with any pr's changes.

That's true, but that means that you have to compute "what is the most recent statement". That requires the CI pipeline to use a process to compare the differences over time, and that's more complex than something that uses only the proposed last state.

Also what's "most recent" is a little more complex - that depends on the branch you're on (sometimes multiple different branches can end up at the same place).n That's less of a big deal but worth mentioning.

icecream17 · 2023-03-31T10:58:44Z

I was thinking about using the date in the tags of a theorem, which is statically findable, although possibly not actually the most recent theorem now that I think about it

david-a-wheeler · 2023-04-02T17:21:54Z

I was thinking about using the date in the tags of a theorem, which is statically findable, although possibly not actually the most recent theorem now that I think about it

That's a good idea. That's probably close enough for our purposes.

david-a-wheeler · 2023-04-23T18:25:25Z

All: there are many other things we could do with LaTeX, but let's merge this PR as a starting point. It at least ensures that people won't create new definitions/symbols with broken LaTeX, and that's a starting point.

Do have a +1 from anyone?

This reverts commit 3e6b9b8.

david-a-wheeler marked this pull request as draft March 17, 2023 17:11

Install TeX before using it

d3a6ed5

Signed-off-by: David A. Wheeler <[email protected]>

This was referenced Mar 17, 2023

Fixed Latexdefs #3099

Merged

Malfunctioning Latexdefs left to fix #3091

Closed

GinoGiotto mentioned this pull request Mar 17, 2023

Updated TeX packages metamath/metamath-exe#133

Merged

david-a-wheeler added 4 commits March 18, 2023 13:20

Add a package to load phonetic.sty

4fa9121

Signed-off-by: David A. Wheeler <[email protected]>

Add LaTeX "accents" package from CTAN.

74525a7

Signed-off-by: David A. Wheeler <[email protected]>

Initialize tlmgr so we can use it to install LaTeX packages

d2c6d89

Signed-off-by: David A. Wheeler <[email protected]>

Install LaTeX package manually into ~/texmf

4df551d

Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler added 3 commits March 19, 2023 14:35

Install "accents.sty" from Debian repo

5b18437

Signed-off-by: David A. Wheeler <[email protected]>

Make pdflatex check fail if something is "invalid"

ba24bfe

Signed-off-by: David A. Wheeler <[email protected]>

Merge branch 'develop' into check_latex

a425d72

Fix misspelling of "grep"

cef2ce8

Signed-off-by: David A. Wheeler <[email protected]>

david-a-wheeler mentioned this pull request Mar 19, 2023

Change df-ran TeX from \L to \\lcorner #3104

Merged

Merge branch 'develop' into check_latex

44eb307

david-a-wheeler marked this pull request as ready for review March 19, 2023 22:50

benjub reviewed Mar 19, 2023

View reviewed changes

.github/workflows/verifiers.yml Outdated Show resolved Hide resolved

benjub reviewed Mar 19, 2023

View reviewed changes

.github/workflows/verifiers.yml Outdated Show resolved Hide resolved

david-a-wheeler added 2 commits March 19, 2023 18:58

Change comments to be clear TeX -> LaTeX

b1b827b

We really depend on LaTeX, not just TeX, so make that clear in the comments about verification. Signed-off-by: David A. Wheeler <[email protected]>

Fix grep of 'invalid' for LaTeX output

202a845

Signed-off-by: David A. Wheeler <[email protected]>

jkingdon approved these changes Apr 23, 2023

View reviewed changes

jkingdon merged commit 3e6b9b8 into develop Apr 24, 2023

jkingdon deleted the check_latex branch April 24, 2023 01:10

jkingdon added a commit that referenced this pull request Apr 24, 2023

Revert "Check set.mm LaTeX in CI pipeline (#3100)"

a5e149e

This reverts commit 3e6b9b8.

jkingdon mentioned this pull request Apr 24, 2023

Revert "Check set.mm LaTeX in CI pipeline" #3134

Closed

jkingdon added a commit to jkingdon/set.mm that referenced this pull request Apr 25, 2023

Revert "Check set.mm LaTeX in CI pipeline (metamath#3100)"

7b47156

This reverts commit 3e6b9b8.

benjub mentioned this pull request Oct 14, 2023

LaTeX CI Checks #3079

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check set.mm LaTeX in CI pipeline #3100

Check set.mm LaTeX in CI pipeline #3100

david-a-wheeler commented Mar 17, 2023

david-a-wheeler commented Mar 17, 2023

david-a-wheeler commented Mar 17, 2023

david-a-wheeler commented Mar 17, 2023

tirix commented Mar 17, 2023

benjub commented Mar 18, 2023

benjub commented Mar 18, 2023

david-a-wheeler commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

benjub commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

benjub commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

benjub commented Mar 19, 2023

david-a-wheeler commented Mar 20, 2023 •

edited

Loading

GinoGiotto commented Mar 20, 2023

david-a-wheeler commented Mar 20, 2023

GinoGiotto commented Mar 20, 2023

jkingdon commented Mar 23, 2023

GinoGiotto commented Mar 23, 2023

david-a-wheeler commented Mar 23, 2023

icecream17 commented Mar 30, 2023

david-a-wheeler commented Mar 30, 2023

icecream17 commented Mar 31, 2023

david-a-wheeler commented Apr 2, 2023 via email

david-a-wheeler commented Apr 23, 2023

Check set.mm LaTeX in CI pipeline #3100

Check set.mm LaTeX in CI pipeline #3100

Conversation

david-a-wheeler commented Mar 17, 2023

david-a-wheeler commented Mar 17, 2023

david-a-wheeler commented Mar 17, 2023

david-a-wheeler commented Mar 17, 2023

tirix commented Mar 17, 2023

benjub commented Mar 18, 2023

benjub commented Mar 18, 2023

david-a-wheeler commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

benjub commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

benjub commented Mar 19, 2023

david-a-wheeler commented Mar 19, 2023

benjub commented Mar 19, 2023

david-a-wheeler commented Mar 20, 2023 • edited Loading

GinoGiotto commented Mar 20, 2023

david-a-wheeler commented Mar 20, 2023

GinoGiotto commented Mar 20, 2023

jkingdon commented Mar 23, 2023

GinoGiotto commented Mar 23, 2023

david-a-wheeler commented Mar 23, 2023

icecream17 commented Mar 30, 2023

david-a-wheeler commented Mar 30, 2023

icecream17 commented Mar 31, 2023

david-a-wheeler commented Apr 2, 2023 via email

david-a-wheeler commented Apr 23, 2023

david-a-wheeler commented Mar 20, 2023 •

edited

Loading