freenode/#clasp - IRC Chatlog

11:07:19 drmeister Hello everyone

11:15:58 kpoeck Hello

11:16:33 heisig Hello drmeister, hello kpoeck.

11:17:30 kpoeck Mahlzeit :-)

11:18:29 kpoeck Drmeister: I would like to fix interpret_token_or_throw_reader_error to allow all possible float-types in _STARreadDefaultFloatFormatSTAR

11:18:53 drmeister Hello gentlemen

11:19:08 drmeister kpoeck: Be my guest!

11:19:13 kpoeck Is there any real difference in precision between DoubleFloat_O and LongFloat_O

11:19:46 kpoeck since typedef double LongFloat;?

11:21:43 kpoeck And the same question for ShortFloat and SingleFloat?

11:23:12 kpoeck Am now compiling under the assumption that I can read both a DoubleFloat and a Long with strtod

11:23:52 kpoeck And read ShortFloat with strtof

11:24:18 drmeister No, DoubleFloat and LongFloat are the same. I've toyed with the idea of using the GMP arbitrary precision float for long-float.

11:24:55 kpoeck Still puzzled why SingleFloat is read with strtod, since clasp_make_single_float is defined to accept a float and is passed a double

11:25:32 kpoeck ok, then regarding LongFloat the fix is obvious

11:25:46 drmeister ShortFloat and SingleFloat are also confused - SingleFloat is a tagged C 'float'. I think (but I may have forgotten) ShortFloat is still managed by the GC.

11:26:51 kpoeck ShortFloat is allocated by GC_ALLOCATE(ShortFloat_O, sf);

11:27:05 kpoeck Probably what you said

11:28:08 kpoeck and SingleFloat allocated by gc::make_tagged_single_float<core::SingleFloat_I>(x);

11:29:45 drmeister Yes. SingleFloat values are not stored on the heap at all - they are coded into the tagged pointer itself. Like fixnum or character.

11:30:15 drmeister They are shifted a few bits, have tag written into the lower three bits and that's how they are represented.
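
The tagging scheme described above can be sketched roughly as follows. This is a minimal illustration, not Clasp's actual code: the tag value `kSingleFloatTag`, the shift amount, and all function names are hypothetical, chosen only to match "shifted a few bits, tag in the lower three bits".

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical layout: these constants are illustrative, not Clasp's real ones.
constexpr unsigned kTagBits = 3;                                       // tag lives in the low three bits
constexpr std::uint64_t kTagMask = (std::uint64_t{1} << kTagBits) - 1;
constexpr std::uint64_t kSingleFloatTag = 0b110;                       // assumed tag value

// Pack a C float into a 64-bit word: float bits shifted up, tag in the low bits.
std::uint64_t tag_single_float(float value) {
    std::uint32_t bits;
    std::memcpy(&bits, &value, sizeof bits);  // well-defined way to read the float's bits
    return (static_cast<std::uint64_t>(bits) << kTagBits) | kSingleFloatTag;
}

// True if the word carries the (assumed) single-float tag.
bool is_single_float(std::uint64_t word) {
    return (word & kTagMask) == kSingleFloatTag;
}

// Recover the float. No heap access is involved, so nothing is consed.
float untag_single_float(std::uint64_t word) {
    assert(is_single_float(word));
    std::uint32_t bits = static_cast<std::uint32_t>(word >> kTagBits);
    float value;
    std::memcpy(&value, &bits, sizeof value);
    return value;
}
```

Because the whole value fits in the word itself, creating or reading such a float never touches the garbage-collected heap, which is the property that matters later in the discussion about consing.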

11:30:51 drmeister So it leaves the question of what to do with ShortFloat_O?

11:31:38 drmeister They could be aliased to SingleFloat's... Are there any other ideas of what to do with them?

11:32:18 drmeister What about LongFloat - I'd like to implement them with GMP arbitrary precision floats.

11:33:02 kpoeck Looking at ansi, it is totally fine to just have 2 representations of floats

11:33:15 drmeister Kind of how Bignum's are implemented with GMP mpz.

11:33:55 kpoeck In ccl, for number crunching I used short-floats instead of single-float, since they would not cons

11:33:58 drmeister kpoeck: Right - I did that with that in mind - I didn't tidy up ShortFloat and LongFloat because I didn't know what to do with them at the time. So I just left them.

11:36:21 drmeister Cleavir has a way of doing a lot of math without boxing/unboxing operations that I am looking forward to using.

11:38:41 kpoeck Seems to be the way to go then

11:39:32 kpoeck So I just extend interpret_token_or_throw_reader_error for short-float and long-float (testing right now)
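
A minimal sketch of the dispatch kpoeck describes, assuming the reader picks `strtof` or `strtod` from the current default float format. The `FloatFormat` enum, `ParsedFloat`, and `parse_float_token` are hypothetical names for illustration; this is not the body of `interpret_token_or_throw_reader_error`, and routing single-float through `strtof` is an assumption. It also ignores Lisp exponent markers such as `d0` or `s0`, which `strtod` does not understand, and it does not filter out forms that `strtod` accepts but a Lisp reader would not (for example `inf` or hexadecimal floats).

```cpp
#include <cassert>
#include <cerrno>
#include <cstdlib>
#include <string>

// Hypothetical stand-in for the value of *read-default-float-format*.
enum class FloatFormat { Short, Single, Double, Long };

struct ParsedFloat {
    FloatFormat format;
    double value;  // wide enough to hold either parse result exactly
};

// Returns false if the token is not entirely a valid, in-range float.
bool parse_float_token(const std::string& token, FloatFormat format, ParsedFloat& out) {
    const char* begin = token.c_str();
    char* end = nullptr;
    double value = 0.0;
    errno = 0;
    switch (format) {
    case FloatFormat::Short:
    case FloatFormat::Single:
        value = std::strtof(begin, &end);  // rounds to C float precision
        break;
    case FloatFormat::Double:
    case FloatFormat::Long:
        value = std::strtod(begin, &end);  // long-float is a plain double for now
        break;
    }
    assert(end != nullptr);  // both functions always set the end pointer
    if (end == begin || *end != '\0' || errno == ERANGE) {
        return false;
    }
    out = ParsedFloat{format, value};
    return true;
}
```

Parsing with `strtof` for the narrower formats avoids rounding twice (text to double, then double to float), which is one reason to keep the two paths separate.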

11:40:31 kpoeck On a different matter, could you perhaps test whether issue #531 also crashes your clasp?

11:41:03 kpoeck I did rerun the ansi-tests after the big merge and only have this regression

11:42:43 heisig drmeister: Currently, with the whole machine learning hype, 16bit half-floats have become fashionable again. So that could be a use case for ShortFloat_O.

11:45:12 drmeister Good point - that sounds like a good use for short-float.

11:46:31 drmeister Hieronymus Bosch - what a weirdo. https://tuinderlusten-jheronimusbosch.ntr.nl/en

11:47:12 drmeister The detail in that painting is amazing though.

11:50:06 kpoeck Off to the public viewing of Germany vs. Mexico in the World Cup

11:50:58 kpoeck Meanwhile running ansi-tests with the extended interpret_token_or_throw_reader_error

11:51:31 drmeister I confirmed that issue 531

11:51:36 drmeister Enjoy the game.

12:06:43 kpoeck Will follow the channel via the phone

12:08:06 kpoeck Glad that i didn’t hallucinate with issue 531

12:14:56 drmeister heisig: What do you think about arbitrary precision float's - would it be a good idea to expose them as long-float?

12:19:35 Bike what good is a 16 bit float...?

12:23:09 drmeister Machine learning apparently

12:35:12 heisig Bike: This is wikipedia on 16bit floats: https://en.wikipedia.org/wiki/Bfloat16_floating-point_format

12:35:53 Bike i know what they are, i just don't understand what they're for

12:36:17 Shinmera "The bfloat16 format is utilized in upcoming Intel AI processors, such as Nervana NNP-L1000, Xeon processors, and Intel FPGAs,[1][2][3] Google Cloud TPUs,[4][5][6] and Tensorflow.[6][7]"
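
For readers unfamiliar with the format in the linked article: bfloat16 is the upper 16 bits of an IEEE 754 single-precision float (sign, 8 exponent bits, 7 fraction bits). A minimal conversion sketch, using plain truncation rather than round-to-nearest; the function names are made up for illustration and have nothing to do with Clasp's sources.

```cpp
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == sizeof(std::uint32_t), "assumes a 32-bit float");

// Keep the sign, the 8 exponent bits and the top 7 fraction bits.
// Truncation only: a production conversion would usually round to nearest.
std::uint16_t float_to_bfloat16_truncate(float value) {
    std::uint32_t bits;
    std::memcpy(&bits, &value, sizeof bits);
    return static_cast<std::uint16_t>(bits >> 16);
}

// Widening is exact: the dropped low fraction bits are simply zero.
float bfloat16_to_float(std::uint16_t half) {
    std::uint32_t bits = static_cast<std::uint32_t>(half) << 16;
    float value;
    std::memcpy(&value, &bits, sizeof value);
    return value;
}
```

The format keeps the full float32 exponent range and gives up precision, which is why conversion in both directions is so cheap.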

12:36:58 heisig drmeister: Arbitrary precision long-floats sound like a great idea. I know applications where this would be extremely useful.

12:37:00 drmeister Now you are just teasing

12:38:33 drmeister Ok then. short-float will be 16bit floats - I'll need to tweak the tagging scheme. long-float's will be arbitrary precision floats

12:38:50 Bike i don't think we need to do that right now, though

12:38:56 Bike let's put it on the bug tracker as an enhancement

12:39:05 drmeister Absolutely.

12:39:26 Bike also just knowing processors use it doesn't tell me what it's for

12:39:43 stassats` less silicon, more units => faster

12:39:47 stassats` that's about it

12:40:07 stassats` doubt it's any useful for clasp, other than some interoperability

12:42:39 kpoeck Single-floats do not seem to cons, is that correct?

12:43:21 drmeister That is correct - single-floats do not cons - they are stored in tagged pointers.

12:43:42 stassats` well, they are not pointers

12:44:04 stassats` tagged immediates

12:44:05 drmeister Right - tagged values - tagged whatevers

12:44:10 kpoeck That's good enough for me

12:44:12 drmeister tagged immediates

12:45:00 kpoeck Will read about 16 bits floats

12:45:27 drmeister We have one tagged bit pattern left unused that we could dedicate to them.

12:45:31 drmeister IIRC

12:46:32 drmeister I was going to use them as pointers into aligned C++ memory - but that never panned out. Actually, I'll wait until Martin comes on board - he may have some use for another kind of tagged immediate.

12:47:15 stassats` i think the only thing clasp will benefit from with 16 bit floats is reduced storage

12:47:42 stassats` scalar operations would not benefit, aside maybe something like division

12:48:09 stassats` otherwise you'll need clasp to run on these TPUs or support SIMD or wherever these floats are used

12:48:40 drmeister Well, there is interoperability with C++ machine learning libraries if we need to pass 16bit floats back and forth. Interoperability - as you mentioned above.

12:49:06 stassats` doubt that requires full blown tagging support

12:49:15 drmeister Noted.

12:57:59 drmeister What's up with a language-server-protocol server for Common Lisp? Does anyone use one?

13:00:02 drmeister jupyterlab appears to be going in that direction - to use language server protocol.

13:00:44 drmeister ACTION is rummaging around inside of jupyterlab discussion groups and github issue trackers

13:03:36 Bike how would a widget work, then...?

13:03:58 drmeister I think this is for input cells and editors.

13:04:25 Bike oh, well, yeah.

13:04:32 Bike it'll tell jupyter how to do indentation and stuff, right.

13:04:40 drmeister jupyterlab lets you bring up editors - they have emacs and vi key bindings

13:04:52 drmeister Right - the indentation is what drives me a little crazy.

13:04:59 drmeister Lack thereof

13:05:10 attila_lendvai hi! I could use some insights/pointers... so, iclasp is a CL interpreter written in C++. aclasp is iclasp loading a bunch of files implementing a minimal CL system. what I don't understand is that compile-aclasp seems to be actually compiling stuff. how is that happening? what kind of compilation can iclasp do?

13:05:40 stassats` it can load interpret the compiler?

13:05:45 stassats` load and interpret

13:06:47 Bike iclasp is the C++ interpreter. then it loads a bunch of files including a compiler, and compiles itself

13:09:31 drmeister attila_lendvai: If you start from ./waf distclean configure build_iboehm <-- it builds the C++ interpreter iclasp-boehm

13:10:07 stassats` so, you're interpreting C++ now?

13:10:31 drmeister Then you can use ./waf build_rboehm <-- This loads the C++ interpreter and has it load the aclasp Common Lisp source code as interpreted code - absolutely no Common Lisp compilation happens up to this point.

13:10:42 Bike stassats`: the interpreter written in C++

13:11:09 drmeister stassats`: You got me: ./waf distclean configure build_iboehm <-- it builds the C++ interpreter of a subset of Common Lisp programming language: iclasp-boehm

13:12:22 drmeister So: ./waf build_rboehm puts you in a REPL that loaded the aclasp/bclasp Common Lisp compiler as interpreted code. At this point you can compile-file a subset of Common Lisp code.

13:12:25 stassats` (that wasn't a serious question)

13:12:57 drmeister stassats`: I know - and your point was fine - what I said was ambiguous.

13:14:10 drmeister ./waf build_aboehm loads the same Common Lisp source files as interpreted functions and then runs 'compile-file' on all of the Common Lisp source files that ./waf build_rboehm and ./waf build_aboehm load.

13:15:09 Bike someone else asked it seriously before

13:16:07 drmeister ./waf build_aboehm then links the compile-file'd code into a fasl called clasp/build/boehm/fasl/aclasp-boehm-image.fasl <--- Run with this and you get what you get when you run ./waf build_rboehm but everything is compiled. It provides a subset of Common Lisp and can further be used to compile a complete Common Lisp: 'bclasp'

13:17:52 drmeister The reason for all of this is that the interpreter written in C++ is an S-expression walking interpreter and it expands macros whenever they are encountered during evaluation. This is really, really slow when you start hitting a lot of macros as in loop and CLOS.

13:18:42 drmeister So we need some kind of compilation before we even load that code.

13:25:29 attila_lendvai ACTION processes the info and updates the wiki when done

13:29:15 attila_lendvai bclasp is complete-complete CL, or just mostly complete CL?

13:31:29 drmeister bclasp is a complete-complete CL.

13:33:45 drmeister There is another wrinkle regarding 'aclasp'

13:35:14 drmeister As ./waf build_aclasp compile-file's its own code - it LOAD's the fasl files. This replaces interpreted functions with compiled ones as they are generated. This happens in both serial and parallel builds. The code is set up to be tolerant of hot-swapping interpreted code with compiled versions in any order.
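
The hot-swapping idea can be illustrated with a toy function table. This is only a sketch of the general technique (callers always look a function up by name, so a definition can be replaced at any time); `FunctionTable` and `LispFunction` are hypothetical and do not reflect how Clasp stores function definitions.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <unordered_map>
#include <utility>

// Toy stand-in for a Lisp function: takes and returns an int.
using LispFunction = std::function<int(int)>;

class FunctionTable {
public:
    // Installing a definition under an existing name replaces the old one,
    // e.g. a compiled version taking over from an interpreted version.
    void define(const std::string& name, LispFunction fn) {
        table_[name] = std::move(fn);
    }

    // Every call goes through the table, so it sees the latest definition.
    int call(const std::string& name, int arg) const {
        auto it = table_.find(name);
        assert(it != table_.end());
        return it->second(arg);
    }

private:
    std::unordered_map<std::string, LispFunction> table_;
};
```

Since callers never hold on to a particular definition, the order in which individual functions are swapped does not matter, as long as the old and new versions behave the same.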

13:35:58 attila_lendvai and aclasp is compile-file'ing how? aclasp is already capable of calling the LLVM machinery at runtime?

13:36:28 drmeister Yes - everything stage of Clasp building has full access to all of the llvm machinery.

13:36:33 drmeister every stage

13:37:30 drmeister If you go iclasp-boehm -I -n (loads absolutely no CL code, either interpreted or compiled).

13:37:44 drmeister All of the llvm-sys functions (llvm machinery) are exposed.

13:38:43 drmeister You get a repl and a very restricted subset of the Common Lisp language - but everything that is exposed from C++ is exposed here.

13:41:50 drmeister It's better to build aclasp with fewer parallel processes than more - because more of the code can then take advantage of the compiled fasls that replaced the interpreted code. I think I currently limit aclasp compilation to 8 parallel processes. I haven't timed it carefully.

13:42:08 drmeister But 36 was too many.

13:42:23 beach This bootstrapping procedure should be written down somewhere.

13:42:35 attila_lendvai beach: I'm editing the wiki as we are talking

13:42:45 beach Excellent.

13:42:48 drmeister I think Bike did write it down.

13:42:54 beach OK.

13:42:54 attila_lendvai FTR, it'll be here: https://github.com/clasp-developers/clasp/wiki/The-Build-Process/

13:43:50 attila_lendvai there's docs/bootstrap.rst. maybe I'll extend that and point to it from the wiki. I'll think about it

13:58:35 attila_lendvai registerClasses.py and src/common/build is obsoleted by the CL scraper that is run by sbcl and wscript, right? I'll delete anything in the codebase and docs related to it. let me know if that's not desirable!

14:00:08 attila_lendvai or is the python groveler in a good enough state that it's worth keeping it in some tools-for-build/obsolete/ directory?

14:01:23 drmeister Working backwards. The python groveler can be gently lowered into a grave and buried. It is dead.

14:02:30 drmeister Where is registerClasses.py? I don't see it when I try to 'find' it

14:02:35 attila_lendvai ACTION keeps recording using git rm

14:03:07 attila_lendvai drmeister: in the docs, and some of its output can be found checked into the repo (by rgrep registerClasses .)

14:03:20 drmeister src/common/build can be removed

14:03:26 drmeister ls

14:04:20 drmeister Here - I'll remove some old crap from src/common/*

14:04:36 attila_lendvai drmeister: as I noted above I'm already doing it

14:05:19 attila_lendvai i.e. merge conflict

14:05:28 drmeister Right - right

14:05:39 drmeister Please proceed

14:07:18 attila_lendvai drmeister: but I welcome your impulses for cleaning the codebase! it's very nice for the newcomers, thanks! :)

14:08:34 drmeister It's probably better to do it slowly and methodically. I don't instantly recognize what is being used and what is not.

14:08:56 drmeister build mentioned BJAM - that is definitely not used anymore - thank goodness.

14:10:42 attila_lendvai ACTION is also cleaning docs/

14:12:45 attila_lendvai drmeister: I have the following left in src/common. let me know if any of those should also get "gently lowered into a grave"... :) analyzeLldbBacktrace.py build-json.lisp buildJSON.py copywrite.lsp metering-allocations.lisp peakMemory.cc

14:14:40 attila_lendvai buildJSON.py also seems suspect

14:16:04 drmeister Yeah - I think that can be removed.

14:20:38 attila_lendvai drmeister: I assume analyzeLldbBacktrace.py can also be disappeared. I moved the rest that seemed useful to tools/ and I'll delete src/common/

14:23:57 drmeister Actually, can you move analyzeLldbBacktrace.py to tools? I have used that to analyze the size of stack frames to figure out if there are functions that allocate too much on the stack.

14:25:36 attila_lendvai ACTION does that

14:30:29 attila_lendvai I've pushed it. let me know if I happened to delete anything that you want restored: https://github.com/clasp-developers/clasp/commit/61e2b47c0299baaf89e24c08b1b4a7159a860c64

14:30:43 drmeister Thank you very much.

15:00:26 heisig Ok guys, see you tomorrow!

15:00:42 beach Have a nice trip.

15:00:49 heisig Thanks.

15:00:58 drmeister Ok.

15:01:10 drmeister If you have any trouble - call my cell.

15:02:37 karlosz beach: its not explicitly written in the docs, but we are assuming that for phi functions, the order of the predecessors have to match the order of the inputs, right?

15:03:03 karlosz some code you wrote seems to assume that, and otherwise we have to do the dominator calculations over and over again to find the corresponding input for each predecessor

15:03:33 karlosz but things like delete-instruction shuffle the order of the predecessors, like if you deleted the first phi of a cluster

15:03:37 karlosz see the PR i wrote

15:04:02 beach Yes, but that is also the reason why I strongly dislike SSA. It imposes such an order whereas the flow graph itself does not. It ruins everything.

15:04:48 beach And, you are right, I did not take predecessor order into account when I wrote the code to modify the graph.

15:05:42 beach OK, PR merged. Thanks.

15:06:17 karlosz maybe phi instructions can have a mapping between inputs and predecessors as an extra slot?

15:06:47 beach I especially dislike SSA because I am convinced (but I have yet to do the research) that basically none of the optimization techniques that claim to need SSA takes advantage of the fact that SSA is really S.S.A.

15:06:50 Bike you'd still have to update it when the graph is modified, right?

15:07:21 beach karlosz: I don't think that would be a good idea.

15:07:26 beach Preserving the order is better.

15:07:38 beach Doing something other than SSA would be the best.

15:07:47 beach But I am not there yet.

15:08:38 karlosz beach: https://en.wikipedia.org/wiki/Sparse_conditional_constant_propagation this is the algorithm that i just implemented that requires SSA

15:09:04 karlosz it runs in linear time with SSA, and i think itd be very difficult to write without assuming definitions are unique and dominate uses

15:09:32 beach karlosz: What aspect of SSA does it need?

15:09:34 karlosz the effects are folding something like https://paste.gnome.org/pp2hrvkia to just values 4 4 1 2 3 4

15:09:51 karlosz it needs unique definitions

15:09:56 karlosz that dominates uses

15:10:07 karlosz otherwise youd have to compute some sort of reaching definitions first
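
As background for the algorithm karlosz links to: sparse conditional constant propagation tracks each variable in a three-level lattice (not yet known, a single constant, or not a constant) and combines the values arriving at a merge point with a "meet" operation. The sketch below shows only that lattice operation, with made-up names; it is not karlosz's Cleavir implementation.

```cpp
#include <cassert>

class LatticeValue {
public:
    enum class Kind { Undefined, Constant, Overdefined };

    static LatticeValue undefined() { return LatticeValue(Kind::Undefined, 0); }
    static LatticeValue constant(long v) { return LatticeValue(Kind::Constant, v); }
    static LatticeValue overdefined() { return LatticeValue(Kind::Overdefined, 0); }

    Kind kind() const { return kind_; }

    // The constant is only meaningful for Kind::Constant.
    long value() const {
        assert(kind_ == Kind::Constant);
        return value_;
    }

private:
    LatticeValue(Kind kind, long value) : kind_(kind), value_(value) {}
    Kind kind_;
    long value_;
};

// Combine two facts about the same variable, e.g. two inputs of a phi.
LatticeValue meet(const LatticeValue& a, const LatticeValue& b) {
    using Kind = LatticeValue::Kind;
    if (a.kind() == Kind::Undefined) return b;   // nothing known yet on one side
    if (b.kind() == Kind::Undefined) return a;
    if (a.kind() == Kind::Constant && b.kind() == Kind::Constant &&
        a.value() == b.value()) {
        return a;                                // both sides agree on the constant
    }
    return LatticeValue::overdefined();          // conflicting or already unknown
}
```

With unique definitions that dominate their uses, each variable needs exactly one such lattice cell, which is the property the discussion above is about.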

15:10:10 beach Oh, but other notations than SSA have that property.

15:10:21 beach So again, it does not use the fact that SSA is S.S.A.

15:10:29 beach It uses some other property of it.

15:10:31 karlosz which other form has that property?

15:10:46 beach For example the SFA that I have been thinking about.

15:11:08 karlosz do you have a link to it?

15:11:09 beach Like SSA, but put an assignment in each incoming arc of what used to be the Phi instruction.

15:11:15 beach Not sure it is written down.

15:11:37 beach All I am saying is that SSA ruins everything, and no algorithm I know takes advantage of the fact that SSA is S.S.A.

15:12:02 beach Plus, it is exactly the S.S.A. aspect of SSA that ruins everything.

15:12:48 beach karlosz: But I have yet to do the research like I said. It would be a beautiful result though.

15:13:12 karlosz i do the transformation you are talking about. to actually implement the phi functions, i convert the phi to assignment instructions along each arc before lowering to code

15:13:23 beach There you go.

15:13:34 beach So you do not take advantage of the fact that SSA is S.S.A.

15:13:40 karlosz its standard though, and still considered SSA in the literature, because those phi functions still have to be implemented somehow

15:13:44 karlosz but i get your point

15:13:46 karlosz the order thing is really nasty

15:13:52 beach Very nasty.

15:13:57 karlosz maybe that should be the standard form of SSA cleavir uses

15:14:08 karlosz since then you avoid the cluster madness too

15:14:18 beach karlosz: I justify my salary by questioning traditional wisdom in the literature.

15:14:48 karlosz ha!

15:15:05 beach karlosz: I have to at least convince myself that SFA is good enough for all the important published optimization techniques.

15:15:55 beach But it is a beautiful research project. Take the literature on compiler optimization techniques that claim to need SSA, and show that they don't use the fact that SSA is S.S.A.

15:16:22 beach I know of one algorithm that needs it, but then it was shown that the fact that it does makes it worse than others.

15:16:39 karlosz so is SFA just SSA with the phi node implemented?

15:16:40 beach And the fix was a kludge to the hack.

15:17:01 beach Phi nodes replaced by assignments in the incoming branches, yes.

15:17:16 beach So it is not S.S.A because there are several assignments to the new variable.

15:17:24 beach But it has all the other properties of SSA.

15:17:40 beach And I am betting it has all the ones that are needed by published algorithms.

15:17:51 karlosz this form of SSA is described in chapter 19 of Appel in his chapter on SSA

15:17:55 karlosz just so you know

15:18:01 beach SFA is?

15:18:04 beach Excellent!

15:18:18 karlosz "we can "implement" the phi-functions using a move instruction on each incoming edge, as shown in Section 19.6"
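
The transformation being discussed (replace each phi by an assignment on every incoming arc) can be sketched as below. The sketch also uses an explicit predecessor-to-input mapping instead of relying on positional order, in the spirit of karlosz's earlier suggestion. All types and names are hypothetical and unrelated to Cleavir's instruction classes. It deliberately ignores the case where phis in the same cluster read each other's destinations, which needs parallel-copy handling in a real implementation.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// A plain assignment "dest := source" placed on an incoming arc.
struct Move {
    std::string dest;
    std::string source;
};

// A phi with an explicit predecessor -> input mapping, so the order of
// predecessors in the flow graph does not matter.
struct Phi {
    std::string dest;
    std::map<std::string, std::string> input_for_predecessor;
};

// For every predecessor block, collect the moves to place on its arc into
// the merge point. After this, the phis themselves can be dropped.
std::map<std::string, std::vector<Move>> lower_phis(
        const std::vector<Phi>& phis,
        const std::vector<std::string>& predecessors) {
    std::map<std::string, std::vector<Move>> moves;
    for (const auto& pred : predecessors) {
        for (const auto& phi : phis) {
            auto it = phi.input_for_predecessor.find(pred);
            assert(it != phi.input_for_predecessor.end());  // every arc needs an input
            moves[pred].push_back(Move{phi.dest, it->second});
        }
    }
    return moves;
}
```

The result has several assignments to the phi's destination (one per arc), so it is no longer single static assignment in the literal sense, which is exactly the point beach is making.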

15:18:20 beach A variant of SSA that is not S.S.A.

15:18:28 beach Brilliant.

15:18:58 beach That information makes me more confident that I am right.

15:19:51 beach So, I would much prefer to have SFA as the notation used by Cleavir.

15:20:21 beach But, since I am not yet sure that it can do all the interesting published optimization techniques, I am not willing to make such a declaration quite yet.

15:20:42 beach On the other hand, SFA can be trivially converted to SSA if that should be needed.

15:20:56 beach And, of course, computing SFA is the same work as computing SSA.

15:20:57 karlosz yes, its an easy one step invertible transform

15:21:05 beach Right.

15:21:11 karlosz although SFA does cause the need for a slight special case in a lot of algorithms

15:21:37 karlosz you need to consider multiple defining instructions for those merge point locations

15:21:42 karlosz but nowhere else

15:21:45 beach That is entirely possible. But those algorithms should be examined to see whether they perhaps could even be simplified.

15:22:04 karlosz right

15:22:16 karlosz my variation on SSA inserts unphi nodes that act as assignments in the arc

15:22:32 beach If you can live with that, then that is what I prefer.

15:22:39 karlosz so you can easily distinguish between normal assignments and those join point assignments

15:22:53 beach I see.

15:22:54 karlosz right, but either way its a one step transform so implementors can choose

15:22:59 karlosz i for one would like the order thing gone

15:23:03 beach I agree.

15:23:27 beach *sigh* so much to do, so little time.

15:41:12 beach karlosz: The optimization technique you gave a link to looks like one that should be included in Cleavir. In case I haven't mentioned it, I think Cleavir should ultimately contain a large collection of such techniques, so that client code can choose the ones that are appropriate for that particular client.

15:42:00 beach Many of the published techniques need to be adapted to the fact that our language is more complicated than the C-like language that is assumed in published techniques.

15:50:51 karlosz yeah, i adapted the technique by defining a generic function protocol on instructions that declare whether they allow constant folding and methods for how to actually constant fold

15:51:17 karlosz ultimately it seems well suited to do this on HIR

15:58:33 attila_lendvai drmeister: so, in the current build scheme the scanning of extensions/ only makes sense in the 'dclasp' stage, right?

15:59:46 attila_lendvai i.e. building vanilla clasp won't ever mess with extensions/, right?

16:02:06 drmeister attila_lendvai: Correct about the dclasp stage and extensions.

16:02:55 drmeister Correct, vanilla clasp never messes with extensions. However, vanilla clasp does automatically check for extensions. I like the "(1) clone an extension into clasp/extensions (2) it just builds".

16:06:48 attila_lendvai drmeister: I'm working towards my staged bootstrapping idea. I already have stage-1 (iclasp) building, and now I'm reducing the codebase for the stage-2 branch (aclasp). I'm envisioning that these early stages will be much simpler, with much fewer files. and the build scripts will also be much simpler, e.g. all of the stage_char complexity can go away from them.

16:07:31 drmeister As long as it doesn't break the build.

16:07:40 drmeister ACTION 's eye starts twitching uncontrollably.

16:08:53 drmeister And that means up to and including build_dxxxx

16:09:13 attila_lendvai drmeister: it'll be a long ride, but I have a rather exciting vision. everything will go into my own repo until its status/demo gets you excited, too... :)

16:09:33 drmeister I'll look forward to it.

16:12:50 attila_lendvai it will have its costs, e.g. 3-4 potentially diverging codebases, but if things work out as I expect them to, then touching the earlier stages will not be necessary too often, and in return it will make bootstrapping much more understandable and debuggable.

16:14:22 attila_lendvai there'll be a "toplevel" bootstrap.sh or equivalent that checks out and builds the stages under build/, and the toplevel dir will be used to build the currently latest stage

16:15:03 attila_lendvai I'm also considering how to introduce two "starting points" for the bootstrap, e.g. another one started off of sbcl+cleavir.

16:16:28 attila_lendvai I should have a much more detailed understanding of the bootstrap process, the lack of which will bite me a few times, but... how else to learn its intimate details?

16:17:49 drmeister The sbcl+cleavir one Bike and I have talked about a lot - it requires a lot of clasp runtime to be replicated in sbcl.

16:18:28 drmeister I think if you do that one - that might be the only one to focus on. If you could compile the cclasp code from sbcl+cleavir then why do anything else?

16:18:46 attila_lendvai I won't attempt it, just thinking ahead to set things up so that we can accommodate for that

16:19:40 attila_lendvai drmeister: for the beauty of it? of being able to bootstrap the same system based off of two distinct languages and ending up with reproducible builds (same output executable from both base languages)

16:19:47 drmeister We were thinking that sbcl+cleavir would compile to a list of generic function calls that would be run in the iclasp-boehm interpreter to invoke the llvm functions to generate code.

16:21:07 drmeister I'm all for beauty - but I don't want anyone wasting time on something that won't be used. There aren't a lot of us.

16:22:25 drmeister I don't have a lot of energy to talk anyone out of anything. But please, really think hard about spending a lot of time on something that won't be useful.

16:23:18 drmeister Potentially diverging codebases makes me anxious. However, we are bumping into constraints wrt what we can optimize because of the way clasp builds.

16:24:55 attila_lendvai it's also a bit of a security concern, although somewhat far fetched... see the http://wiki.c2.com/?TheKenThompsonHack (a trojan in gcc that reproduces itself when bootstrapping gcc from sources with a gcc.exe, and also "miscompiles" the sources of login to implement a backdoor)

16:25:21 drmeister How about you talk with Bike about this more? He has thought about the sbcl+cleavir compilation a lot. If you two could come up with a solid plan for a better build system that (1) is faster and (2) doesn't constrain optimizations - we could support you to do it.

16:26:41 attila_lendvai drmeister: my hope is that we will be able to easily and cheaply "abandon" stages and "escape forward", which will make the diverging codebases less of an issue (unless we manage to introduce a bug that manifests several stages later, but I'm not sure I could even create one if I was asked to)

16:27:20 drmeister Yeah - I'm aware of the compiler trojan issue - I worry about it like I worry about malevolent AI. All I say is - just don't mess with my molecules.

16:28:06 attila_lendvai I'll keep you guys updated here, but first I need to have a proof of concept that can rebuild clasp as it is currently to convince myself that this is viable and worth it

16:28:18 drmeister Well, it's more of a concern. We could use cclasp to build cclasp right now - we don't because we are worried about the compiler spinning out of control with a bug.

16:28:33 drmeister Ok.

16:32:58 attila_lendvai I would like to retain the ability to bootstrap off of clang, and the longer I wait, the more probable that you will eventually jump the sharks and start using clasp.exe for further development. I want to come up with something before that... :)

16:44:54 karlosz ACTION wishes he understood the clisp bootstrap process

16:45:36 drmeister karlosz: Type (1) 'make' (2) a bunch of stuff happens really fast (3) you have a working clisp executable.

16:45:40 drmeister ACTION is envious

16:46:09 karlosz yep. plus add loading cleavir and you only add a minute -.-

16:46:20 karlosz self building too!

16:47:34 drmeister Cleavir is building itself now?

16:47:41 karlosz yeah, it has been for a while

16:47:53 karlosz self build takes a minute and a half

16:47:54 drmeister Wait - this is clisp compiled cleavir compiling cleavir?

16:47:55 karlosz about

16:48:00 karlosz yes

16:48:02 karlosz and it can do it again

16:48:05 drmeister or this is cleavir compiled cleavir compiling cleavir?

16:48:07 karlosz in about the same amount of time

16:48:10 karlosz both

16:48:25 karlosz they both take roughly a minute and a half

16:48:28 drmeister Bike: Are you seeing this?

16:48:33 karlosz maybe faster now that ive gotten rid of some dominator stuff

16:49:11 drmeister karlosz: Do you have it set up so that I could try that?

16:50:20 drmeister Just so that I have this absolutely clear. You compile cleavir with clisp and then compile cleavir with that. Then you can run the result of that and compile cleavir again?

16:50:34 karlosz yes

16:50:45 drmeister In the final step there is no clisp generated code building anything anymore.

16:50:48 karlosz yep

16:51:11 drmeister That suggests that there is something seriously slow in the clasp runtime.

16:51:14 karlosz just pull from my branch and you should be able to try it

16:51:35 karlosz you need asdf set up with the latest sicl

16:51:41 karlosz like todays sicl

16:51:43 drmeister Because we are comparing cleavir generated clisp bytecode to cleavir generated llvm-ir lowered to native code.

16:52:03 karlosz yeah

16:52:05 karlosz i think its the compile time

16:52:20 drmeister What is the compile time?

16:52:51 karlosz like, the speed gain you have from having better generated code is a microoptimization compared to not having to do llvm optimizations

16:53:05 karlosz or whatever other cleavir optimizations you have going on

16:53:13 karlosz all i do is convert to ssa and do some dead code elimination

16:53:40 karlosz soon some conditional constant prop

16:53:54 karlosz but that doesnt impact the performance much either, since all ssa optimizations run in linear time

16:54:07 drmeister Well, we've been talking about doing that. Adding your hir/mir->ssa conversion and then lowering ssa to llvm-ir. It would be less work for llvm.

16:54:47 karlosz do you have timings for cclasp compiling cleavir?

16:55:01 karlosz maybe gc is a bottleneck?

16:55:15 karlosz since cleavir itself shouldnt compile that slowly, as my thing demonstrates

16:55:56 drmeister gc is maybe 30%

16:56:19 drmeister We can profile Common Lisp generated code alongside C++/C code. Nothing has stood out.

16:57:01 karlosz very strange

16:57:15 drmeister I've had a nagging feeling for years that something is wrong hidden in the runtime. This might help us suss it out if it's really there.

16:57:29 drmeister That or clisp bytecode is amazingly fast compared to llvm-ir->native code.

16:57:48 karlosz right. to be absolutely certain, the way im self compiling is to do (dolist (system '(cleavir-generate-ast cleavir-ast-to-hir sicl-boot cleavir-dominance))

16:57:49 drmeister Or I'm misunderstanding what you've got there - that's why I'd like to run it.

16:57:50 karlosz (asdf:load-system system :force t))

16:58:45 karlosz yeah, you can pull my branch, do (load "clisp/src/cleavir/load.lisp")

16:58:52 karlosz (setf *use-cleavir* t)

16:58:57 karlosz (push :cleavir *features*)

16:59:20 karlosz and do (let ((sys::*load-compiling* t)) (load "clisp/src/cleavir/load.lisp"))

16:59:25 karlosz that should launch the self compile

16:59:28 karlosz and you can just run it again

16:59:35 karlosz to get the cleavir on cleavir compiling cleavir

17:00:03 karlosz err, (setf cleavir-clisp::*use-cleavir* t)

17:00:13 karlosz i doubt the llvm generated code is slower than clisp generated bytecode

17:01:02 karlosz make sure you have today's sicl

17:01:09 drmeister Pulled and latest sicl pulled

17:01:54 drmeister No asdf

17:02:32 drmeister I'm in clisp/src

17:02:36 drmeister ./clisp --> repl

17:03:01 drmeister I have to be careful not to confuse Clasp.

17:03:10 drmeister How do you recommend getting asdf up in clisp?

17:04:20 karlosz i put this in my .clisprc.lisp

17:04:32 karlosz https://paste.gnome.org/pp3fodlas

17:04:38 karlosz you can get rid of the quicklisp thing

17:04:55 karlosz i.e. point it to a place where you have a relatively recent asdf

17:05:03 karlosz and uiop

17:07:50 drmeister I have a recent asdf - but it has a single asdf/build/asdf.lisp and uiop is included in that - isn't it?

17:08:29 drmeister I commented out the uiop.lisp load and use /Users/meister/Development/cando/src/lisp/modules/asdf/build/asdf.lisp

17:10:34 drmeister https://www.irccloud.com/pastebin/J1hiiUQw/

17:10:49 drmeister Component CLEAVIR-GENERATE-AST not found

17:11:07 drmeister I put the latest sicl in ~/quicklisp/local-projects/SICL/

17:11:37 drmeister Sorry for the basic questions - I know Clasp - other lisps with asdf/quicklisp - I'm a babe in the woods.

17:12:40 drmeister Adding back the quicklisp-init thing from your .clisprc.lisp

17:14:31 drmeister (sigh) DEFPACKAGE QL-CLISP: There is no symbol EXT::DELETE-DIR .

17:16:22 karlosz er, thats a very recent bug fixed in quicklisp

17:16:31 karlosz let me see what local patch i made

17:17:24 karlosz do this: https://github.com/quicklisp/quicklisp-client/commit/957e342a455fee4f8e708c66b8cf25825b19e280

17:17:27 karlosz that should be it

17:17:37 karlosz it also took me a while to get asdf and quicklisp working with clisp, no worries

17:17:56 karlosz im not sure about the uiop, but sounds like it should work

17:19:07 drmeister Done - thank you.

17:20:44 drmeister What other systems do I need? acclimation

17:23:09 drmeister concrete-syntax-tree

17:24:03 drmeister eclector

17:24:33 drmeister The usual suspects

17:25:57 drmeister karlosz: LOAD: A file with name /Users/meister/Development/clisp/src/cleavir/ssa-constant-prop.lisp does not exist

17:31:21 karlosz er, can you delete that file from load?

17:31:32 karlosz i didnt mean to check that in just now

17:32:09 karlosz that's WIP

17:43:34 drmeister Ok, I did it twice - how do I know that the second time it's cleavir generated bytecode that is building everything?

17:43:48 karlosz can you disassemble a cleavir function and paste the output?

17:44:36 karlosz its easy to tell what code clisp produces and which cleavir does

17:45:13 karlosz also, for me at least, once cleavir starts compiling itself i just get a bunch of expected at most nil warnings

17:45:20 karlosz which dont actually impact anything

17:45:53 drmeister https://www.irccloud.com/pastebin/I3UUUhyn/

17:46:40 drmeister I don't see any "expected at most nil" anything in the output.

17:46:46 drmeister That's what I'm looking for?

17:46:49 karlosz thats clisp compiled cleavir

17:46:57 karlosz what is the value of cleavir-clisp::*use-cleavir*?

17:47:07 karlosz did you bind sys::*load-compiling* to t?

17:47:29 drmeister cleavir-clisp::*use-cleavir* has no value

17:47:42 karlosz you have to set that to t

17:47:46 drmeister https://www.irccloud.com/pastebin/gdnt7Jfb/

17:47:49 karlosz before compiling twice

17:48:05 drmeister That's what I did - and then I repeated step 3.

17:48:13 drmeister Wrong package for *use-cleavir*?

17:48:14 karlosz you're in the wrong package

17:48:24 karlosz try (setf cleavir-clisp::*use-cleavir* t)

17:50:07 drmeister At the start there is no package with that name.

17:50:33 drmeister Did I miss something in the instructions?

17:50:52 drmeister I start up clisp and load the script I just pasted.

17:51:32 drmeister I should do:

17:51:37 drmeister https://www.irccloud.com/pastebin/8fxVnYLX/

17:52:02 karlosz https://paste.gnome.org/pqghmtsbm

17:52:33 karlosz load.lisp should give you a cleavir-clisp package

17:52:39 drmeister That's loading compiling it three times - right?

17:53:09 karlosz the first time is to load cleavir into clisp

17:53:16 karlosz the second time is to compile the sources with cleavir

17:53:25 karlosz the third time is to use the cleavir compiled cleavir to compile itself

17:53:31 drmeister Got it...

17:53:32 drmeister https://www.irccloud.com/pastebin/ozS71n3H/

17:53:42 karlosz yes

17:54:20 karlosz this is what i get for the disassemble command you used above

17:54:25 karlosz https://paste.gnome.org/po63fgcfe

17:54:45 drmeister If I then want to compile-file some test file using cleavir - is there anything I need to tell compile-file to use cleavir?

17:55:36 drmeister https://www.irccloud.com/pastebin/7jydAFAb/

17:55:45 karlosz as long as cleavir-clisp::*use-cleavir* is bound to t it should work

17:56:42 karlosz oh yea, can you add #-cleavir in check special form syntax, above function?

17:56:49 karlosz the problem is clisp allows two arguments to function

17:56:56 karlosz above (eql 'function)

17:57:10 karlosz and cleavir doesnt have an extra system parameter to check special form syntax

17:57:26 drmeister Where? You mean hack sicl?

17:57:31 karlosz sorry im making you go down a goose chase. this is the last one. i promise

17:57:33 karlosz yeah

17:57:40 karlosz sicl doesnt have a system parameter for function

17:57:46 karlosz for check special form syntax

17:58:01 karlosz so i need that for generate ast, which doesnt accept (FUNCTION ... ...) with two arguments

17:59:22 karlosz in check-special-syntax.lisp in generate ast, #-cleavir goes above the method that specializes on (eql 'function)

17:59:33 drmeister So #-cleavir where?

17:59:38 karlosz ^^^

17:59:45 drmeister https://github.com/robert-strandh/SICL/blob/master/Code/Cleavir/Generate-AST/environment-query.lisp#L3

17:59:53 karlosz no not there

18:00:05 karlosz https://github.com/robert-strandh/SICL/blob/master/Code/Cleavir/Generate-AST/check-special-form-syntax.lisp

18:00:18 karlosz line 66

18:00:43 drmeister Here? https://github.com/robert-strandh/SICL/blob/master/Code/Cleavir/Generate-AST/check-special-form-syntax.lisp#L66

18:00:53 karlosz yes

18:00:56 karlosz #-cleavir above that form

18:01:07 karlosz to prevent it from rejecting two arguments to the function special form

18:01:12 drmeister feature out the whole thing: #-cleavir (defmethod check-special-form-syntax ...)

18:01:45 karlosz yes

18:03:48 karlosz beach: check-special-form-syntax in generate-ast doesnt accept a system parameter like convert-special does. this ends up being a problem for implementations that let function take two arguments, for example. is it worth changing now?

18:05:01 drmeister Ok, lots of warnings now: WARNING: RAW was called with too many arguments:

18:05:05 karlosz good

18:05:09 karlosz it should be working then

18:16:40 drmeister There is a lot of output being generated - it's slowing things down. Is this expected?

18:16:41 drmeister https://www.irccloud.com/pastebin/HSqQklXC/

18:17:06 karlosz yeah

18:17:23 karlosz but does it take longer than 2 minutes to compile?

18:17:55 drmeister Yeah - the volume of output is slowing it down.

18:18:42 karlosz weird. i get that much output and it can still do it in about a minute and a half for me

18:18:58 karlosz im not quite sure whats causing it. if it really were miscompiling, it would have segfaulted by now

18:19:55 karlosz is your terminal slow or something? i do this in slime in emacs and it can cope fine

18:20:23 drmeister In emacs it was intolerable. I switched to macOS terminal and started it again.

18:20:37 drmeister It just finished

18:21:35 karlosz the second load or the third?

18:21:58 drmeister This:

18:21:59 drmeister https://www.irccloud.com/pastebin/IvgTuxIk/

18:22:13 karlosz oh so all of it

18:22:23 karlosz then that should be fully self hosted cleavir running on cleavir

18:22:27 karlosz and it didnt segfault?

18:22:49 karlosz if you compile file things now it should be with clisp cleavir

18:23:08 karlosz i mean it should be with cleavir on cleavir

18:26:19 drmeister As a sanity check I'm compiling the following:

18:26:22 drmeister https://www.irccloud.com/pastebin/zYQe23VA/

18:26:32 drmeister With (load (compile-file "fibn.lisp"))

18:26:50 drmeister Then (time (fibn 1000000 78))
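
The fibn source exists only in the pastebin and is not reproduced in this log. A minimal sketch of what such a microbenchmark might look like, assuming an iterative loop that recomputes the NUM-th Fibonacci number REPS times (the body and the argument names are guesses, not the pasted code):

```lisp
;; Hypothetical reconstruction of a fibn-style microbenchmark.
;; Assumes F(1) = F(2) = 1 and returns the value from the last repetition.
(defun fibn (reps num)
  (let ((z 1))
    (dotimes (r reps z)
      (declare (ignorable r))
      (let ((p1 1)    ; F(k)
            (p2 1))   ; F(k-1)
        (setf z 1)
        ;; Each pass advances the pair one step along the sequence.
        (dotimes (i (- num 2))
          (declare (ignorable i))
          (setf z (+ p1 p2)
                p2 p1
                p1 z))))))

;; (fibn 1 78) => 8944394323791464
```

The 78th value is about 8.9e15, so whether the additions stay in fixnum range depends on the implementation's most-positive-fixnum. That is one plausible reason for two implementations to report very different "bytes consed" figures for the same loop.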

18:27:08 drmeister Clasp 6.079 secs 16 bytes consed

18:27:40 drmeister cleavir+clisp: 8.57 secs 888000000 bytes

18:28:14 drmeister Hang on - different results!

18:28:16 drmeister Whaa?

18:29:17 karlosz uhh

18:29:25 karlosz whos giving the wrong answer

18:30:03 karlosz looks like i am

18:30:06 karlosz f**k

18:30:13 drmeister clisp

18:30:25 karlosz okay back to the drawing board then...

18:30:59 drmeister I've been there before.

18:31:34 karlosz the self compile gave me a false sense of security

18:31:37 karlosz its never over

18:31:58 drmeister Well - it gets better.

18:32:31 karlosz even so, im surprised there wasnt much of a difference in timings

18:34:19 drmeister It looks like we've suffered some regression in the speed of our code.

18:34:48 drmeister Back in 2015 I posted this: https://drmeister.wordpress.com/2015/07/30/timing-data-comparing-cclasp-to-c-sbcl-and-python/

18:35:54 karlosz i see. a 3x slowdown?

18:36:04 drmeister 10,000,000 calculations of the 78th Fibonacci number. --> 2.9 seconds with Clasp on an older laptop.

18:36:30 drmeister Now It's 61.9 seconds.

18:37:05 drmeister 21x

18:37:13 karlosz wow

18:37:28 drmeister We've been messing around with things - we will have to take a look at it though.

18:38:01 drmeister Looking at the code...

18:39:18 drmeister Debugging is forced on - switching off.

18:41:05 drmeister Hmmph - I can't switch it off now without recompiling everything.

18:46:46 drmeister Crap - it's stuck on and I can't seem to switch it off.

18:49:07 drmeister I'm rebuilding Clasp now with debug frames switched off.

18:59:34 drmeister I'll be very surprised if there has been a significant drop in the efficiency of the code since last year. The major changes that do slow things down are being corrected using HIR inlining by Bike and aren't even active at the moment.

19:00:37 drmeister karlosz: After running that script I pasted ...

19:00:39 drmeister https://www.irccloud.com/pastebin/x2ZTq4ZQ/

19:01:08 karlosz what happened?

19:01:23 drmeister Nothing - I'm just making absolutely sure I understand.

19:02:08 drmeister After running that script when I use compile-file - it's cleavir compiled code running cleavir. I want to compare timing between cleavir+clasp and cleavir+clisp

19:02:34 karlosz yes

19:02:35 drmeister It's unfortunate (and my fault) for naming clasp so close to clisp.

19:02:36 karlosz hence the wrong answer

19:02:57 karlosz well its clisp fault for naming their implementation C(ommon)Lisp

19:02:59 drmeister Right - for now - until you figure it out.

19:03:03 karlosz yeah, i have

19:03:13 drmeister Oh - you did - good for you!

19:03:22 drmeister How easy is it to fix?

19:03:38 karlosz very easy, two locations were shared that shouldnt have been.

19:03:41 karlosz well ill fix it first then tell

19:05:12 drmeister Here's how I think things will shake out. For this microbenchmark clasp's generated code is at least 25x faster than cleavir+clisp's - that's native code vs bytecode.

19:06:00 drmeister For compilation however, clasp is much, much slower than cleavir+clisp. Right now I have no idea why.

19:06:23 karlosz right, thats to be expected

19:06:24 drmeister But, I can compare them side-by-side and time things.

19:06:33 karlosz right

19:06:33 drmeister What is to be expected?

19:06:40 karlosz the fact that that micro benchmark is slow

19:06:45 karlosz clisp just ignores type declaration

19:06:47 karlosz ss

19:07:04 karlosz you want to compare them side by side and see where the bottleneck is?

19:07:26 drmeister Right - but why the compilers are so different in speed is a mystery to me. Clasp is running with native code everywhere.

19:09:04 karlosz well, you are probably doing much more work when compiling

19:09:06 karlosz like the llvm stuff

19:09:19 karlosz the cleavir stuff is very different from numeric code

19:09:31 karlosz too

19:09:45 karlosz so the generated code probably isnt that different in speed when comparing lots of generic function dispatch style stuff

19:10:30 drmeister llvm is part of it - maybe 50-90% of it. I need better timing.

19:10:48 drmeister Your code is doing lots of generic function dispatch as well - exactly as much as Clasp's

19:11:50 karlosz right, but i mean i dont think native code vs bytecode matters that much when it comes to those types of things

19:11:56 karlosz plus clisps generic dispatch may be faster

19:12:27 drmeister It's not - I've timed it. We use beach's fast generic function method.

19:14:35 karlosz i see.

19:16:19 drmeister Oh - nope - wrong common lisp - I compared ours to sbcl, ecl and ccl - not clisp.

19:16:25 drmeister https://usercontent.irccloud-cdn.com/file/Bk9ZcjL2/image.png

19:19:20 karlosz okay, i get the right answer for your fib test now

19:19:28 karlosz testing to see if i can still self compile now...

19:31:19 karlosz i might as well see if i can squash the warnings while im at it, but at least the numeric code is correct now

19:47:08 drmeister Can I pull the latest? Squashing the warnings would help me time things.

19:52:02 karlosz the latest fixes your fib function but im in the middle of squashing the warnings right now

19:52:06 karlosz i can let you know when its ready

19:52:14 drmeister I'll wait

19:52:28 drmeister No hurry. Work at your leisure.

20:04:09 karlosz drmeister: if you pull the latest and write a #-cleavir here: https://github.com/robert-strandh/SICL/blob/master/Code/Cleavir/Generate-AST/convert-form.lisp#L205

20:04:18 karlosz that will work around the warnings

20:04:25 karlosz a proper fix will take a bit longer

20:04:37 karlosz im going for a swim for now, hope that helps

20:05:00 drmeister Thank you very much. You swim? Good for you - so do I.

20:05:14 karlosz cool :)

20:29:22 kpoeck Will run cl-bench on todays clasp and compare to previous timings

20:31:02 drmeister kpoeck: Thanks - there is a switch in wscript called DEBUG_CCLASP_LISP - if you comment that out it won't force generation of debug frames for every function.

20:31:41 drmeister A debug frame means a call on entry to the function and another on exit to push and pop a shadow stack frame.
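
To make the cost concrete, here is a small sketch of the push-on-entry, pop-on-exit pattern. It is not Clasp's implementation: `*shadow-stack*`, `with-debug-frame`, `inner`, and `outer` are hypothetical names, and a plain list stands in for the real shadow stack.

```lisp
(defvar *shadow-stack* '()
  "Hypothetical shadow stack: a list of function names, innermost first.")

(defmacro with-debug-frame ((name) &body body)
  "Push NAME on entry and pop it on exit, including non-local exits."
  `(progn
     (push ',name *shadow-stack*)
     (unwind-protect (progn ,@body)
       (pop *shadow-stack*))))

(defun inner ()
  (with-debug-frame (inner)
    ;; Snapshot of the stack as seen from the innermost frame.
    (copy-list *shadow-stack*)))

(defun outer ()
  (with-debug-frame (outer)
    (inner)))

;; (outer) => (INNER OUTER), and *shadow-stack* is empty again afterwards.
```

Every call pays for one push and one pop regardless of how little work the body does, which is why this kind of bookkeeping shows up mostly in small functions called from tight loops.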

20:31:58 drmeister clisp+cleavir is much better without all those warnings.

20:32:04 kpoeck So i need distclean after that change

20:32:08 kpoeck ?

20:32:13 drmeister Yes.

20:33:08 kpoeck Will do (well 1 run with current settings and 1 after recompiling)

20:33:59 drmeister That would be great. It should really just hurt function calls in tight loops.

20:34:17 kpoeck But the best speed test is probably recompiling clasp (last step)

20:36:31 drmeister That's the most complicated thing we do - yes.

20:36:43 drmeister I turned off the debug frames and ran the fibn demo again.

20:37:14 drmeister Clasp 10.8secs for 10,000,000 evaluations of the 78th Fibonacci number

20:37:28 drmeister cleavir+clisp: 74 secs

20:37:50 drmeister So - better - but still slow by a factor of about 4 relative to a year ago.

20:46:18 drmeister Bike: Inlining of math - is that working?

20:46:28 drmeister Here's the llvm-ir for the 'fibn' function

20:46:29 Bike in dev, sure

20:46:36 drmeister https://usercontent.irccloud-cdn.com/file/BUyEX5K4/cfg.FIBN%5ECOMMON-LISP-USER%5EFN%5E%5E.dot.pdf

20:47:31 drmeister Here's the original code...

20:47:38 Bike what's with the tail calls

20:47:57 drmeister https://www.irccloud.com/pastebin/k4bmjs3t/

20:48:04 drmeister Yeah - who ordered those?

20:48:12 Bike of cc_safe_fdefinition

20:48:16 Bike i mean... that's not a tail call

20:49:13 drmeister I'm not sure

20:49:21 Bike there's definitely some crap happening here

20:49:25 drmeister It works - should it work if it's tail calls?

20:49:26 Bike extractvalues and geps and stuff

20:55:38 drmeister It's calling primop:inlined-two-arg--

20:56:45 Bike maaan, what.

20:57:18 drmeister Yeah - all three inlined-two-arg--, inlined-two-arg-+, and inlined-two-arg-<

20:57:22 drmeister It's calling them all.

20:57:32 drmeister Inlining is broke.

20:57:38 Bike fuuuuuck

20:57:45 Bike i already had to deal with _one_ broken inlining mechanism :(

20:58:28 drmeister Well, this is AST level inlining - correct?

20:58:37 Bike yes

20:58:42 drmeister Can I run any tests?

20:59:04 Bike you just did and clasp failed

20:59:58 drmeister This one should be easy though - it worked before.

21:00:11 drmeister It could be the messing around we did with copying ASTs

21:00:29 Bike i don't think that would do it, and i got rid of it anyway

21:01:09 Bike can you (describe (cleavir-env:function-info nil 'primop:inlined-two-arg-+))?

21:02:28 drmeister No AST

21:02:29 drmeister https://www.irccloud.com/pastebin/DPNA03gG/

21:03:19 drmeister Or my messing around with functor.h broke it

21:03:30 Bike ...functor? what?

21:03:49 Bike https://github.com/clasp-developers/clasp/blob/dev/src/lisp/kernel/lsp/evalmacros.lsp#L131-L132 well that's still there

21:03:49 drmeister The c++ header file that describes function objects

21:03:56 Bike yes, but how would that affect this

21:04:12 Bike can you try like, (declaim (inline foo)) (defun foo ()) and see if foo has an ast

21:04:26 Bike probably in a compile-file

21:04:52 Bike well, i can probably figure it out tomorrow

21:04:58 drmeister How do I get the AST?

21:05:10 Bike just look at the function info again

21:05:16 drmeister I see it.

21:05:32 Bike oh, it does have one? maybe defun inline hook is just messed up during build or something.

21:05:46 drmeister Yeah - that does have an AST

21:06:06 drmeister https://www.irccloud.com/pastebin/hiWfpMvy/

21:06:35 drmeister This would explain why sometimes incremental build breaks with errors about inlined-two-arg-+ being missing.

21:06:41 Bike yes it would.

21:06:52 Bike probably something dumb about building then.

21:07:00 drmeister I didn't realize that inlined-two-arg-+ was even tied to a function - I thought it was just a symbol.

21:07:24 drmeister Huh - it has an fdefinition

21:07:34 Bike yeah, it's just defined with defun

21:07:38 drmeister (fdefinition 'primop:inlined-two-arg-+) --> #<FUNCTION PRIMOP:INLINED-TWO-ARG-+>

21:07:50 Bike i guess we could have it have no runtime definition and thereby get errors earlier

21:08:09 Bike https://github.com/clasp-developers/clasp/blob/dev/src/lisp/kernel/clasp-builder.lsp#L573-L576 there's always this crap

21:08:35 Bike maybe i should fix up the sandbox build again just because i hate this

21:09:49 drmeister https://github.com/clasp-developers/clasp/blob/dev/src/lisp/kernel/cleavir/inline-prep.lisp#L130

21:13:32 drmeister I see - inlined-two-arg-+ HAS to have a definition - at least once otherwise there would be no AST.

21:14:00 Bike we could set up the ast without defining the actual function if we wanted.

21:14:30 Bike it's kind of haram, but we're the implementation anyway

21:14:32 drmeister Or set the fdefinition of inlined-two-arg-+ to a function that generates an error?

21:14:43 Bike or that
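
A sketch of the second idea, giving the symbol a runtime definition that only signals an error. It uses a hypothetical stand-in symbol `demo-inlined-add` and a hypothetical helper `install-error-stub`; nothing here is taken from Clasp's sources.

```lisp
(defun install-error-stub (name)
  "Make NAME's global function definition signal an error when called."
  (setf (fdefinition name)
        (lambda (&rest args)
          (error "~S was called at runtime with ~S; it should have been inlined."
                 name args))))

;; DEMO-INLINED-ADD is a made-up stand-in for a primop-style function.
(install-error-stub 'demo-inlined-add)

;; Any call that escapes inlining now fails loudly instead of quietly
;; falling back to a full function call:
;; (funcall 'demo-inlined-add 1 2)  ; signals an ERROR
```

With a stub like this, a silently broken inliner would surface as an error on the first call rather than as a slowdown noticed in a benchmark.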

21:15:25 drmeister I don't see a reason why it's broken at the moment. It's clearly being defined and so the AST should be generated - wth?

21:16:37 Bike i'm just as confused

21:17:03 drmeister On another topic - karlosz says that he has cleavir compiling cleavir and I have a copy of it.

21:17:22 drmeister We can compare cleavir+clisp to cleavir+clasp timing

21:17:56 drmeister In this microbenchmark - even without inlining clasp generated code is 7x faster.

21:18:40 karlosz how long does running self compile take without the warnings?

21:19:40 drmeister There are a lot of warnings - he squelched most of them - but I'm running in a macOS terminal so that it's hopefully not such a big deal.

21:20:02 drmeister I'm timing right now

21:20:30 drmeister Oh - hi - I thought Bike asked that - that's why I said "he squelched".

21:27:54 drmeister karlosz: To compile the code three times takes 405 seconds

21:28:16 drmeister That's to run this:

21:28:16 drmeister https://www.irccloud.com/pastebin/x2ZTq4ZQ/

21:28:48 karlosz seems about right

21:29:18 drmeister After that running: (time (let ((sys::*load-compiling* t)) (load "cleavir/load.lisp")))

21:29:35 drmeister What does binding sys::*load-compiling* to T do?

21:29:46 karlosz it just makes load use the compiler instead of the interpreter

21:30:10 karlosz and *use-cleavir* redirects compile to cleavir's compile

21:30:45 karlosz otherwise it would just be loading those files with the clisp interpreter again and again

21:38:49 drmeister 81 seconds

21:39:06 drmeister How much Cleavir is it compiling here?

21:39:39 drmeister It's these...

21:39:40 drmeister https://www.irccloud.com/pastebin/FP4WwXQp/

21:39:55 drmeister And dependencies

21:40:58 karlosz tes

21:41:00 karlosz yes

21:41:30 karlosz not sure how much is there but that includes stuff like alexandria, eclector, etc...

21:45:24 drmeister I'm starting over and I'll compare clasp to clisp

22:25:11 drmeister In clasp (asdf:load-system :alexandria :force t) returns immediately after the first time I run it.

22:26:05 karlosz clear your fasls and try?

22:26:14 drmeister Good point

22:26:30 drmeister cleavir in clisp compilex alexandria in 15.2 sec

22:26:59 drmeister Argh

22:27:06 drmeister cleavir in clisp compiles alexandria in 15.2 sec

22:27:11 drmeister cleavir in clasp ...

22:28:13 drmeister 84.4 secs

22:28:34 karlosz thats not too bad

22:28:46 drmeister No - not as bad as I expected.

22:29:00 karlosz thats about the time i had for clisp+cleavir before i switched my basic block representation

22:29:07 karlosz maybe there is still some low hanging fruit for you all

22:30:33 drmeister Clasp conses about 2.5x more memory

22:31:19 drmeister https://www.irccloud.com/pastebin/oc8ZknVI/

22:31:22 drmeister That is Clasp

22:31:37 drmeister And below is the TIME output of clisp

22:31:38 drmeister https://www.irccloud.com/pastebin/iU64ZUqG/

22:32:41 Bike the more memory is interesting.

22:32:58 drmeister If the "LLVM time" and "clang link time" is accurate (and I'm not sure they are) then 37.3 seconds is spent in llvm/clang

22:33:48 drmeister So about 47 seconds in Clasp

22:35:10 drmeister So Clasp running cleavir is about 3x slower than clisp running cleavir and it uses about 2.5x the memory.
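
The arithmetic behind those figures, written out as a short sketch. The three timings are the ones quoted in the log; the helper names are made up for illustration, and the split assumes the reported LLVM and clang link times are accurate.

```lisp
(defun time-outside-llvm (total llvm-and-clang)
  "Seconds left after subtracting the reported LLVM and clang link time."
  (- total llvm-and-clang))

(defun ratio-of (slow fast)
  "How many times larger SLOW is than FAST."
  (/ slow fast))

(let* ((clasp-total 84.4d0)  ; cleavir in clasp compiling alexandria, seconds
       (llvm-clang  37.3d0)  ; reported LLVM + clang link time, seconds
       (clisp-total 15.2d0)  ; cleavir in clisp compiling alexandria, seconds
       (clasp-own (time-outside-llvm clasp-total llvm-clang)))
  (format t "Clasp outside llvm/clang: ~,1F s~%" clasp-own)
  (format t "Clasp own time vs clisp:  ~,1Fx~%" (ratio-of clasp-own clisp-total))
  (format t "Clasp total vs clisp:     ~,1Fx~%" (ratio-of clasp-total clisp-total)))
```

This gives roughly 47.1 seconds outside llvm/clang and a ratio of about 3.1x against clisp. Including the llvm/clang time, the end-to-end ratio is closer to 5.6x.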

22:36:45 drmeister I need to spend some time looking at how llvm time and clang link time are calculated.

22:39:24 karlosz it could just be due to clasp's cleavir doing different optimizations than clisp

22:42:16 drmeister Do you do the same HIR optimizations that we do?

22:42:26 drmeister Do you have the same HIR stages we do?

22:42:36 karlosz not at all

22:42:50 Bike i suspect that's not the problem though.

22:42:51 karlosz look at tools.lisp clisp-hir-transformations

22:42:57 Bike (though the basic block thing might be interesting)

22:43:02 drmeister https://www.irccloud.com/pastebin/NRppgCy7/

22:43:44 karlosz so i dont do any kildall type inference or eliminate-load-time-value-inputs

22:43:59 karlosz besides that it doesnt look that different

22:44:05 karlosz i just do ssa conversion and some stuff with that

22:44:07 Bike we don't do kildall most of the time.

22:45:47 drmeister Would it be interesting to add a facility to TRACE that reports time and memory allocation?

22:46:53 Bike you mean in the course of the call? that's kind of weird, but i guess it could be interesting

22:47:10 Bike i think a general profiler would be better for it though

22:47:26 drmeister karlosz: How is profiling in clisp?

22:48:19 Bike maybe start with an ability for TIME to do detailed memory breakdowns and use it in trace if we want

22:51:12 drmeister What is the metrics stuff in Cleavir?

22:52:06 drmeister I can't find it now - I thought there was something like with-metrics

22:52:36 Bike with-meter.

22:52:43 drmeister Yeah - what's that for?

22:53:18 Bike it doesn't look at memory allocation or anything

22:54:06 drmeister Sounds like it could use an API for implementation dependent memory allocation tracking.
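
One possible shape for such an API, as a sketch only. `*bytes-consed-function*` and `call-with-measurements` are hypothetical names, not part of Cleavir or Clasp; the hook is where an implementation would plug in its own allocation counter.

```lisp
(defvar *bytes-consed-function* (constantly nil)
  "Hypothetical hook: a function of no arguments returning the total bytes
allocated so far, or NIL when the implementation has no such counter.")

(defun call-with-measurements (thunk)
  "Call THUNK; return its primary value, elapsed seconds, and bytes consed (or NIL)."
  (let ((start-time (get-internal-real-time))
        (start-bytes (funcall *bytes-consed-function*)))
    (let* ((result (funcall thunk))
           (seconds (/ (- (get-internal-real-time) start-time)
                       internal-time-units-per-second))
           (end-bytes (funcall *bytes-consed-function*)))
      (values result
              seconds
              ;; Report a delta only when the hook actually returns numbers.
              (and start-bytes end-bytes (- end-bytes start-bytes))))))

;; (call-with-measurements (lambda () (make-list 1000)))
```

Keeping the counter behind a special variable means portable code only ever sees a number or NIL, and a tracing or metering facility could wrap each call in `call-with-measurements` without knowing which implementation it is running on.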