freenode/#clasp - IRC Chatlog
4:36:02
drmeister
You commented out inline.lisp and you could build cclasp - but asdf had a problem.
4:37:46
Bike
i don't really have a concept for what the problem could be. it's pretty rare to be able to build the whole of clasp and then hit a problem just with asdf
4:40:31
drmeister
It's happened to me several times. I have often built clasp and then failed when building or running one of those. Not for the reason you are seeing.
4:42:49
drmeister
I'm worried that it is the inlining criterion. This inlines local functions - could it be a recursive labels function?
11:18:29
kpoeck
Drmeister: I would like to fix interpret_token_or_throw_reader_error to allow all possible float-types in _STARreadDefaultFloatFormatSTAR
11:23:12
kpoeck
Am now compiling under the assumption that I can read both a DoubleFloat and a LongFloat with strtod
11:24:18
drmeister
No DoubleFloat and LongFloat are the same. I've toyed with the idea of using the GMP infinite precision float for long-float.
11:24:55
kpoeck
Still puzzled that SingleFloat is read with strtod, since clasp_make_single_float is defined to accept a float but is passed a double
11:25:46
drmeister
ShortFloat and SingleFloat are also confused - SingleFloat is a tagged C 'float'. I think (but I may have forgotten) ShortFloat is still managed by the GC.
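The pattern kpoeck is puzzled by can be sketched as follows. This is an illustration, not Clasp's actual code: strtod always parses into a C double, and producing a single-float then relies on a narrowing conversion to float. make_single_float here is a hypothetical stand-in for clasp_make_single_float.

```cpp
#include <cstdlib>

// Stand-in for clasp_make_single_float: declared to take 'float', so any
// double passed to it is narrowed (implicitly or explicitly) on the way in.
inline float make_single_float(float f) { return f; }

inline float read_single_float(const char* token) {
    double d = std::strtod(token, nullptr);           // strtod always yields a double
    return make_single_float(static_cast<float>(d));  // narrow to single-float precision
}
```

This is why reading a SingleFloat with strtod "works" despite the type mismatch: the double result is simply narrowed at the call boundary.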
11:29:45
drmeister
Yes. SingleFloat values are not stored on the heap at all - they are coded into the tagged pointer itself. Like fixnum or character.
11:30:15
drmeister
They are shifted a few bits, have the tag written into the lower three bits, and that's how they are represented.
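The immediate-float encoding drmeister describes can be sketched like this. The shift amount and tag value below are made-up placeholders (Clasp's real tagging scheme may use different bits), and a 64-bit word is assumed so the 32 float bits survive the shift:

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical tag value; Clasp's actual single-float tag may differ.
constexpr uintptr_t kFloatTag = 0b101;

// Encode a float directly into a tagged word: no heap allocation,
// just like fixnums and characters.
inline uintptr_t tag_float(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);              // reinterpret float as raw bits
    return (static_cast<uintptr_t>(bits) << 3) | kFloatTag;
}

// Decode: shift the tag back out and reinterpret the bits as a float.
inline float untag_float(uintptr_t word) {
    uint32_t bits = static_cast<uint32_t>(word >> 3);
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}
```

The round trip is lossless because all 32 bits of the float fit in the word above the 3-bit tag.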
11:31:38
drmeister
They could be aliased to SingleFloats... Are there any other ideas of what to do with them?
11:32:18
drmeister
What about LongFloat - I'd like to implement them with GMP arbitrary precision floats.
11:33:55
kpoeck
In ccl, for number crunching I used short-floats instead of single-float, since they would not cons
11:33:58
drmeister
kpoeck: Right - I did that with that in mind - I didn't tidy up ShortFloat and LongFloat because I didn't know what to do with them at the time. So I just left them.
11:36:21
drmeister
Cleavir has a way of doing a lot of math without boxing/unboxing operations that I am looking forward to using.
11:39:32
kpoeck
So I just extend interpret_token_or_throw_reader_error for short-float and long-float (testing right now)
11:40:31
kpoeck
On a different matter, could you perhaps test whether issue #531 also crashes your clasp?
11:42:43
heisig
drmeister: Currently, with the whole machine learning hype, 16bit half-floats have become fashionable again. So that could be a use case for ShortFloat_O.
11:50:58
kpoeck
Meanwhile running ansi-tests with the extended interpret_token_or_throw_reader_error
12:14:56
drmeister
heisig: What do you think about arbitrary precision floats - would it be a good idea to expose them as long-float?
12:35:12
heisig
Bike: This is wikipedia on 16bit floats: https://en.wikipedia.org/wiki/Bfloat16_floating-point_format
12:36:17
Shinmera
"The bfloat16 format is utilized in upcoming Intel AI processors, such as Nervana NNP-L1000, Xeon processors, and Intel FPGAs,[1][2][3] Google Cloud TPUs,[4][5][6] and Tensorflow.[6][7]"
12:36:58
heisig
drmeister: Arbitrary precision long-floats sound like a great idea. I know applications where this would be extremely useful.
12:38:33
drmeister
Ok then. short-float will be 16bit floats - I'll need to tweak the tagging scheme. long-floats will be arbitrary precision floats
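The bfloat16 format from the Wikipedia link above keeps an IEEE single-float's sign bit, full 8-bit exponent, and top 7 mantissa bits, so conversion is just a 16-bit shift. A minimal sketch (truncating conversion, ignoring rounding modes):

```cpp
#include <cstdint>
#include <cstring>

// Truncate a float32 to bfloat16 by keeping the high 16 bits:
// sign, the whole 8-bit exponent, and the top 7 mantissa bits.
inline uint16_t float_to_bfloat16(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);
    return static_cast<uint16_t>(bits >> 16);
}

// Widen back by zero-filling the dropped low mantissa bits.
inline float bfloat16_to_float(uint16_t h) {
    uint32_t bits = static_cast<uint32_t>(h) << 16;
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}
```

Because the exponent range matches float32 exactly, only mantissa precision is lost; values like small powers of two round-trip unchanged.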
12:43:21
drmeister
That is correct - single-floats do not cons - they are stored in tagged pointers.
12:46:32
drmeister
I was going to use them as pointers into aligned C++ memory - but that never panned out. Actually, I'll wait until Martin comes on board - he may have some use for another kind of tagged immediate.
12:47:15
stassats`
i think the only thing clasp will benefit from with 16 bit floats is reduced storage
12:48:09
stassats`
otherwise you'll need clasp to run on these TPUs or support SIMD or wherever these floats are used
12:48:40
drmeister
Well, there is interoperability with C++ machine learning libraries if we need to pass 16bit floats back and forth. Interoperability - as you mentioned above.
12:57:59
drmeister
What's up with a language-server-protocol server for Common Lisp? Does anyone use one?
13:00:02
drmeister
jupyterlab appears to be going in that direction - to use language server protocol.
13:00:44
drmeister
ACTION is rummaging around inside of jupyterlab discussion groups and github issue trackers
13:05:10
attila_lendvai
hi! I could use some insights/pointers... so, iclasp is a CL interpreter written in C++. aclasp is iclasp loading a bunch of files implementing a minimal CL system. what I don't understand is that compile-aclasp seems to be actually compiling stuff. how is that happening? what kind of compilation can iclasp do?
13:06:47
Bike
iclasp is the C++ interpreter. then it loads a bunch of files including a compiler, and compiles itself
13:09:31
drmeister
attila_lendvai: If you start from ./waf distclean configure build_iboehm <-- it builds the C++ interpreter iclasp-boehm
13:10:31
drmeister
Then you can use ./waf build_rboehm <-- This starts the C++ interpreter and has it load the aclasp Common Lisp source code as interpreted code - absolutely no Common Lisp compilation happens up to this point.
13:11:09
drmeister
stassats`: You got me: ./waf distclean configure build_iboehm <-- it builds the C++ interpreter of a subset of Common Lisp programming language: iclasp-boehm
13:12:22
drmeister
So: ./waf build_rboehm puts you in a REPL that loaded the aclasp/bclasp Common Lisp compiler as interpreted code. At this point you can compile-file a subset of Common Lisp code.
13:14:10
drmeister
./waf build_aboehm loads the same Common Lisp source files as interpreted functions and then runs 'compile-file' on all of the Common Lisp source files that ./waf build_rboehm and ./waf build_aboehm load.
13:16:07
drmeister
./waf build_aboehm then links the compile-file'd code into a fasl called clasp/build/boehm/fasl/aclasp-boehm-image.fasl <--- Run with this and you get what you get when you run ./waf build_rboehm but everything is compiled. It provides a subset of Common Lisp and can further be used to compile a complete Common Lisp: 'bclasp'
13:17:52
drmeister
The reason for all of this is that the interpreter written in C++ is an S-expression walking interpreter and it expands macros whenever they are encountered during evaluation. This is really, really slow when you start hitting a lot of macros as in loop and CLOS.
13:35:14
drmeister
As ./waf build_aclasp compile-file's its own code, it LOADs the fasl files. This replaces interpreted functions with compiled ones as they are generated. This happens in both the serial and parallel build. The code is set up to be tolerant of hot-swapping interpreted code with compiled versions in any order.
13:35:58
attila_lendvai
and aclasp is compile-file'ing how? aclasp is already capable of calling the LLVM machinery at runtime?
13:36:28
drmeister
Yes - every stage of the Clasp build has full access to all of the llvm machinery.
13:37:30
drmeister
If you run iclasp-boehm -I -n, it loads absolutely no CL code, either interpreted or compiled.
13:38:43
drmeister
You get a repl and a very restricted subset of the Common Lisp language - but everything that is exposed from C++ is exposed here.
13:41:50
drmeister
It's better to build aclasp with fewer parallel processes than more - because more of the code can then take advantage of the compiled fasls that replaced the interpreted code. I think I currently limit aclasp compilation to 8 parallel processes. I haven't timed it carefully.
13:42:54
attila_lendvai
FTR, it'll be here: https://github.com/clasp-developers/clasp/wiki/The-Build-Process/
13:43:50
attila_lendvai
there's docs/bootstrap.rst. maybe I'll extend that and point to it from the wiki. I'll think about it
13:58:35
attila_lendvai
registerClasses.py and src/common/build is obsoleted by the CL scraper that is run by sbcl and wscript, right? I'll delete anything in the codebase and docs related to it. let me know if that's not desirable!
14:00:08
attila_lendvai
or is the python groveler in a good enough state that it's worth keeping it in some tools-for-build/obsolete/ directory?
14:01:23
drmeister
Working backwards. The python groveler can be gently lowered into a grave and buried. It is dead.
14:03:07
attila_lendvai
drmeister: it's referenced in the docs, and some of its output is checked into the repo (found by rgrep registerClasses .)
14:07:18
attila_lendvai
drmeister: but I welcome your impulses for cleaning the codebase! it's very nice for the newcomers, thanks! :)
14:08:34
drmeister
It's probably better to do it slowly and methodically. I don't instantly recognize what is being used and what is not.
14:12:45
attila_lendvai
drmeister: I have the following left in src/common. let me know if any of those should also get "gently lowered into a grave"... :) analyzeLldbBacktrace.py build-json.lisp buildJSON.py copywrite.lsp metering-allocations.lisp peakMemory.cc
14:20:38
attila_lendvai
drmeister: I assume analyzeLldbBacktrace.py can also be disappeared. I moved the rest that seemed useful to tools/ and I'll delete src/common/
14:23:57
drmeister
Actually, can you move analyzeLldbBacktrace.py to tools? I have used that to analyze the size of stack frames to figure out if there are functions that allocate too much on the stack.
14:30:29
attila_lendvai
I've pushed it. let me know if I happened to delete anything that you want restored: https://github.com/clasp-developers/clasp/commit/61e2b47c0299baaf89e24c08b1b4a7159a860c64
15:02:37
karlosz
beach: it's not explicitly written in the docs, but we are assuming that for phi functions, the order of the predecessors has to match the order of the inputs, right?
15:03:03
karlosz
some code you wrote seems to assume that, and otherwise we have to do the dominator calculations over and over again to find the corresponding input for each predecessor
15:03:33
karlosz
but things like delete-instruction shuffle the order of the predecessors, like if you deleted the first phi of a cluster
15:04:02
beach
Yes, but that is also the reason why I strongly dislike SSA. It imposes such an order whereas the flow graph itself does not. It ruins everything.
15:04:48
beach
And, you are right, I did not take predecessor order into account when I wrote the code to modify the graph.
15:06:17
karlosz
maybe phi instructions can have a mapping between inputs and predecessors as an extra slot?
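karlosz's suggestion could be sketched like this: store (predecessor, input) pairs together in one slot, so the correspondence survives reordering or deletion of predecessors instead of relying on positional matching between two parallel lists. Block, Value, and PhiInstruction here are hypothetical placeholder types, not Cleavir's actual classes:

```cpp
#include <algorithm>
#include <utility>
#include <vector>

struct Block { int id; };   // placeholder basic-block type
struct Value { int n; };    // placeholder SSA-value type

struct PhiInstruction {
    // Each predecessor edge carries its input explicitly.
    std::vector<std::pair<Block*, Value*>> incoming;

    // Look up the input flowing in along a given predecessor edge.
    Value* input_for(Block* pred) const {
        for (const auto& p : incoming)
            if (p.first == pred) return p.second;
        return nullptr;  // pred does not reach this phi's block
    }

    // Deleting a predecessor removes its input too; the remaining
    // pairs stay correct no matter how they are ordered.
    void remove_predecessor(Block* pred) {
        incoming.erase(
            std::remove_if(incoming.begin(), incoming.end(),
                           [&](const std::pair<Block*, Value*>& p) {
                               return p.first == pred;
                           }),
            incoming.end());
    }
};
```

With this representation, operations like delete-instruction can shuffle predecessors freely without breaking the phi, at the cost of a small lookup instead of positional indexing.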
15:06:47
beach
I especially dislike SSA because I am convinced (but I have yet to do the research) that basically none of the optimization techniques that claim to need SSA takes advantage of the fact that SSA is really S.S.A.
15:08:38
karlosz
beach: https://en.wikipedia.org/wiki/Sparse_conditional_constant_propagation this is the algorithm that i just implemented that requires SSA
15:09:04
karlosz
it runs in linear time with SSA, and i think it'd be very difficult to write without assuming definitions are unique and dominate uses
15:09:34
karlosz
the effects are folding something like https://paste.gnome.org/pp2hrvkia to just values 4 4 1 2 3 4
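The core of SCCP is a three-level value lattice: Top ("no information yet"), a known constant, or Bottom ("overdefined"). Definitions start at Top and can only move down the lattice, which is what bounds the work and gives the near-linear running time karlosz mentions. A minimal sketch of that lattice (not karlosz's implementation):

```cpp
struct LatticeValue {
    enum Kind { Top, Const, Bottom };
    Kind kind;
    long value;  // meaningful only when kind == Const
};

// Combine the facts flowing into a phi from different predecessors.
// Top is the identity; two equal constants stay constant; anything
// else (two different constants, or Bottom) is overdefined.
inline LatticeValue meet(LatticeValue a, LatticeValue b) {
    if (a.kind == LatticeValue::Top) return b;
    if (b.kind == LatticeValue::Top) return a;
    if (a.kind == LatticeValue::Const && b.kind == LatticeValue::Const &&
        a.value == b.value)
        return a;
    return LatticeValue{LatticeValue::Bottom, 0};
}
```

SSA is what makes this "sparse": each definition has exactly one lattice cell, and a change to it only re-triggers evaluation of its uses.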
15:11:09
beach
Like SSA, but put an assignment in each incoming arc of what used to be the Phi instruction.
15:11:37
beach
All I am saying is that SSA ruins everything, and no algorithm I know takes advantage of the fact that SSA is S.S.A.
15:12:48
beach
karlosz: But I have yet to do the research like I said. It would be a beautiful result though.
15:13:12
karlosz
i do the transformation you are talking about. to actually implement the phi functions, i convert the phi to an assignment instruction along each arc before lowering to code
15:13:40
karlosz
it's standard though, and still considered SSA in the literature, because those phi functions still have to be implemented somehow