freenode/#clasp - IRC Chatlog
Search
19:22:55
drmeister
Going the core dump route means the following: (1) We probably need to get rid of the Boehm GC support. (2) We need help from Ravenbrook to implement it. (3) We need to GC code.
19:28:20
drmeister
(4) We wouldn't compile-file the system files. We would LOAD the source code into memory and the top level forms would be compiled/JITted as they load. The JITted modules would be in the MPS memory and so they would be treated like everything else.
19:29:30
drmeister
(5) We would need to figure out what side-effects in C++ are taking place in our current startup and run them before we start the system back up again.
19:31:35
drmeister
This cycle of "(1) I've got a great idea to speed things up (2) I'm implementing it (3) Oh shit - the way we start up prevents me from deploying it because of <insert complicated bootstrapping explanation here>" is absolute bullshit.
19:38:41
Bike
ok, cut out a few bogus source infos, so using M-. on a function defined in the REPL now properly says it doesn't know the source
19:38:47
attile_lendvai
drmeister: in bits.cc there's a templated version of the bit array operations. Are they really worth it, performance-per-complexity-wise? If I understand correctly it only saves a couple of memory dereferences
19:42:20
drmeister
Saving dereferences in tight loops is important, isn't it? That's why I implemented that.
19:43:43
attile_lendvai
drmeister: did any of those show up in a profiling session? Because it's quite some extra complexity...
19:44:56
drmeister
How about we move that code into a wiki page so I can remember how to implement the template version if I ever need it.
19:45:33
drmeister
I figured out yesterday that there is a big chunk of llvm time that I wasn't counting when I count llvm time.
19:46:42
drmeister
It turns out the inline pass takes about 80% of the time when I (compile nil '(lambda (x y) (+ x y))) repeatedly
20:33:39
kpoeck
I spent some time making CL_DEFUN T_sp core__bit_array_op(int opval, Array_sp tx, Array_sp ty, T_sp tr) more correct
20:36:29
drmeister
kpoeck: We are thinking of throwing out the template version - you are talking about the non-template version - correct?
20:40:11
drmeister
This is consistent with what I've been seeing for the past two days. llvm time is substantial - about 50%
20:53:17
drmeister
When I invoke TIME from within clasp-builder.lisp - I don't see any TIME output. It's supposed to send it to *trace-output*. I'm thinking that's the same as *error-output* and maybe waf has rerouted it?
20:53:43
drmeister
If anyone else has any other insight why I get no TIME output - I'd love to hear it.
20:54:44
drmeister
It will do that - and then it may double when it tries to link everything. For my laptop that was an extinction level event and I didn't see the other side of it.
20:55:35
drmeister
(defstruct (xxxx (:type vector) :named) ...) makes it pause for 10-20 seconds! But other than that it's chugging along.
21:42:59
drmeister
I should probably switch back to bitcode .bc files rather than human-readable .ll files.
22:11:52
drmeister
The allinone build generated a .o file - so it works - but it's a lot less relevant now that we are looking towards dumping image files.
22:20:24
attile_lendvai
::notify kpoeck do you mean the changes that are already checked in the repo? if yes, then of course, that'd be a major offense... :) if you mean changes that are not in dev, then do warn me
1:57:32
drmeister
Loading and compiling foundation.lsp with the JIT - with no llvm optimization - takes 1.039 seconds
2:04:37
drmeister
At 0 - there is no inlining and no other optimization - just native code generation.
2:39:09
drmeister
I compile-file 69 source files with optimization level 3 using code compiled at optimization level 3 -->
3:00:03
drmeister
I'm going to try a different trade-off: knock *optimization-level* down to 0 and build everything.
3:10:10
drmeister
I might be able to do inlining in a different way - more deliberate inlining at call sites.
3:10:50
drmeister
But still - we are balanced right now with the bclasp compiler - where it takes about half the compilation budget and llvm takes the other half.
3:12:44
drmeister
I figure if I compile everything with *optimization-level* 0 and then compile-file the 69 source files at optimization level 3 (as above) - that will be a decent comparison of optimized code vs unoptimized code.
3:53:48
beach
karlosz: &key processing, if performance critical, could be done with a compiler macro.
3:54:24
karlosz
beach: can that be done for local functions too? this is in the context of inlining local functions
3:55:32
beach
karlosz: I don't expect many local functions to 1. be performance-critical and 2. have keyword arguments.
6:28:16
Colleen
kpoeck: attile_lendvai said 8 hours, 7 minutes ago: do you mean the changes that are already checked in the repo? if yes, then of course, that'd be a major offense... :) if you mean changes that are not in dev, then do warn me