freenode/lisp - IRC Chatlog

8:45:44 LdBeth hi nydel

9:08:56 russellw I just discovered something weird. On theoretical grounds, vectors should be faster than lists, right? Well, tried a simple test just now in SBCL, find vs member, and lists are faster... by 3 orders of magnitude

9:09:45 nydel LdBeth: hello there :)

9:11:41 nydel LdBeth: nice to see you here. working on/playing with anything fun?

9:19:43 russellw The above is also true of CCL. Presumably they just put much more effort into optimizing lists

9:20:05 beach russellw: Three orders of magnitude? A factor 1000?

9:20:16 russellw beach, yes

9:20:29 beach That doesn't sound right.

9:20:45 beach I think your test must be measuring something else.

9:21:32 nydel russellw: would you share the test forms?

9:22:08 russellw it absolutely does not sound right, to the point where I would've not believed it had I not seen the results myself, so I would definitely encourage you to try it

9:22:11 russellw (time(dotimes(i 100000000)(find 'z #(a b c d e f g h i j k l m n o p q r s t u v w x y z))))

9:22:31 russellw (time(dotimes(i 100000000)(member 'z '(a b c d e f g h i j k l m n o p q r s t u v w x y z))))

9:22:55 russellw adjust loop counts as necessary

9:23:13 beach russellw: Try using sequences of length 1000000 instead of that short.

9:23:51 beach russellw: Also, compilation time is included in your timing.

9:23:56 beach Stick those things in functions.

9:25:23 beach russellw: In SBCL with a million elements, using FIND on the list and on the vector gives indistinguishable times.

9:25:38 beach russellw: MEMBER is much faster, but not a factor 1000.

9:26:04 beach russellw: It is also likely that the compiler removes your code entirely in one case.

9:26:07 russellw beach, in practice you would use a hash table with 1000000 elements. The above is not an artificial test; it reflects a real pattern that I use somewhat frequently, which is why I was interested in the relative times. I think we both know compilation time is not going to be significant here, but I would by all means encourage you or anyone else interested, to try variants of the test and report

9:26:07 russellw the results

9:26:23 russellw the compiler does not remove the code entirely, as can be verified by trying different loop counts

9:27:00 russellw indistinguishable times with a million elements is an interesting data point; it is at least considerably closer to what one would expect

9:27:07 beach Anyway, you can not design a performance test that way. It will measure something else.

9:28:27 russellw there is no such thing as a perfect performance test, but I think the above is sufficient grounds to drop my tentative plan to start using find instead of member

9:28:30 beach With 10 million elements, find gives 0.080 seconds in one case and 0.079 in the other.

9:29:01 beach MEMBER gives 0.021 seconds, so it is 4 times as fast.

9:29:37 beach But this result only reflects the fact that MEMBER is better optimized than FIND. Not that any of the data types is "faster" than the other.

9:31:00 russellw So not as outlandish, but still interestingly different from what would expect on theoretical grounds. and if X is better optimized than Y, then X tends to end up faster than Y. That's the whole point of optimization. And was also my suggested explanation in the first place

9:32:05 beach russellw: No, you said that they put more effort into optimizing lists.

9:32:25 beach russellw: But in reality, they probably put more effort into optimizing MEMBER.

9:32:30 ggole Lists might show more variance in performance if you don't create them all at once

9:33:13 ggole The conses will likely be contiguous given the way you have constructed the list.

9:33:15 russellw which they clearly did, as your own results verified. the same function was the same speed on lists and vectors in the asymptotic case, which you only get if you put more optimization into lists, that by default would be slower

9:33:35 beach Whatever.

9:33:58 russellw ggole, true

9:36:34 no-defun-allowed maybe there's cache coherency but that's about it for lists

9:36:52 nydel russellw: if you compose a hypothesis to mark the beginning of these tests, what is it, and is it proven? if so, have you tried to disprove it?

9:38:07 russellw nydel, my hypothesis was that vectors would be faster than lists for testing membership. That hypothesis has been comprehensively disproven

9:40:13 ggole If this operation is consuming a lot of time, you could (probably) do a lot better than switching between member and find

9:40:24 ggole eg, using binary search

9:41:38 russellw ggole, with many elements yes, or a hash table as I mentioned, but I am looking at scenarios where there would only be a handful of elements. Constant factors matter when N is small

9:42:04 ggole Binary search is good even when n is quite small

9:42:38 scymtym here, all three examples compile to the same (empty bodied) loop and take the same time to execute. this is not surprising as the compiler can fold ({member,find} 'z '(… z))

9:43:31 russellw scymtym, yeah, that doesn't surprise me. which compiler?

9:43:39 scymtym sbcl

9:43:56 russellw ah, setting higher optimization level?

9:44:14 scymtym no, very safe and slow

9:44:53 russellw then what did you do differently? because it did not optimize away the loop body when I tried it

9:45:28 scymtym (lambda () (dotimes (i 1000000) (find 'z '(a … z)))) (disassemble *)

9:45:47 scymtym how did you check what it did/didn't optimize?

9:47:10 russellw by noticing that the run time stayed proportional to the number of iterations

9:47:42 russellw I did not wrap it in a lambda, which you just did. So I was wondering if that made the difference. But I just tried your version now...

9:48:55 russellw https://pastebin.com/6T6HxkGE

9:48:56 scymtym taking time proportional to the iteration count only means that the loop wasn't optimized away entirely. the body still could be and indeed is

9:49:34 scymtym maybe that SBCL version didn't fold FIND

9:49:59 russellw well the constant of proportionality was a lot more than an empty loop. Anyway, the disassembly I just posted, does not look to me like the loop body was optimized away. This is with a recent version of the compiler.

9:54:11 LdBeth nydel: just finished writing assignment in JS

12:41:40 shka_ keep-if

12:41:53 shka_ what do you think about ONLY and WITHOUT?

12:43:20 beach Hmm.

12:43:44 beach I would prefer something like INCLUDE and EXCLUDE for symmetry. But it's moot. We have what we have.

12:48:12 dlowe I like FILTER and SELECT

12:48:55 beach Well, like we said before, "filter" is ambiguous. It can mean "keep" or "reject".

12:49:17 beach You seem to use it as "reject".

12:49:27 beach ... since you have SELECT.

12:49:47 beach But other people use it as "keep", apparently.

12:50:02 beach ... some other people...

12:50:47 beach REMOVE-IF and REMOVE-IF-NOT are fine, because REMOVE means non-destructive.

12:50:50 ggole The nice thing about remove/remove-if is that the name fits beautifully for the non-function case, which is not really true of select

12:50:55 dlowe right. As unwieldy as REMOVE-IF-NOT is, it is really obvious what it does

12:50:58 beach None of the alternatives suggest that.

12:51:36 ggole And (reject 1 seq) doesn't really read like "remove one from seq" to me

12:53:03 shka_ beach: in CL, yes, I have my own library

12:53:21 shka_ and i did not submit into QL so i can change names

12:53:26 dlowe yeah, it's very very easy to make wrappers if you don't like the names

12:54:29 shka_ it is not wrapper around CL functions

12:54:32 shka_ cl standard functions

12:54:40 ggole remove-if and remove-unless?

12:55:00 shka_ remove-when more like

12:55:24 ggole So remove-when and remove-unless, matching the macros. Hmm.

12:55:33 ggole That does make a certain sense.

12:55:59 dlowe yeah, those are good too

12:56:07 shka_ hm

12:56:49 dlowe To the time machine!

12:57:12 shka_ don't try to change the past

12:57:23 shka_ try to shape the future ;-)

13:00:38 dlowe impatient people still use the time machine :p

13:05:21 warweasle I used the time machine. The closest I could get everything back to normal ended up with Trump. Frankly, I just quit trying after that.

13:05:51 warweasle Although the trains in Nazi-controlled America are AMAZING!

13:06:04 warweasle Now we are back to volvos.

13:07:37 jmercouris warweasle: What are you talking about?

13:07:50 dlowe sorry, in this timeline we only talk about common lisp on this channel

13:07:57 shka_ warweasle: thanks, my day was not going well, but this was really funny

13:11:52 dale_ ** NICK dale

13:13:56 shrdlu68 Good afternoon

13:13:56 minion shrdlu68, memo from pjb: trees without pointers: https://pastebin.com/yreWM7NW

13:47:00 shrdlu68 minion: memo for pjb: I'll give this a try, thanks!

13:47:00 minion Remembered. I'll tell pjb when he/she/it next speaks.

14:39:09 veinoflegaladvic ** NICK veinofincubus

14:45:27 cl-arthur games

14:45:34 beach ?

14:46:06 cl-arthur typo :)

15:07:16 astronavt_ ** NICK astronavt

15:19:30 vtomole Hey beach, how's SICL coming along?

15:20:37 beach vtomole: Making steady progress. Thanks!

15:21:45 beach vtomole: There is now a #sicl IRC channel where I can blab freely about what I am doing.

15:21:56 beach It is even logged by Shinmera's bot

15:22:04 vtomole I'll be there

15:23:20 beach I am making very good progress on bootstrapping, and since Clasp seems to want to replace LLVM (which is very slow) by Common Lisp code generating x86-64, I am working on HIR-to-MIR which will eventually be possible to translate to x86-64.

15:28:44 vtomole beach: Replacing LLVM in Clasp seems like a huge project. Good luck to those involved! Why is it slow?

15:29:28 beach Luckily, we don't have to replicate all of LLVM. Having said that, I am convinced that it would be much easier to write something like that in Common Lisp than in C++.

15:29:42 beach It is apparently slow because it is modular.

15:30:16 beach I often say "it is impossible to write a C++ program that is both modular and fast", and here is another data point in that direction.

15:30:24 astalla Don't you lose C/C++ interoperability though, which was the whole point of Clasp?

15:30:46 beach astalla: That's what I would have thought, but drmeister says that he can handle that.

15:31:34 vtomole C/C++ interoperability? Is CFFI not enough?

15:31:43 beach Apparently not, no.

15:32:06 djeis[m] I guess it could still make use of LLVM specifically for the C/C++ interop.

15:32:21 djeis[m] For generating the proper shims and the like.

15:32:39 djeis[m] While leaving the bulk of codegen to something faster.

15:32:43 beach I know very little about this topic. You would have to ask drmeister.

15:33:26 djeis[m] Yea

15:34:39 djeis[m] Generating code to access C++ directly is a lot more complicated than just what you can get with CFFI, because C++ is all kinds of nonsense at the ABI level.

15:35:17 astalla vtomole: as far as I know, to properly link to C++ code without second-guessing whatever compiler was used to compile it, you need a C++ parser, and C++ syntax is complex enough that a proper parser is very hard to write and basically it requires a whole C++ compiler.

15:36:34 djeis[m] Not just a C++ parser, you have to replicate the name mangling proceedure of whatever C++ compiler that lib was compiled with.

15:36:46 beach I had assumed that LLVM was a requirement for such interoperability to work, but I only recently learned that drmeister had plans to generate x86-64 code directly.

15:37:22 ggole That's always fun.

15:37:23 beach djeis[m]: And the (apparently Turing-complete template code).

15:37:38 beach djeis[m]: And the (apparently Turing-complete) template code, I mean.

15:38:11 djeis[m] I'm fairly certain that LLVM would still be needed for the interop itself, but there's no reason that codegen for the bits that don't involve interop could be done using some other compiler.

15:38:20 djeis[m] asm is asm.

15:38:54 ggole In practice there's other junk like debug data

15:39:12 ggole You can reimplement all (or some) of that, but it's a lot of work

15:39:26 astalla djeis[m]: good point.

15:41:04 djeis[m] *couldn't be done lol

15:41:33 vtomole Does Clasp compile to LLVM IR now instead of x86-64?

15:48:01 beach djeis[m]: drmeister (who is my house guest at the moment) says that all that is required is to follow the calling convention, which is fairly easy.

15:48:48 beach They do plan to generate DWARF debug information from Common Lisp code.

15:48:56 beach vtomole: Yes.

15:49:27 beach I need to go entertain my guests.

15:53:06 ggole The *c++* convention? I don't think what each compiler does is even documented.

15:54:32 ggole For instance, there's a trick which clang(?) does where if every override for a virtual function returns a constant, they replace the function pointer with the relevant constant in the vtables

15:55:07 ggole If you have to reverse-engineer that sort of implementation quirk, you'll be in for quite a ride.

15:58:33 djeis[m] beach: That makes sense I suppose.

16:30:11 razzy beach: i admire you both, drmeister and you :]

16:32:52 razzy warweasle: nazis just love trains :]. who could blame them :].

16:33:06 razzy soooo efficient

16:39:14 shka_ good evening

16:39:33 Shinmera razzy: please try not to keep off-topic chatter going, especially when it's politically charged.

16:40:13 russellw what's the best way to test a variable for being equal to one of several strings? with symbols it's (member x '(a b c)) but that uses eql so doesn't work with strings

16:40:25 dlowe russellw: you can specify :test

16:40:31 shka_ russellw: :test #'string=

16:40:40 russellw ah, thanks!

16:40:48 shka_ or use hashtable

16:41:09 Shinmera a hash-table with standard test won't help either

16:41:17 shka_ yes, that's true

16:41:31 shka_ you will need also specify test for the hash table

16:41:39 shka_ good catch

16:41:50 razzy Shinmera: sorry, i feel ashamed. the politics charge did not occur to me, really

16:42:09 shka_ dear #lisp, i have a question

16:42:54 shka_ data frame implementation i am working on, will use copy on write logic to only copy parts of data frame that are actually changed

16:43:40 shka_ however, obviously, it would be beneficial to check if operation on frame actually changes it

16:44:00 shka_ in order to take burden away from the user

16:44:42 shka_ i was thinking on it, and it appears to me that eql would be the right way to do so

16:44:53 shka_ however, there is one corner case that i see

16:45:18 shka_ namely (make-symbol) and property lists of symbols

16:46:10 shka_ is it worth it to create custom logic for this case, or is it better to ignore this?

16:47:18 shka_ or perhaps there is other corner case, where eql would not be sufficient

17:47:16 warweasle ** NICK warweasle_afk

19:03:40 veinofincubus ** NICK veinofKoRn

19:20:54 warweasle_afk ** NICK warweasle

20:30:04 Myon svillemot: Hi, are you taking over cl-nibbles from dim, or should I try uploading the fixes for https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=908641 ?

20:30:30 drmeister Hello everyone.

20:31:00 frodef hi

20:31:04 longshi hello

20:31:15 svillemot Myon: I

20:31:27 drmeister On reflection, generating code directly from cleavir will probably require figuring out how to set up unwinding and to generate DWARF metadata.

20:31:35 svillemot Myon: I will probably do it, but not sure when, so don't hesitate to go ahead

20:32:39 Myon svillemot: the autoremoval clock is ticking, but there's a whole month left

20:33:23 Myon I'd be more comfortable if you did it because I can't judge any of the patches

20:33:37 Myon I can only see if pgloader explodes or not :)

20:34:03 svillemot ok, I'll do it

20:34:46 Myon merci