freenode/#lisp - IRC Chatlog

23:08:15 sea I think so. I took that off. Now I'm running with: (declaim (optimize (debug 0) (speed 3) (space 0)))

23:08:25 aeth well, you can just do it locally in bar

23:08:32 aeth (defun foo (l) (let ((sum 0)) (loop for i in l do (incf sum i)) sum)) (defun bar (v) (declare (optimize (speed 3) (debug 1)) ((simple-array fixnum (*)) v)) (let ((sum 0)) (loop for j across v do (incf sum j)) sum)) (let ((l (iota 100000))) (time (foo l))) (let ((v (coerce (iota 100000) '(simple-array fixnum (*))))) (time (bar v)))

23:08:53 sea 837,554 processor cycles vs 6,534,470 processor cycles and this time, it takes 8x as long!

23:09:10 aeth that is strange, I don't get that

23:09:24 aeth I get the same time.

23:09:31 aeth Both in time and sb-profile

23:09:57 sea this is interesting. If I repeatedly run bar, the time varies kinda wildly

23:10:03 aeth and making it longer (1 million instead of 100k) has bar win

23:10:15 aeth sea: yeah, that's why sb-profile:report is better.

23:10:25 aeth Run it many, many, many, many times and look at the sec/call

23:10:41 aeth assuming you did (sb-profile:profile foo bar)

23:10:53 sea I'm running it 100 times with profiling on and reset

23:11:17 sea 0.144 | 0.000 | 57,648 | 101 | 0.001425 | BAR

23:11:17 sea and

23:11:22 sea 0.025 | 0.000 | 0 | 101 | 0.000247 | FOO

23:12:48 jack_rabbit what is iota?

23:13:13 sea That just generates all the numbers from 1 to n in a list

23:13:18 TCZ its what you cant change or you go to hell its in bible

23:13:44 jcowan right, not one iota or tilde of the law shall pass away

23:15:40 jcowan also, dot your iotas and cross your tildes

23:16:25 jack_rabbit For me, list took 4,695,880 processor cycles, vector took 722,763 processor cycles

23:16:41 jcowan makes sense

23:16:52 jcowan you don't have to retrieve the cdrs when dealing with a vector

23:17:18 jack_rabbit yes. sea is claiming that for him, the vector one is significantly slower.

23:17:23 jcowan cdr-coded lists would help in this situation, but not enough overall for anyone to implement them any more

23:17:26 jcowan that's bizarre

23:21:47 sea I don't know what else it could be

23:22:03 sea I tried disassemble on both foo and bar but they're exactly the same as far as I can tell

23:22:41 aeth Well, first make sure that they're not the sb-profile wrapper. You might have to (sb-profile:unprofile) before disassembling now

23:22:51 sea ah

23:23:01 aeth My foo and bar are very different.

23:23:24 aeth Same basic structure of generic-+, but the actual surroundings reflect iterating over their respective types

23:24:01 sea http://termbin.com/ylig

23:24:37 sea bar is much longer

23:24:41 aeth that is not my bar

23:24:43 sea I have no idea why

23:24:45 aeth are you using my latest bar?

23:24:58 aeth My latest bar has this: (declare (optimize (speed 3) (debug 1)) ((simple-array fixnum (*)) v))

23:25:33 aeth It's simple-array that contains fixnums, which should certainly make it win.

23:25:43 aeth Your bar looks like a much earlier version

23:25:59 aeth The one that's generic for all sequences

23:26:17 sea Hang on then. I'll put in your declare in my bar

23:26:50 aeth Generic sequence and number code is almost always going to lose to specific sequence and number code in performance. They're basically the only two areas where type declarations are very useful for performance ime.

23:27:04 jack_rabbit It doesn't matter the data type if the code iterating through it is for generic sequences.

23:27:22 sea I need to alter the coerce as well. How do I coerce something to be a simple array of fixnums?

23:27:27 aeth jack_rabbit: but my SBCL still optimizes bar once it knows that it is a simple-array fixnum (*)

23:27:32 sea I get type errors when I try to call bar otherwise

23:27:47 jack_rabbit aeth, Yes, you are correct. I was speaking to sea, not you.

23:27:48 aeth sea: you have to coerce it to '(simple-array fixnum (*))

23:28:15 aeth sea: If it can only hold something of one non-T type, it's going to be a different thing than something that holds something of T

23:28:47 sea Okay so, I reset the profiler, re-profiled foo and bar, and ran them 100x

23:28:57 sea 0.147 | 0.000 | 0 | 100 | 0.001469 | BAR \n 0.013 | 0.000 | 0 | 100 | 0.000129 | FOO

23:28:59 sea but I still get this

23:29:11 sea actually lemme write one per line

23:29:17 sea 0.013 | 0.000 | 0 | 100 | 0.000129 | FOO

23:29:27 aeth You win twice with an array type like I just gave (three times if a length is given): (1) it knows it's a certain kind of sequence and (2) it can infer what type the items are, which usually cannot be done

23:30:19 aeth Unfortunately, this only applies to a small number of things. Portably just bit and character. Non-portably, a bunch of other numeric types like (almost always) single-float and (unsigned-byte 8) and fixnum

23:30:51 sea http://termbin.com/4gln

23:31:10 aeth Something is definitely up

23:31:54 aeth An array with an element-type should almost always be the most performant kind of sequence (or data structure in general) in Common Lisp. It will even beat lists at some things that lists are supposed to be better at.

23:32:19 aeth It's really the one optimization always worth doing if you can get away with it.

23:32:45 sea That's how I discovered this in the first place. I was timing an 'optimized' program, and found it got slower

23:34:20 sea and the thing is that along with the time: 445,976 processor cycles , 4,636,812 processor cycles I get a tonne of time results printed as well, and they all basically look like this. The vector one is much larger

23:35:16 aeth Try restarting CL?

23:36:13 sea Okay, restarted and re-evaluated what I had in the paste before. 0.148 seconds for bar, and 0.014 seconds for foo

23:37:54 sea Maybe my processor has some kind of issue that makes it slower on the vector version.

23:39:24 pierpa arrays with an element-type are not necessarily more performant than arrays with generic element types. It depends on what/when/how much the elements needs unboxing and reboxing.

23:40:17 pillton The disassemble of bar shows that it calls make-sequence-iterator.

23:40:37 pillton Which means it is calling a function to get the next element.

23:41:01 pillton Disassemble (loop for j across v) and you see that it does not.

23:41:26 sea hang on that might not be the latest disassemble. This one is here:

23:41:41 sea http://termbin.com/2za7

23:42:01 sea oh wait my bad

23:42:03 sea I have to unprofile it

23:42:33 sea http://termbin.com/lpyi Okay, there.

23:45:04 jack_rabbit sea, for me 'being the elements of' is waaaaaay slower than 'across'

23:45:28 jack_rabbit for whatever reason. Maybe 'being the elements of' assumes a generic seq.

23:45:33 pillton sea: https://hastebin.com/cawayonuxu.lisp

23:45:46 sea A-HA!

23:45:51 sea I swapped it for across and this time it's faster

23:46:26 sea So, being the elements of is slow. Why?

23:46:37 sea What's it doing that slows it down?

23:47:00 pillton I told you why. It is calling a function to get the next element.

23:47:43 jack_rabbit oh, I didn't even see pillton's comments.

23:47:49 jack_rabbit Yes, he's right.

23:48:02 sea Why does it do that in one case and not the other? What's the behavior of 'being the elements of' supposed to be, and 'across'?

23:49:00 DataLinkDroid energizer (~energizer@unaffiliated/energizer) has joined #lisp ouse disable

23:50:31 jack_rabbit sea, http://termbin.com/izd9

23:50:34 pierpa nobody can tell you why "being the elements" is slow since "being the elements" is not CL. It must be an extension of the implementation you are using.

23:51:06 sea I was wondering why I couldn't find it in CLHS just now

23:51:18 pierpa tht's the reason!

23:51:20 pillton sea: It is defined here http://www.doc.gold.ac.uk/~mas01cr/papers/ilc2007/sequences-20070301.pdf.

23:51:47 pillton It is defined to use the user-extensible sequences protocol.

23:52:02 jack_rabbit ahh, interesting.

23:52:06 aeth So it's the sequence-generic version, but unlike most sequence-generic things it doesn't un-generic when the type is known

23:52:45 pillton I think only SBCL implements the user-extensible sequences protocol.

23:53:24 jack_rabbit Is there another free CL implementation out there that works well aside from SBCL?

23:54:07 pierpa all of them do

23:55:00 Bike ccl's good, ecl's good, something something

23:55:27 jack_rabbit I'll try those. I've only tried clozure and clisp, and both gave me issues.

23:55:36 jack_rabbit wait, ccl is clozure.

23:56:19 pierpa if these gave you issues, most probably the flaws are in your code, IMO

23:56:34 aeth CCL has a superior GC than SBCL and is fairly comparable to performance in SBCL. ECL apparently is better on some niche areas like bignum performance.

23:57:01 aeth SBCL, though, in general is pretty nice. It's usually the fastest, the most helpful, and the most feature-rich.

23:57:31 aeth You could definitely beat SBCL in performance, though, if you really tried. There's definitely lots of room for improvement all over the place.

23:59:10 aeth SBCL is pretty fast, but its optimizations don't really compare to some of the ridiculous optimizations compilers with big budgets can do these days.

23:59:31 jack_rabbit pierpa, ccl gave me an error compiling some quicklisp library. I assume that is the library's fault. clisp crashes trying to load swank, which I assume is clisp's fault.

23:59:48 aeth Libraries are definitely written just for SBCL.

0:00:05 pillton Which is a shame.

0:00:08 aeth Ime, libraries will usually work on CCL, often work on ECL, and give issues with just about any other implementation, especially 32-bit ones.

0:00:11 jack_rabbit yes, it is.

0:00:45 pillton ACTION doesn't want to risk starting the unit test thread again.

0:01:01 aeth It's hard to not write for SBCL, though. There are so many ways to figure out what's going on in SBCL.

0:01:16 aeth I'm pretty sure of how my code behaves in SBCL, at least at the defaut optimization levels.

0:01:27 aeth s/defaut/default/

0:01:49 aeth Last time I checked, I couldn't find a profiling solution for CCL.

0:02:28 jack_rabbit The library is static-vectors, and the error is: "Foreign function not found: X86-LINUX64::|memset|"

0:02:48 aeth Really? static-vectors works for me in CCL. It gives me issues in ECL, though, even though it's supposed to support it.

0:04:16 jack_rabbit hmm :/

0:05:26 aeth But that does seem to match my experience. Things that use CFFI are the most problematic.

0:05:41 pierpa duh

0:05:42 aeth cl-sdl2 alone limits my project to SBCL, CCL, and ECL.

0:10:27 aeth It's unfortunate that unless CLX works for you there's no way to avoid at least some foreign code.

0:19:46 White_Flame aeth: telnet or web interfaces ftw

0:28:05 jack_rabbit Yeah, unfortunately mcclim seems to need static-vectors...

0:28:16 aeth interesting

0:28:30 aeth static-vectors seems to have taken over the world

0:29:30 White_Flame seems like that should have been part of FFI

0:30:11 aeth Now if only there were convenient solutions for other things

0:31:14 pfdietz SBCL is about 500K LOC. Clasp/LLVM is maybe 5M LOC, last I checked.

0:31:57 pillton White_Flame: I'm not sure what problem static-vectors solves. Do some implementations invoke the GC during foreign function calls?

0:32:13 White_Flame it's not during, it would be between

0:32:13 Bike i don't see why they wouldn't

0:32:29 White_Flame you can't pass a pointer to foreign code if it could be moved at any time in the future

0:34:25 pillton It is only a problem if the foreign code retains the pointer.

0:34:34 White_Flame ...right

0:35:11 White_Flame and in a lot of I/O cases, including graphics, the call does not synchronously encapsulate all access to the buffer you give it

0:43:52 aeth pillton: Without static-vectors, you're either going to be working with a foreign array through stuff like mem-aref (not a pleasant experience) or you're going to copy from a CL-native vector into a foreign array at some point (which can kill your performance).

0:44:49 aeth With static vectors, there's no need to do either, as long as you're in control of the allocation and not the foreign library.

0:46:23 aeth The downside is that you're going to either have to use with-static-vector/with-static-vectors or you'll have to explicitly call free-static-vector in your own unwind-protect at some point.

0:47:15 aeth I'm guessing you also can't use (declare (dynamic-extent foo)) on a static-vector to stack allocate, so that's another restriction.

0:48:12 aeth Another downsize is that it seems to fool SBCL's type inference, so I have to (declare (whatever-type foo)) after with-static-vector or a let initializing the static-vector in order to get efficient sequence code, which is unnecessary with a normal vector.

0:49:52 pillton I always used sb-ext:array-storage-vector and cffi:with-pointer-to-vector-data.

1:52:17 iqubic` ** NICK iqubic

2:20:45 jack_rabbit Can anyone with CCL execute (read-from-string "#_memset") and let me know what happens?

2:26:44 pillton (values x86-darwin64::|memset| 8)

2:27:05 jack_rabbit :( Can't figure out why that doesn't work for me.

2:27:43 pillton Does rebuilding help?

2:28:01 jack_rabbit Hmm. Appears I may need to update the "interface database"

2:28:06 jack_rabbit Rebuilding CCL?

2:28:25 pillton Yes.

2:28:45 jack_rabbit I got it as a binary package. Hang on, i'll try building it from source.

2:38:14 jack_rabbit huh. I didn't even need to rebuild. Just used the download from the clozure.com site rather than my distro repo.

3:03:16 dvdmuckle ** NICK cavepaintingmuck

3:03:26 cavepaintingmuck ** NICK dvdmuckle

3:38:46 aeth It would be very unusual for a popular QL library not to work on CCL.

3:39:19 aeth Everything on QL has to run on at least two implementations, so supporting #1 and #2 by popularity is pretty much the absolute minimum.

6:52:20 TMA jack_rabbit: (read-from-string "#_memset") => (values 'WIN32::|memset| 8) or (values 'WIN64::|memset| 8)

6:53:28 jack_rabbit TMA, Thanks. :)

7:05:46 TMA also ccl 1.6 32-bit no longer runs on 64-bit windows 10

7:20:51 rme TMA: Support for running the 32-bit lisp on 64-bit Windows was added (by yours truly) in ccl 1.7.

7:48:32 TMA rme: oh, I never knew. I have an old 1.6 sitting in a directory transferred from an old 32-bit XP system and I tried to run it.

7:48:45 TMA rme: thanks for the addition

7:49:19 schweers TMA: out of curiosity: why do you run a recent OS in 32-bit mode?

7:57:40 beach Good morning everyone!

7:58:04 theemacsshibe[m] afternoon beach!

7:58:31 shrdlu68 Good morning!

8:02:17 schweers Good morning beach!

8:10:30 TMA schweers: I do not. I ran XP in 32 bit mode, I run everything in 64 bits since. I would return to 32-bit mode on low memory devices like low-end netbooks though.

8:11:45 TMA It is wasteful to use eight bytes for pointers when you have 2 gigs total memory.

8:11:56 schweers okay, that explains. I was wondering how windows 10 would perform with a maximum memory of ... 3GB? I know that 4GB or close to that are possible, but if I remember correctly, windows has a weird limit on 32 bit systems.

8:12:43 schweers It sure is, but then again, I was thinking about running windows 10, which -- I presume -- needs lots of memory just to boot.

8:13:54 flip214 TMA: but having more registers in 64bit-mode might mean less memory needs (eg. for temporary data) and faster computation as well

8:19:13 TMA flip214: that's true for arithmetic-intensive workloads. I guess most of what I do is data traversing, not number crunching

8:29:35 hajovonta hi all

8:30:01 shka hello

8:32:35 aeth The specifics of Lisp complicate 32v64 bit further. e.g. larger fixnums and unboxed single-floats

8:36:38 jack_rabbit schweers, windows 32-bit (IIRC) reserved ~1GB of address space for kernel stuff, so only 3GB available to user programs.

8:37:26 schweers I thought there was something else, but I may be wrong. Not my main platform anyway ;)

9:00:49 flip214 TMA: still, being able to hold much more data (eg. pointers!) in CPU registers might help, not only when doing arithmetic.

9:26:57 hajovonta1 ** NICK hajovonta

9:56:12 hlavaty hi i have a fileSystems."/var/lib/foo" entry in configuration.nix. the disk failed and now on boot the machine goes into rescue mode. how do i disable the disk so that the machine starts normally again and i can ssh in an upload and activate new configuration?

9:59:37 hlavaty ah sorry wrong channel

10:15:09 TMA flip214: I refuse to take position on that matter when I have no data. I am saying I can pack twice as many conses into the same amount of memory. I do not have performance data to tell, whether it will be faster. the execution speed is nowadays usually severely constrained by the memory access time (that's why beach's generic function dispatch scheme that removes one memory access is so awesome)

10:16:53 TMA flip214: so I guess, it might be faster for some workloads. your guess of the registers helping might be better or worse than mine. without data there is nothing we can do to tell them appart

10:17:04 TMA *apart (?)

10:51:39 makomo how can i define a function in a different package if i want to do it within a file which has a different package in (in-package ...) on the top?

10:52:04 makomo i tried using (in-package) right before the defun (and then again to switch back). i've also tried rebinding *package*. none worked

10:52:26 makomo it always defines the function within the package that's at the top of the file

10:52:36 Shinmera (defun foo::bar ..)

10:52:36 makomo is this due to slime/emacs or?

10:53:14 makomo oh, a double colon

10:53:30 Shinmera If the symbol is already exported from the other package you can also do foo:bar, of course.

10:53:33 makomo i tried with a single colon and it failed, i guess that makes sense now

10:53:35 makomo mhm

10:53:44 makomo this was a fresh package without any symbols, so that's why it failed

10:53:48 makomo Shinmera: and could you do it using (in-package)?

10:53:55 makomo or do you maybe know the reason the above failed?

10:54:05 Shinmera I don't know what you did, but it should work.

10:54:12 Bike when you say "defined", do you mean slime C-c C-c

10:54:28 makomo what exactly is bound to C-c C-c? i have different keybinds

10:54:35 makomo actually, no i don't

10:54:53 makomo Bike: yes

10:55:03 Bike slime-compile-defun

10:55:30 Bike i don't know how slime decides a package to read code in, it might just look for the first in-package in the file

10:55:36 makomo i did slime-eval-defun, but it should be the same

10:55:42 Bike i would recommend against using multiple in-packages in one file anyway

10:55:46 Shinmera It should work even with multiple in-packages.

10:55:54 makomo Bike: i agree, but i'm curious why it's failing

10:56:00 Shinmera makomo: what exactly is the failure

10:56:07 Bike well, as far as i remember it should work.

10:56:10 makomo it doesn't define the function within the package i want it to

10:56:15 Shinmera I'm gonna go ahead and guess it errors because of something that isn't related to the name of the function

10:56:16 makomo but within the one that is at the top of the file

10:56:29 makomo i.e. when i say "fail" i don't mean "it throws an error"

10:56:41 makomo it fails to do the expected thing

10:57:16 Bike quick test has it working for me.

10:57:54 Bike as in, (in-package #:foo) (defun bar () 'foo) (in-package #:foo2) (defun bar () 'foo2), C-c C-c the last, get (foo2::bar) => foo2::foo2

10:58:33 Shinmera https://plaster.tymoon.eu/view/772

10:59:11 Shinmera So, again, I don't know what you're doing.

11:00:11 makomo hmm, you have them as separate forms, i guess this is the "problem"?

11:00:16 makomo i had it inside a progn

11:00:20 makomo i.e. a single top-level form

11:00:26 Shinmera Well duh, a form is read as a single form.

11:00:31 Bike wait, like (progn (in-package ...) (defun ...))?

11:00:35 makomo yeah

11:00:48 Bike yeah that's going to read the entire progn, and use the previous in-package for it.

11:01:02 Shinmera The reader reads only complete forms. By the time it's evaluated, and the package switch would happen, it's already read.

11:01:09 makomo i remember the same issue i had with quicklisp, loading a library in the same form and using it, but that made sense because reading happens before evaluation

11:01:20 Shinmera So does this, same explanation.

11:01:22 Bike yes, that's what's happening here as well, after a fashion

11:01:26 makomo Shinmera: oh, so the symbol that names the defun

11:01:30 Bike not that slime actually evaluates the in-package

11:01:32 makomo is inside the current package

11:01:50 makomo instead of the one i want it to be in

11:01:54 makomo ah, makes sense

11:02:27 makomo neat, thanks Shinmera, Bike

11:02:34 Bike you're welcome

11:03:15 Shinmera (in-package foo) (progn (in-package bar) (defun baz ..)) is read as: (cl-user::in-package cl-user::foo) (foo::progn (foo::in-package foo::bar) (foo::defun foo::baz ..))

11:03:28 makomo Shinmera: mhm, it's clear now

11:04:32 makomo would there still be a way to create a macro which would temporarily switch packages, evaluate a body and then switch back?

11:04:49 Shinmera Not generally, no.

11:04:55 makomo i see

11:05:01 Bike no, because you read before evaluating/compiling, and macroexpansion happens during evaluation/compilation

11:05:09 makomo right

11:05:22 Shinmera A macro could do nasty things with trying to guess what the read form of a symbol was and translate it according to that

11:05:34 Shinmera But when you read, you lose information, so it's not entirely possibly.

11:05:43 Shinmera *possible

11:05:53 Bike for the purpose of this question, i'm assuming that evil magic is prohibited by the ancient treaty