freenode/#lisp - IRC Chatlog
Search
8:58:38
earl-ducaine
Does there seem to be a generally preferred thread portability library? I was looking at Bordeaux Threads, but I'm worried the API (at least the Quicklisp version) seems to have changed quite a bit.
9:00:11
|3b|
https://trac.common-lisp.net/bordeaux-threads/wiki/ApiDocumentation appears to be current docs
9:02:29
beach
Yes, except that address calculations are exposed and the objects manipulated may be machine data.
9:03:10
|3b|
so i can just start working on codegen from the HIR, since i probably wouldn't be changing much going to MIR anyway
9:04:35
beach
|3b|: Depending on the kind of HIR you generate, you will need to generate code for array referencing, CONS-cell referencing, standard-object referencing, perhaps fixnum arithmetic, etc.
9:06:36
|3b|
ACTION 's current test function needs to pass function arguments and a literal int to 2 native functions
9:07:36
|3b|
then i need to figure out how to specify that i'm compiling the function for the native calling convention, and assemble it into something i can put in an apk file
9:08:27
|3b|
(specifying the calling convention is one of the things where dropping user/implementation declarations came up)
9:09:41
|3b|
actually, i guess i can walk up the tree of environments to find it, so maybe dropping it isn't a problem after all
9:13:37
|3b|
and possibly using specialized arrays. i don't remember specifically which things that was for, but usually when i'm optimizing i want specialized arrays anyway :)
9:14:46
|3b|
(be sure to validate the declarations first if running on other lisps or running at safety 0)
9:17:43
|3b|
hmm, i guess i have one more step at IR level, need to distinguish whether i'm compiling for native or lisp
9:20:39
|3b|
anyone have thoughts on how best to represent java's super keyword in a lisp style? (super (foo this ...))? ((super foo) this ...)? (foo (super this) ..)? (funcall-super 'foo this ...)?
9:22:11
|3b|
(if i understand correctly, super is sort of generalized call-next-method, that lets you call any method or access a slot starting dispatch with superclass instead of actual class)
9:23:08
|3b|
ACTION currently is thinking of (foo (super this) ...) as best representing what it does
9:24:20
|3b|
though you could sort of generalize it to multiple, grab classes for current method, find applicable methods for requested function, remove first entry if exact match for current method
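A minimal sketch of the (foo (super this) ...) representation being discussed. SUPER and SUPER-MARKER are hypothetical names invented here: the idea is that SUPER wraps the receiver in a marker that a compiler could recognize and turn into an invokespecial-style call starting dispatch at the superclass.

```lisp
;; Hypothetical sketch: SUPER wraps the receiver in a marker object.
;; A compiler pass could detect SUPER-MARKER arguments and emit the
;; superclass-dispatch bytecode instead of a normal virtual call.
(defstruct super-marker
  object)   ; the receiver whose dispatch should start at the superclass

(defmacro super (this)
  `(make-super-marker :object ,this))
```

With this shape, (foo (super this) x) keeps THIS as an explicit argument, matching the existing lispy calling style, while still marking the call site for different code generation.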
9:24:34
TMA
|3b|: is this special? [not in (declare (special this)) but extraordinary, separately handled etc.]
9:25:15
|3b|
yeah, this is for compiling code to interact with java APIs, so i need to subclass java classes, and call methods from superclass, etc
9:25:56
|3b|
so first need to distinguish it from an intentional recursive call, and 2nd need to generate different bytecode
9:27:00
|3b|
well, currently THIS is an explicit argument in my lispy code, so generating SUPER sounds a bit off
9:27:13
TMA
not (super this) ... the latter would seem to enable you to use (super foo) for any foo, which is not the case from your description anyway
9:32:04
|3b|
ACTION isn't sure that only working on some things is too much of a problem, the same way you can't use call-next-method on anything that would change the applicable methods
9:32:45
|3b|
also seems like it would be fairly rare, since most places could just use call-next-method
9:37:24
TMA
using super for other purposes than the equivalent of call-next-method is considered bad style even in java itself
9:40:15
TMA
[the usage super(argument1, argument2); in constructors is a (call-next-method) of sorts albeit you can choose which superclass constructor you use]
9:46:31
shrdlu68
Spec says "vector is analogous to list", so is (aref <vector> n) the same as (nth n <list>)?
9:50:31
beach
|3b|: You know that the environment is gone by the time you have an AST, so you might need to define a new AST type as well.
9:51:36
|3b|
hard to tell what needs to be stored where until i try to use it, so there will probably be a few passes of this sort of thing
9:59:42
pjb
shrdlu68: aref is not the same as nth: (nth 4 '(1)) #| --> nil |# (aref #(1) 4) #| ERROR: Array index 4 out of bounds for #(1) . |#
9:59:58
pjb
shrdlu68: elt is like aref: (elt '(1) 4) #| ERROR: 4 is not a valid sequence index for (1) |#
10:00:30
pjb
The older NTH is nice and gentle. It doesn't rely on a condition system, it just returns NIL.
10:01:23
pjb
The newer AREF and ELT are harsh and strict. They rely on the condition system, and like to dominate and control you.
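The difference pjb is describing can be seen directly at the REPL; NTH tolerates out-of-range indices, while ELT (and AREF in safe code) signals an error:

```lisp
;; NTH quietly returns NIL past the end of the list.
(nth 4 '(1))                  ; => NIL
(nth 0 '(1))                  ; => 1

;; ELT signals an error for an invalid sequence index;
;; AREF behaves similarly for out-of-bounds indices in safe code.
(aref #(1) 0)                 ; => 1
(ignore-errors (elt '(1) 4))  ; => NIL and the condition object
```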
10:12:18
adlai
no-defun-allowed: "a subset", sure, there are many trivial subsets (eg, bit vector operations, "logical" integer operations, etc). the interesting question is hunting for maximal feasible subsets.
10:29:33
|3b|
hmm, maybe it does eat the declarations in a way i can't get by walking up the env list
12:31:42
hjudt
is there any lisp project that has a docker file i can look into? i'd like to deploy a project to an openshift cluster.
12:56:38
dim
hjudt: https://github.com/dimitri/pgloader has a dockerfile and is available on docker hub too, if you want to have a look
12:57:32
dim
someone contributed a nice setup to it, with a builder image separate from the runtime image, in which there's just the lisp image itself and its .so dependencies
14:17:26
phoe
basically - socket operations are *not* thread-safe, and performing them at once from different threads causes things to explode.
14:19:33
phadthai
hmm yes, whether they're thread-safe or not depends on the OS, thread implementation and polling method (signals may also be sent to the whole process, handled by whatever thread is next scheduled)
14:27:56
phadthai
so usually we can disable signals we don't want in threads we don't want them (usually using pthread_sigmask), configure sockets to not send signals like sigint and block/unblock status (via setsockopt), use custom locks/mutexes around syscalls like recv/accept, etc...
16:29:02
Demosthenex
in case anyone is interested, humble bundle's doing a coder book bundle atm which includes land of lisp (ebooks). https://www.humblebundle.com/books/learn-you-some-code-books
17:08:17
cgay
I wonder where Land of Lisp's summary came from. "...but it's cryptic syntax...." Way to sell books, guys.
17:13:46
aeth
I recently looked through the book (it was part of an ebook bundle a long time ago) and it didn't look as bad as some people said. It moves pretty slowly, though. Takes a while to get to useful features of the language.
17:14:37
aeth
It's dated, though. It recommends CLISP, whereas these days people generally recommend SBCL. And it mentions Clojure and Arc as up-and-coming Lisps.
17:28:27
russellw
What would be the most idiomatic way to order parts of a name? That is, in Scheme, there are lots of names like string->number. In my opinion, it would arguably make more sense to write such names like number-string or number-of-string when we are dealing with pure functions, because the function name is written before the argument, but string->number is the Scheme idiom. What is the idiom in Common Lisp?
17:31:27
Bike
so e.g. the thing to get a string out of a symbol or character is just called 'string'
17:32:26
russellw
right, which is why the distinction does not show up in the standard library. But as far as you are concerned, the idiom is from->to, not to-of-from?
17:33:26
Bike
i'm mostly talking about my preference. i don't know that it's something there's really an idiom for.
17:35:19
russellw
the specific use case that prompted the question is a pair of test functions, one of which tries reading and then writing, the other of which tries writing then reading
17:36:25
pjb
russellw: indeed, number<-string in scheme, number-from-string in CL would lead to more readable code. But still, left-to-right transformations are more common.
17:40:42
pfdietz
For accessors of structure types, the common part is a prefix. (defstruct human head arms torso legs) gives human-head, human-arms, etc.
17:41:45
pjb
(sign (hand-of-arm (arm-of-human (human-ceo-of-enterprise 'apple))) (employment-contract *me*))
17:42:37
pjb
the -of- or -from- or <- connectors lead to more readable code than the -to- or -> or just - connectors.
17:42:59
pjb
Notice on the other hand, that in CL, you can specify the prefix in defstruct, and you can use foo. instead of foo-
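pjb's point about choosing the prefix refers to DEFSTRUCT's standard :CONC-NAME option; a quick illustration with the HUMAN example from earlier in the discussion:

```lisp
;; :CONC-NAME controls the accessor prefix; here "HUMAN." replaces
;; the default "HUMAN-", giving accessors like HUMAN.HEAD.
(defstruct (human (:conc-name human.))
  head arms torso legs)

(human.head (make-human :head 'round))  ; => ROUND
```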
17:44:04
russellw
and I think given that the idiom is as it is, it is probably better if I follow it rather than doing things a different way
17:44:23
pfdietz
The alternate style is that of generic function accessors for standard classes. These typically omit the prefix entirely. arms, head, etc.
17:51:26
shrdlu68
I have a binary tree data structure such that each node consists of 'left-child, 'right-child, and 'data (which is some arbitrary object stored at the node). I'm currently using a vector to hold the values, initialized as (vector nil nil nil). What's a better alternative, performance-wise?
17:53:20
russellw
I would expect on theoretical grounds that a structure should be slightly faster, because the bounds check can be omitted, but the actual performance difference would be too small to measure
17:54:47
pjb
russellw: actually, I don't think structures can do without the bound check: (defstruct 2d-point x y) (defstruct (3d-point (:include 2d-point)) z) (3d-point-z (make-2d-point)) vs. (2d-point-y (make-3d-point))
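The structure alternative shrdlu68 is weighing, together with pjb's :INCLUDE example: a struct accessor still implies a type check in safe code, since e.g. 3D-POINT-Z must verify it actually received a 3D-POINT.

```lisp
;; A struct node instead of (vector nil nil nil):
(defstruct node
  left-child right-child data)

;; pjb's point: with :INCLUDE, accessors must type-check in safe code.
(defstruct 2d-point x y)
(defstruct (3d-point (:include 2d-point)) z)
;; (3d-point-z (make-2d-point)) signals a TYPE-ERROR in safe code.
```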
17:55:06
shrdlu68
The results of profiling are: https://gist.github.com/shrdlu68/081a78c6196b676395ca518a268d9119
17:56:40
pjb
But you should realize that it's silly to optimize access to the node slots of a binary tree when access to the tree itself is a slow O(log n), when you could have O(1) with a hash-table!?!
18:04:09
shrdlu68
The keys are bit vectors of length 1-240. I just walk the bit vector going left or right.
18:05:12
russellw
if all you want to do is lookup, I would expect a hash table to be both faster and smaller
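Since the keys here are bit vectors, the hash-table suggestion works directly: EQUAL compares bit vectors element by element, so an EQUAL hash table accepts them as keys without any conversion.

```lisp
;; Bit vectors are valid keys in an EQUAL hash table, because EQUAL
;; compares bit vectors (and strings) element-wise.
(let ((table (make-hash-table :test #'equal)))
  (setf (gethash #*1011 table) :seen)
  ;; A fresh copy with the same bits finds the same entry.
  (gethash (copy-seq #*1011) table))  ; => :SEEN, T
```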
18:10:45
shrdlu68
I tried using an octet vector rather than a bit vector, but octet vector was a bit slower than the bit vector.
18:11:52
shrdlu68
Yes. I had to subseq and mask some of the bits of the last octet where length was not a multiple of 8.
18:12:46
russellw
how many of these things are you storing, that you need to worry about 32 bytes per key?
18:14:58
russellw
supposing you have an atom to convert to a (read)able string, is this the fastest/simplest way to do it? (format nil "~s" a)
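For the question above: PRIN1-TO-STRING is the standard function for exactly this, and it is equivalent to (format nil "~s" x), printing with escapes so the output is READ-able where possible.

```lisp
;; PRIN1-TO-STRING prints an object READ-ably, like (format nil "~s" x).
(prin1-to-string 'foo)     ; => "FOO"
(prin1-to-string "a b")    ; => "\"a b\""

;; The two forms produce identical output:
(string= (prin1-to-string '(1 "a")) (format nil "~s" '(1 "a")))  ; => T
</imports-placeholder>
```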
18:16:19
whartung
uh.. “all 240-bit prefixes”, we only have 64-bit addressing… aren't you going to run out? “I know, I’ll use virtual memory!” Now you have 2 problems.
18:18:27
shrdlu68
whartung: I take a file, read it into an octet vector, convert that into a bit vector, and, for each bit, look back 240 bits, keeping track of the last (n=240) bits.
18:28:06
russellw
also '5s to process a 36k file'? on a modern computer, it should be much faster than that. Calculating hashes is nowhere near that slow. By orders of magnitude. Have you profiled it? What compiler are you using?
18:33:25
shrdlu68
russellw: Profiling: https://gist.github.com/shrdlu68/081a78c6196b676395ca518a268d9119
18:36:00
russellw
so if I understand that correctly, you are using SBCL, which is generally reckoned the fastest available compiler, and it is indeed spending a large percentage of the total time calculating hashes. Okay, that is bizarre, and I have no explanation for it. Should be several orders of magnitude faster
18:36:38
shrdlu68
russellw: Other implementations take so long I haven't really let them run to completion.
18:42:09
whartung
3.7% shouldn’t dominate the discussion. It’s interesting, but obviously not “the reason” it’s slow
18:42:43
whartung
I assume this: (FLET "BODY-FUN-0" :IN SB-IMPL::GETHASH3) includes the body of the function?
18:43:52
shrdlu68
The profiling without inlining, and removing one function (forgot to update repo): https://gist.github.com/shrdlu68/081a78c6196b676395ca518a268d9119
18:52:21
shka_
no idea, but i would expect sbcl to have different logic for short bitvectors and long ones
18:52:30
shrdlu68
"enhancement: The value of SXHASH on bit-vectors of length equal to the word size now depends on the vector's contents instead of being constant; its value on bit-vectors of length divisible by the word size now depends also on the contents of the vector's last word." => http://www.sbcl.org/all-news.html
18:53:58
jasom
it uses a poor mixing function though; the longer the bit vector, the bigger the difference, but even with 30 bytes the first 9 decimal digits are identical
19:21:05
jasom
also how many bit vectors are there supposed to be? I'm seeing 4M already and it hasn't yet run out of ram on my 1GB heap
19:22:15
oni-on-ion
heh @ last bullet item "(only with users permission and for maintenance reasons)" isn't that FB?
19:23:13
oni-on-ion
if you dont need to modify the data as they come from files, perhaps using a more efficient data structure than that ?
19:24:35
oni-on-ion
https://www.reddit.com/r/lisp/comments/5oezj6/how_to_interface_with_a_mysql_database_in_lisp/ ?
19:32:40
dim
shrdlu68: try CCL, the GC is much better than SBCL's one in my playing around with pgloader
19:43:00
jasom
shrdlu68: heap will almost always be exhausted during GC; it's either during GC or during a point where GC is excluded.
19:44:08
jasom
it looks like the hash table is about to grow to ~600MB and we are using 500MB, so that checks out.
19:44:08
dim
vtomole: when using pgloader, users often reach the heap exhausted/game over message in situations where CCL piles through the work at a fraction of the memory usage; it might be my code though
19:44:42
dim
but when users report problems with SBCL and big/huge data sets being processed by pgloader, I know I can just recommend CCL and it's going to be ok.
19:44:43
jasom
sbcl is very conservative about *when* to invoke the GC, so I've seen heap exhausted when there would have been room had the GC been run sooner
19:47:36
jasom
but in this case it's just that it's growing the hash-table to be more than 1/2 the heap, which just isn't going to work
19:50:20
jasom
but it looks like we have 4M objects in less than 2k buckets, which explains a lot. However, ~200MB of simple-bit-vectors seems excessive given the original statement that it's only 1 vector per bit in the file.
19:52:51
jasom
oh, that makes sense then. Space usage isn't that high; looks like it's 8 bytes + size of bit-vector per bit vector, and then there's over 100M more for the single-float arrays
19:53:08
jasom
so it's ~300M of data for 100M of hash-table overhead which isn't great but not terrible.
19:54:53
jasom
however, it looks like it may be copying the data on a rehash, as that's the only thing that makes sense for a 600M allocation. I'll inspect the code
20:01:14
jasom
also it looks like it always rounds the hash table size up to a power of two, which makes the default rehash-size of 1.5 rather stupid
20:03:28
jasom
total hash-table overhead seems to be 4 times the word size of the number of elements rounded up to a power of two
20:27:22
aeth
pjb: whenever I have something where a compose macro/function/whatever might seem useful, I find that it's probably better just to combine two of the three compositions into one new, trivial, inline function
20:28:34
aeth
i.e. instead of (another-function (some-function (symbol-to-keyword foo))) I can have (another-function (some-function* foo)) where the variation of some-function converts symbols to keywords
20:29:08
aeth
(I say i.e. instead of e.g. because it's pretty much always that, usually even symbol-to-keyword!)
20:31:35
aeth
This is common when I have a function with a case working on keywords but I take in arbitrary symbols from a macro.
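The pattern aeth describes above, sketched out. SYMBOL-TO-KEYWORD, SOME-FUNCTION, and SOME-FUNCTION* are hypothetical names taken from the discussion: rather than composing the conversion at each call site, a trivial inlined variant absorbs it.

```lisp
;; Convert an arbitrary symbol to the keyword with the same name.
(defun symbol-to-keyword (symbol)
  (intern (symbol-name symbol) :keyword))

;; A function whose CASE works on keywords.
(defun some-function (key)
  (case key
    (:foo 1)
    (:bar 2)
    (t    0)))

;; The trivial inline wrapper: callers pass plain symbols and the
;; conversion happens once, here, instead of at every call site.
(declaim (inline some-function*))
(defun some-function* (symbol)
  (some-function (symbol-to-keyword symbol)))
```

So (another-function (some-function* foo)) replaces (another-function (some-function (symbol-to-keyword foo))) without any general compose machinery.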