libera/#clasp - IRC Chatlog

18:35:39 drmeister I have an x86 Macbook Pro on the thirdlaw VPN that can do profiling. I could profile the bytecode interpreter once it works with cvm.

18:48:08 yitzi I think that removing application.config and project_headers.h has made it easier to add an extension that doesn't use clbind. I tried to make a non-clbind version of lila before and I was getting all kinds of weird syntax errors when it tried to compile anything that used project_headers.h. With this method it looks like it was pretty easy and no errors yet.

18:48:31 drmeister That makes sense

18:49:00 yitzi Seems like project_headers was very sensitive to the ordering of the declarations.

18:49:20 drmeister Bike, karlosz: If you do get cvm compatible with the bytecode interpreter at the end of the day - tell me and I'll do some profiling.

18:49:42 drmeister yitzi: How so? Is it because of the inheritance between classes?

18:50:17 drmeister I could sort them based on inheritance. We don't use multiple inheritance so it should be a simple matter of sorting the headers based on the precedence of classes

18:50:54 yitzi It could be. I don't really know for sure since I was not able to fix the problem. I was getting all kinds of weird errors from templates. This new method seems to work out of the box.

18:53:10 yitzi I am letting it run the workflow tests. Everything seems to work but just making sure. I have already tested with boehm and boehmprecise. I am gonna run the static analyzer at some point just to be extra sure that I haven't broken anything.

19:20:21 Bike drmeister: oh, i did that this morning

19:32:27 drmeister Bike: I lost the thread - what was it that you did this morning?

19:32:57 drmeister Get cvm working with clasp/vm?

19:33:18 drmeister If that's the case - I'm hitting this at the moment...

19:33:19 drmeister https://www.irccloud.com/pastebin/WrydQffR/

19:33:24 drmeister Unknown opcode 52

19:36:28 Bike 52 is the long opcode. what are you compiling exactly?

19:37:00 Bike actually, i don't think the compiler generates that, so that's weird

19:37:17 drmeister https://www.irccloud.com/pastebin/yf4nBIAN/

19:37:26 drmeister (setf (fdefinition 'fib) (compile-to-vm::bcompile '(lambda (n) (if (eql n 1) 0 (if (eql n 2) 1 (+ (fib (- n 1)) (fib (- n 2))))))))

19:38:43 Bike hm, that should work, but it's not here

19:38:46 Bike lemme get back to you on that

19:40:59 Bike oh right, jumps are messed up probably

19:41:03 Bike gimme a minute to fix that

19:41:08 drmeister Ok

19:44:00 Bike also, sidenote, since you made entry points funcallable, you can actually just use compile-to-vm::compile

19:44:04 Bike i should probably axe bcompile

19:45:24 drmeister Ok

19:50:46 yitzi drmeister: I'm not sure I understand the advantage of the class/name pattern from CL_LISPIFY for CL_DEFMETHOD. Doesn't that defeat the point of the specialization?

19:53:21 drmeister How so?

19:53:54 Bike drmeister: ok, try now

19:54:44 drmeister yitzi: atom and residue inherit from matter. (matter/add-matter a-residue an-atom) works.

19:55:26 yitzi Well, if I have class A and class B both of which have CL_DEFMETHOD A::fu() and CL_DEFMETHOD B::fu() ....

19:56:21 yitzi Then I want (fu x) to call the right one. I don't understand how single-dispatch A/fu and B/fu help there.

19:56:23 drmeister I'm thinking of C++ exposed classes as working more like Common Lisp structs than classes.

19:57:47 yitzi To me matter/add-matter makes sense for static CL_DEFUN.

19:57:55 drmeister After all this I think it's more valuable to have A/fu and B/fu to make it easier to discover what methods apply to instances of A and B than to be able to write (fu x).

19:58:47 yitzi So are you thinking of CL_DEFMETHOD as being a virtual function?

19:59:42 drmeister CL_DEFMETHOD is for methods and virtual methods.

19:59:59 drmeister Take a look at build/boehmprecise/generated/rename-methods.sexp

20:00:05 drmeister That is autogenerated by the scraper.

20:00:30 drmeister It lists at the bottom the cases where single-dispatch is currently used.

20:00:46 yitzi Well, in common-lisp-jupyter, it can list applicable methods based on specializers.

20:00:47 drmeister It's very few cases where it's useful.

20:01:31 drmeister Yeah, but listing applicable methods based on specializers isn't useful to programmers.

20:02:40 drmeister I was looking for something to help me find what methods apply to a particular class with name completion. Where I can type atom/<tab> and get all the atom methods.

20:04:26 drmeister Look at cando, build/boehmprecise/generated/rename-methods.sexp

20:05:15 drmeister There are 1517 methods that are easy to rename to class/name

20:06:27 drmeister There are 105 of them that are hard to rename because the same method name is used by multiple classes.

20:06:49 drmeister Let me call you later about this and we can talk about it.

20:08:28 yitzi Sure

20:11:54 karlosz OK, here are the easy VM optimizations to make: Not using AREF for the literals vector, and using casting instead of manual arithmetic for reading labels

20:12:17 Bike think i got lexical nlx working

20:12:33 karlosz that would also make the pc updating for the branch instructions better

20:12:37 karlosz Bike: great!

20:13:05 karlosz does that use the same mechanism as cclasp in terms of handling C++ unwinding and longjmp stuff?

20:17:14 Bike Yes

20:17:29 Bike so it should work for C++ unwinding cases too, i think, though i didn't try that

20:18:17 Bike oh yeah, related. when we exit out of a nonlocal block does that hit the entry-close? i think it ought to, maybe?

20:18:37 Bike since blocks and tagbodies aren't distinguished in the ir, we can't exit the entry as soon as we do an exit

20:18:41 Bike in the bytecode*

20:18:49 karlosz Bike: you compile the entry-close differently

20:19:52 karlosz like https://github.com/clasp-developers/cvm/blob/a4bc08d869fc0b321c4aeefd46d7ca913d794ffd/compile.lisp#L868

20:20:06 karlosz for block the label to exit to is right before the cleanup

20:20:44 Bike so we do hit the entry-close? like we hit the exit instruction, and then the next instruction is entry-close.

20:20:46 karlosz for tagbody https://github.com/clasp-developers/cvm/blob/a4bc08d869fc0b321c4aeefd46d7ca913d794ffd/compile.lisp#L839 you jump to wherever the go tag is which doesn't have to

20:21:09 karlosz Bike: yes for block return-from, the exit goes to the label right before the entry-close

20:21:19 Bike Right, ok, that's what I expected

20:21:22 karlosz for tagbody the exit goes to wherever the go-tag label is emitted

20:21:31 Bike and then in tagbody it won't in general, unless the tag happens to be right at the end of the tagbody

20:21:35 karlosz yes

20:21:42 Bike Great

20:22:11 karlosz i think this is how i wanted it to work for cclasp as well but i think the reason we didn't do it is that llvm can't really express inter-function control transfer

20:22:57 Bike Yeah, in llvm the block pretty much needs to control where you go next. we can't encode the destination at the return-from/go/whatever like we can in the vm

20:23:23 Bike this is a limitation of llvm (imo) and i don't have the knowhow to fix it

20:23:48 Bike llvm nlx is entirely based on c++ exceptions, which suck, and are also very dynamic, so encoding a destination in the jump doesn't make sense

20:25:36 Bike anyway, i think that's all the instructions implemented except catch, throw, catch-close, progv, long

20:25:52 Bike so if we do like aclasp and compile catch/throw/progv as thunks, that should about cover everything

20:26:01 Bike i guess next i will fix up macrolet for clasp

20:26:28 Bike and then we can start trying it on clasp and watch it all burn as the edge cases i haven't thought of come up

20:27:49 karlosz_ Bike: why do we need to do this stuff as thunks?

20:28:11 Bike we don't need to, it would just mean a little less to debug to start out with

20:28:18 karlosz_ ah gotcha

20:28:23 karlosz_ we also need to do unwind-protect

20:28:35 karlosz_ i think the clisp instructions will do... haven't thought about it too much

20:30:50 Bike i have not been thinking too hard about it, since aclasp already uses thunks for all of these

20:31:02 Bike so the vm can't possibly do worse by continuing that

20:35:25 drmeister When I compile the FIB example and evaluate (FIB 29) it works fine.

20:35:34 drmeister But (time (FIB 29)) hangs.

20:36:01 karlosz_ very strange

20:36:30 karlosz_ question: how does the side stack get cleaned up during an interrupt?

20:36:32 drmeister Ah - it worked once and the next time hangs.

20:36:38 Bike oh, i might have left printfs in

20:37:22 Bike yeah, sorry, just pushed that out

20:37:34 Bike i don't know if that's the problem, but i imagine a million lines of output doesn't help

20:38:25 drmeister I turned printfs off.

20:39:48 Bike the vm might not react well to interruption. the NLX will react appropriately to exits but normal use might not

20:39:49 drmeister I'll get some debugging going.

20:40:45 Bike i suppose i can make bytecode_call pop the frame in a catch (...) instead of only doing it on nlx or normal return. that should make it work better with exits, i think

20:41:19 Bike (i moved the bulk of the vm interpreter to another function bytecode_vm, to do the recursive calls thing for nlx)

20:41:58 karlosz_ i don't really understand why we need to do recursive calls to be honest

20:41:58 Bike wait, catch ... doesn't include normal exits

20:42:02 Bike this stupid fucking language

20:42:14 Bike we don't, it's just easier to write

20:43:13 Bike if we didn't do that we would (as far as i have thought this through) need to keep track for each PC of what dynamic environments need to be unwound if we exit, so that we can have the C++ code unwind during an alien unwind

20:43:44 Bike which is more or less how C++ unwinding works, except we'd be using bytecode pcs instead of machine

20:49:53 Bike it would probably be a little faster as long as we don't actually NLX
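
Bike's remark above that `catch (...)` does not run on a normal exit is the usual argument for a scope guard in C++: a destructor runs both on normal return and while an exception unwinds through the scope. The following is only a minimal sketch of that idiom. `ToyVM`, `FrameGuard`, `toyCall` and `depthAfterBothExits` are hypothetical names invented for illustration and are not clasp's `bytecode_call` or `bytecode_vm`.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>

// Hypothetical stand-in for the VM state; not clasp's actual VirtualMachine.
struct ToyVM {
  std::size_t frameDepth = 0;
  void pushFrame() { ++frameDepth; }
  void popFrame() { assert(frameDepth > 0); --frameDepth; }
};

// The destructor runs when the guard leaves scope, whether the scope is
// left by a normal return or by an exception propagating through it.
class FrameGuard {
public:
  explicit FrameGuard(ToyVM& vm) : vm_(vm) { vm_.pushFrame(); }
  ~FrameGuard() { vm_.popFrame(); }
  FrameGuard(const FrameGuard&) = delete;
  FrameGuard& operator=(const FrameGuard&) = delete;
private:
  ToyVM& vm_;
};

// Hypothetical call wrapper: runs a trivial body inside a frame.
int toyCall(ToyVM& vm, bool unwind) {
  FrameGuard guard(vm);
  if (unwind) throw std::runtime_error("simulated nonlocal exit");
  return 42;
}

// Returns the frame depth left behind after one normal call and one
// call that exits by throwing.
std::size_t depthAfterBothExits() {
  ToyVM vm;
  toyCall(vm, false);
  try {
    toyCall(vm, true);
  } catch (const std::runtime_error&) {
    // The guard has already popped the frame by the time we get here.
  }
  return vm.frameDepth;
}
```

In this sketch `depthAfterBothExits()` returns 0, since both exits pop the frame. It only covers exits implemented with C++ exceptions; an exit done with `longjmp` would not run the destructor.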

20:54:24 Bike fib 30 does seem to hang still

20:54:56 karlosz_ for non-local exits specifically (not unwind-protect) it would seem like you could just make C++ callouts save the current stack pointer before the call out and then on return check the current stack pointer against the saved stack pointer

20:55:24 karlosz_ then do an unwind and pop off dynenvs like that

20:55:36 karlosz_ i suppose that doesn't really work for tagbody

20:55:38 Bike but the C++ might not return

20:56:27 karlosz_ yeah, i guess we do need a frame then. but calling recursively shouldn't be a big deal

20:56:57 Bike i did just set it up to pop frames gracefully during a C++ exit, hopefully

20:57:00 Bike doesn't seem to have changed fib

20:58:02 karlosz_ yeah and fib doesn't use &optional and &key so it shouldn't be that either

20:58:38 karlosz_ ** NICK karlosz

21:15:12 Bike if repeated calls are causing a problem, maybe the stack discipline is messed up

21:15:23 Bike although if that's the case i wouldn't really expect recursive calls to work at all, so who knows

21:47:08 karlosz Bike: any reason for separating out bytecode_return?

22:10:35 drmeister Bike: (fib 30) was hanging the first time that I tried it - it could be for an unrelated reason such as we are blowing the side-stack. Is that possible yet?

22:11:16 drmeister If it's not possible (we have 16K words at the moment) (fib 30) should only go like 30 levels deep - right?

22:22:04 drmeister Bike: For example, I can evaluate (fib 30) but (fib 35) causes clasp to crash.

22:23:24 drmeister I'll add something to print the high-water mark for the stack.

22:29:58 Bike karlosz: i thought i was going to need to use it in exit, which is why i asked about that

22:30:51 Bike drmeister: should go 30 deep max, yeah... which is why i'm wondering if the stack pointer is being maintained improperly, so that each call leaves the stack pointer higher than it ought to be upon returning
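
The "30 deep max" estimate can be checked independently of the VM. The sketch below mirrors the Lisp definition of fib quoted earlier in the log (fib of 1 is 0, fib of 2 is 1) in plain C++ and records the number of calls and the maximum recursion depth. `FibStats`, `fibTraced` and `traceFib` are names made up for this illustration.

```cpp
#include <algorithm>
#include <cstdint>

struct FibStats {
  std::uint64_t value = 0;   // result of fib(n)
  std::uint64_t calls = 0;   // total number of invocations
  int maxDepth = 0;          // deepest nesting of live calls
};

// Mirrors the Lisp definition from the log: fib(1) = 0, fib(2) = 1.
// Like the original, it is only meaningful for n >= 1.
static std::uint64_t fibTraced(int n, int depth, FibStats& stats) {
  ++stats.calls;
  stats.maxDepth = std::max(stats.maxDepth, depth);
  if (n == 1) return 0;
  if (n == 2) return 1;
  return fibTraced(n - 1, depth + 1, stats) +
         fibTraced(n - 2, depth + 1, stats);
}

FibStats traceFib(int n) {
  FibStats stats;
  stats.value = fibTraced(n, 1, stats);
  return stats;
}
```

For n = 30 this gives a maximum depth of 29 frames and 1,664,079 calls, so the call count grows exponentially while the depth stays linear. A stack that is maintained correctly should therefore need only a few dozen frames here.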

22:36:03 drmeister I'll check with this:

22:36:07 drmeister CL_DEFUN Integer_sp core__side_stack_pointer() { return Integer_O::create((uint64_t) my_thread->_VM._stackPointer);}

22:39:42 drmeister Bike, karlosz: Can we avoid taking pointers to anything on the side stack? If we can then we can dynamically resize it.

22:40:23 drmeister If we must take pointers, use indices relative to the bottom (stack grows up) of the stack.

22:40:47 drmeister Aaaaaand, what do you think about dynamically resizing the stack?

23:00:45 drmeister Nope, the stack is being maintained properly

23:02:46 drmeister https://www.irccloud.com/pastebin/3QVy5Q8D/

23:03:05 drmeister That should say "Stack pointer"

23:05:44 drmeister https://www.irccloud.com/pastebin/GtD3E1qV/

23:06:32 drmeister It's only getting about 1000 bytes deep

23:19:42 drmeister Bike: It crashes while allocating a CONS cell.

23:19:48 drmeister This may be a bad pointer issue.

23:20:41 drmeister I'll turn on guards.

23:23:33 Bike drmeister: we use pointers into the stack as vaslists to calls, so no, we kind of can't avoid that (without having to copy everything for that stuff)

23:23:36 Bike also, ouch

23:25:22 Bike least we know what the issue is

23:26:01 drmeister Here's some info on the bug.

23:26:04 drmeister https://www.irccloud.com/pastebin/EH6HPbB6/

23:26:06 Bike debugging the vm is very pleasant compared to debugging the machine target. i can have it just dump out every instruction no fuss, and we could dump the entire memory state (stack) too if we wanted

23:26:16 drmeister It's on line 309 of malloc.c in boehm

23:26:46 drmeister The problem is down at the bottom GC_obj_kinds[k].ok_freelist[lg] -> 12 = (void *) 0xb9000055e48fbd77

23:26:47 Bike so we're doing something that makes boehm crash?

23:26:55 drmeister That's not a real pointer.

23:27:06 drmeister Yeah - we are making boehm crash.

23:27:07 Bike i guess if the stack isn't maintained we could be using up all memory, which would probably do weird things to boehm

23:27:29 drmeister I'll put a watchpoint on &GC_obj_kinds[k].ok_freelist[lg]

23:27:35 drmeister k = 5, lg = 2

23:27:40 drmeister Those are reasonable values.

23:28:09 drmeister I'm building with guards on at the moment to see if that catches anything.

23:28:57 drmeister What do you mean by "if the stack isn't maintained"?

23:30:04 Bike what i'm imagining is for example, before a particular call to fib, sp = 1032

23:30:28 Bike and then after the call returns, sp is 1034 or something

23:30:35 Bike ...although maybe that would cause more severe problems

23:30:41 Bike but point is it would be like a memory leak

23:31:27 drmeister Bike: I check the side stack pointer after each call from the command line. If there was creep - it should show up - right?

23:32:15 drmeister There's no creep.

23:32:16 drmeister https://www.irccloud.com/pastebin/56FomlQG/

23:32:27 drmeister Note, I set *sp* to the current stack pointer.

23:33:08 drmeister Then I print the difference (- (core:side-stack-pointer) *sp*) whenever the argument to fib is 1.

23:33:20 drmeister Then I call (fib 3) and again (fib 3)

23:33:32 drmeister It prints the same difference: 104 each time.

23:35:44 Bike oh

23:35:59 Bike wow i misread you. when you said the stack was being maintained properly i read the opposite. oops.

23:37:24 drmeister No worries.

23:45:00 Bike ok, so other things that could break boehm... well, the vm uses a law of untagged pointers, which is not something i usually do, so i might have screwed stuff up on that front

23:51:05 Bike a lot of untagged pointers*

0:03:20 karlosz drmeister: like Bike said, we do sort of want stack pointers in the VM instead of indices to keep things fast. for dynamically allocated stacks we could just create a new one twice as big, copy the whole thing, and just update the stack and frame pointers in the vm struct
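
A rough sketch of the grow-by-doubling idea karlosz describes: keep raw stack and frame pointers for speed, and when the block fills up, allocate one twice as large, copy the contents, and rebase both pointers. `GrowableStack` and `roundTripsAcrossGrowth` are hypothetical, the slots are plain words rather than clasp's `T_O*`, and nothing here registers roots with the GC.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <memory>

class GrowableStack {
public:
  explicit GrowableStack(std::size_t words)
      : capacity_(words),
        storage_(new std::uintptr_t[words]()),
        sp_(storage_.get()),
        fp_(storage_.get()) {
    assert(words > 0);
  }

  void push(std::uintptr_t word) {
    if (sp_ == storage_.get() + capacity_) grow();
    *sp_++ = word;
  }

  std::uintptr_t pop() {
    assert(sp_ > storage_.get());
    return *--sp_;
  }

  void markFrame() { fp_ = sp_; }  // frame pointer := current stack pointer

  std::size_t depth() const {
    return static_cast<std::size_t>(sp_ - storage_.get());
  }
  std::size_t frameOffset() const {
    return static_cast<std::size_t>(fp_ - storage_.get());
  }

private:
  void grow() {
    // Remember both pointers as offsets, since the old block goes away.
    const std::size_t spIndex = depth();
    const std::size_t fpIndex = frameOffset();
    const std::size_t newCapacity = capacity_ * 2;
    std::unique_ptr<std::uintptr_t[]> bigger(new std::uintptr_t[newCapacity]());
    std::memcpy(bigger.get(), storage_.get(),
                capacity_ * sizeof(std::uintptr_t));
    storage_ = std::move(bigger);
    capacity_ = newCapacity;
    // Rebase the raw pointers into the new block.
    sp_ = storage_.get() + spIndex;
    fp_ = storage_.get() + fpIndex;
  }

  std::size_t capacity_;
  std::unique_ptr<std::uintptr_t[]> storage_;
  std::uintptr_t* sp_;
  std::uintptr_t* fp_;
};

// Pushes a marker, sets the frame, then pushes 0..count-1 onto a 4-word
// stack (forcing several growths) and checks everything pops back intact.
bool roundTripsAcrossGrowth(std::size_t count) {
  GrowableStack stack(4);
  stack.push(100);
  stack.markFrame();
  for (std::size_t i = 0; i < count; ++i) stack.push(i);
  for (std::size_t i = count; i-- > 0;) {
    if (stack.pop() != i) return false;
  }
  return stack.frameOffset() == 1 && stack.pop() == 100;
}
```

The sketch rebases only the two pointers it owns. As Bike noted, vaslists also point into the stack, so any such outside pointer into the old block (and the GC root registration) would need the same treatment and is ignored here.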

0:04:05 karlosz also, maybe somehow the side stack isn't keeping objects alive?

0:04:24 karlosz i thought it would if we used the normal C++ stack though

0:04:25 drmeister The side stack should be keeping things alive - I set the whole thing as a source of roots.

0:05:09 drmeister It looks like we are writing garbage into the boehm memory. I see a tagged pointer being written into boehm memory.

0:05:24 karlosz ah ok then we don't even need to allocate the side stack on the C++ stack

0:06:01 drmeister I don't think we do.

0:06:31 drmeister We talked about allocating it on the C++ stack but that doesn't help anything.

0:06:37 karlosz right, that's how we'd get growable stacks too

0:06:45 karlosz (by not doing it on the C++ stack)

0:07:09 drmeister I'm allocating the side-stack here:

0:07:10 drmeister https://github.com/clasp-developers/clasp/blob/vm/src/gctools/threadlocal.cc#L138

0:07:12 karlosz fib 30 used to work before the changes right before the weekend though right?

0:07:24 drmeister I'm using posix_memalign( &mem, pageSize, VirtualMachine::MaxStackWords*sizeof(T_O**) ); to allocate it

0:07:45 drmeister And gctools::clasp_gc_registerRoots((this->_stackBottom),(this->_stackBytes-pageSize)/sizeof(T_O*)); to register the whole thing as a source of roots.

0:08:11 drmeister I can't currently limit it just to the live stack. That's why I recommended that we stick NULL in each word as we pop words.

0:08:42 karlosz yeah, that makes sense

0:09:24 drmeister Actually, there might be a way to limit what part of the side-stack is live. There are ways of making boehm just check a range of memory - but the last time I tried to use it it crashed.

0:09:29 drmeister I could try that again.

0:09:59 karlosz yeah... i guess that might potentially cause problems. i had thought that that would just mean keeping objects live at worst

0:10:04 drmeister I think we need to add more checking to the VM. Check that the pointers we write to are all aligned.
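
One possible shape for the alignment check drmeister mentions, as a sketch only. `isWordAlignedAddress`, `isWordAligned` and `checkedStore` are hypothetical helpers, not existing clasp functions, and the check assumes that a valid slot address is a multiple of the pointer size.

```cpp
#include <cstdint>
#include <stdexcept>

// True when the numeric address is a multiple of the pointer alignment.
inline bool isWordAlignedAddress(std::uintptr_t address) {
  return address % alignof(void*) == 0;
}

inline bool isWordAligned(const void* p) {
  return isWordAlignedAddress(reinterpret_cast<std::uintptr_t>(p));
}

// Hypothetical debug-build store: refuses to write through a slot
// address that is not word aligned (for example a tagged pointer that
// was never untagged).
inline void checkedStore(std::uintptr_t* slot, std::uintptr_t value) {
  if (!isWordAligned(slot)) {
    throw std::invalid_argument("misaligned slot address");
  }
  *slot = value;
}
```

The bogus freelist value quoted earlier in the log, 0xb9000055e48fbd77, ends in 7 and so fails this test. Whether a check like this would have caught the actual bug is not something the sketch can show.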

0:10:40 karlosz but i suppose boehm won't like it if an object in the dead part of the stack gets overwritten by something else

0:19:33 karlosz just fixed jump-if-supplied for variable length labels so lambda lists should work again

0:49:31 drmeister I don't think garbage on the side-stack will hurt anything. Boehm is very, very tolerant

0:52:20 drmeister I think it's more likely that we are writing through a tagged pointer, or writing a tagged pointer when we should be writing an untagged one.

0:53:52 drmeister The crash is reproducible. It crashes the same way 3 times now.

1:49:04 Bike the vm as written doesn't null out popped stuff

1:49:08 Bike could the problem just be that

2:00:41 drmeister I don't think it's that - but we should null it out for now.

2:01:19 drmeister It looks like the vm._stackPointer is pointing to the boehm free list address.

2:01:53 drmeister I'll add a test that the _stackPointer stays in range.

2:02:06 drmeister stays between _stackBottom and _stackTop

2:03:14 Bike what the hell...?

2:10:11 Bike for the exit stuff i made a tiny little new structure VirtualMachineStackState that includes the frame pointer, and ENTRY uses it to restore the stack pointer after a nonlocal exit

2:10:29 Bike could it like... get reclaimed wrongly by boehm somehow... i guess not, since fib doesn't even do nonlocal exits

2:11:03 drmeister https://www.irccloud.com/pastebin/fpBMr1zZ/

2:11:16 drmeister vm._stackPointer is pointing outside of the stack

2:13:30 Bike ???

2:15:17 Bike i guess we could use udb to go back to where the stack pointer started sucking?

2:15:38 Bike maybe there's a size_t overflowing into a stupid high number, and that's ending up in push_frame or something

2:15:48 Bike although really the difference doesn't seem like enough for that

2:36:27 drmeister We are blowing through the top of the side-stack.

2:37:30 drmeister Huh - we don't have 16k words. We have 16k

2:37:40 drmeister 2048 words

2:39:32 Bike so 16k bytes? some misconversion there?

2:40:31 drmeister Yeah - I'll bump it up and see what happens. The guard isn't working though.

2:41:25 drmeister I do say this though: static constexpr size_t MaxStackWords = 16384; // 16K words for now.

2:41:38 drmeister Maybe I'm dividing where I should be multiplying

2:56:03 drmeister No. Putting in more print statements.

3:02:40 drmeister I was wrong, there are 16383 words on the stack.
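
The words-versus-bytes question above (16k bytes would be only 2048 eight-byte words) is the kind of slip that named conversion helpers make harder to commit. A small sketch: `MaxStackWords` and its value come from the log, while `WordBytes`, `wordsToBytes` and `bytesToWords` are hypothetical helpers.

```cpp
#include <cstddef>

constexpr std::size_t MaxStackWords = 16384;      // value quoted in the log
constexpr std::size_t WordBytes = sizeof(void*);  // one stack slot

// Keep the direction of each conversion explicit in the name.
constexpr std::size_t wordsToBytes(std::size_t words) {
  return words * WordBytes;
}
constexpr std::size_t bytesToWords(std::size_t bytes) {
  return bytes / WordBytes;
}

// Compile-time sanity check: converting there and back loses nothing.
static_assert(bytesToWords(wordsToBytes(MaxStackWords)) == MaxStackWords,
              "word/byte conversion must round-trip");
```

On a platform with 8-byte pointers, `wordsToBytes(MaxStackWords)` is 131072 and `bytesToWords(16384)` is 2048, which matches the two figures that came up in the discussion.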

3:05:57 drmeister Somehow we are blowing through the top of the stack.

3:06:26 Bike should we put in a test when we push that the stack pointer isn't too high
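
A sketch of the push-time test Bike suggests, combined with drmeister's earlier recommendation to store NULL in each word as it is popped. `CheckedStack` and the two helper functions are hypothetical and use indices into a `std::vector` rather than clasp's `_stackPointer`, `_stackBottom` and `_stackTop`.

```cpp
#include <cstddef>
#include <stdexcept>
#include <vector>

class CheckedStack {
public:
  explicit CheckedStack(std::size_t words) : slots_(words, nullptr), sp_(0) {}

  void push(void* value) {
    // Refuse to move the stack pointer past the top of the block.
    if (sp_ >= slots_.size()) throw std::overflow_error("side stack overflow");
    slots_[sp_++] = value;
  }

  void* pop() {
    if (sp_ == 0) throw std::underflow_error("side stack underflow");
    void* value = slots_[--sp_];
    // Clear the dead slot so that a scan of the whole block as roots
    // does not see a stale pointer.
    slots_[sp_] = nullptr;
    return value;
  }

  std::size_t depth() const { return sp_; }
  const void* slotAt(std::size_t i) const { return slots_.at(i); }

private:
  std::vector<void*> slots_;
  std::size_t sp_;
};

// Fills a 2-word stack and reports whether the third push is rejected.
bool overflowIsDetected() {
  CheckedStack stack(2);
  int dummy = 0;
  stack.push(&dummy);
  stack.push(&dummy);
  try {
    stack.push(&dummy);
  } catch (const std::overflow_error&) {
    return true;
  }
  return false;
}

// Pushes and pops one value and reports whether the slot was nulled.
bool poppedSlotIsCleared() {
  CheckedStack stack(2);
  int dummy = 0;
  stack.push(&dummy);
  return stack.pop() == &dummy && stack.slotAt(0) == nullptr &&
         stack.depth() == 0;
}
```

The check costs a comparison on every push, so it may be something to enable only in debug builds. It turns a silent write past the top of the stack into an immediate error at the offending instruction.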

3:40:46 drmeister The regular instructions seem to work fine.

3:41:03 drmeister I evaluate (fib 33) and print the stack for every instruction and there's no problem.

3:41:11 drmeister It's still running though

3:41:53 drmeister The stack gets maybe 1000 bytes high

4:15:45 drmeister Ok, got something. It's at the end.

4:15:56 drmeister Of course it f*cking is.

4:16:32 drmeister I evaluate (fib 33) and the stack stays at 1000 bytes or so all the way except at the end it balloons.

4:17:33 drmeister Crap - I can't even find the transition.

4:17:41 drmeister There is so much output.

4:21:45 drmeister Double check all the stackPointer changes. Something's wrong. Maybe with return.

4:27:02 drmeister hundreds of thousands of lines of output like this:

4:27:04 drmeister https://usercontent.irccloud-cdn.com/file/ELKlBEDk/image.png

4:27:24 drmeister Then right at the end the stack pointer starts climbing fast

4:27:37 drmeister https://usercontent.irccloud-cdn.com/file/Mz4DqGMs/image.png

4:27:40 drmeister 40 bytes at a time

4:28:05 drmeister Until kablooee

4:28:08 drmeister https://usercontent.irccloud-cdn.com/file/2qlY8zX7/image.png

4:34:21 drmeister Something else is going on.

4:34:43 drmeister I evaluate (fib 20) (fib 21) (fib 22) (fib 23) ... (fib 29) all work fine

4:34:48 drmeister (fib 30) blows up.