freenode/#sicl - IRC Chatlog

4:12:43 beach Good morning everyone!

4:20:33 no-defun-allowed Good morning beach!

5:11:56 ebrasca Good morning!

5:13:45 beach Hello ebrasca.

5:21:31 drmeister Bike: In the paper that you posted (that I got from Steve Blackburn) what are the different yieldpoint methods in Figure 3 describing?

5:22:20 drmeister I understand the (a) conditional one (I think). But (b) and (c) - I don't see how they work.

5:23:17 no-defun-allowed Some Java virtual machines cause threads to read or write an address, which causes a segfault, which then gets handled.

5:23:30 Bike they're explained under "Trap-Based Polling Yieldpoints". Basically, when the yieldpoint is hit it does a meaningless memory operation on some page. When you want the yieldpoint to activate, you protect the page so that memory operations on it cause the system to trigger an interrupt

5:23:35 Bike yeah.

5:23:50 no-defun-allowed That's a better explanation of it.
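
A minimal sketch of the conditional yieldpoint from case (a), written in Python purely for illustration. The names `Mutator`, `run_loop` and `request_at` are hypothetical and do not come from the paper or from any implementation discussed here. The trap-based variants (b) and (c) replace the flag test with a load or store to a page that gets protected on demand; that needs page protection and a fault handler, so it is not shown.

```python
class Mutator:
    """Hypothetical per-thread state for a conditional yieldpoint."""

    def __init__(self):
        self.yield_requested = False  # set by whoever wants this thread to yield
        self.yields_taken = 0

    def yieldpoint(self):
        # Fast path: one flag test and a branch that is almost never taken.
        if self.yield_requested:
            # Slow path: here a real runtime would report roots, park, etc.
            self.yield_requested = False
            self.yields_taken += 1


def run_loop(mutator, iterations, request_at):
    """Run a loop with a yieldpoint on every iteration (every back arc)."""
    total = 0
    for i in range(iterations):
        if i == request_at:
            # Stands in for another thread raising the request.
            mutator.yield_requested = True
        total += i
        mutator.yieldpoint()
    return total
```

The request is noticed on the iteration where it is raised, because the test runs once per trip around the loop; the cost is that the test also runs on every other trip.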

5:24:16 drmeister That is faster than a comparison and a branch? I guess the slow path is a lot slower - right?

5:24:43 Bike well, that's what they get numbers for

5:24:59 Bike it doesn't seem too much worse based on these graphs

5:25:46 Bike I think figure 5 is the one you want. sometimes the traps do much worse but not always.

5:26:22 drmeister Ok. I should have read the paper again. I was going off the figure 3 that Steve posted in a Zulip post and I needed the paper to decode it.

5:26:42 Bike though it depends. in the results under "Global Yieldpoints" they get a 2.5% overhead for a conditional, 2.0% for the load trap, and then 36% for the store trap

5:26:47 Bike so that's, like, quite a bit worse

5:27:40 Bike that kind of memory trap stuff isn't something i've dealt with before, so i don't know the ins and outs very well, unfortunately

5:27:57 Bike but my impression is that garbage collectors and stuff have often used these mechanisms

5:28:14 beach I plan to stay away from trap stuff as much as possible.

5:28:55 Bike any particular reason? other than that it looks kind of magical in disassembly

5:29:14 beach Gut feeling. It feels messy to deal with.

5:29:58 beach And it is not clear to me what the performance penalty of invoking the operating system would be.

5:33:17 Bike yeah, fair enough

5:35:00 beach What are all the cases that yieldpoints are used for?

5:36:45 beach I know I need something like that when the global collector requests its "roots" from the nursery collectors.

5:38:17 Bike they mention garbage collection, "user-level thread preemption" so i guess interrupt-thread, code patching, "biased locking" which from a quick google is probably irrelevant whatever it is, and profiling.

5:39:18 Bike the paper is mostly about yieldpoints themselves. doesn't go into the applications too much

5:39:31 beach Sure.

5:39:52 beach Thread preemption is important I suppose.

5:40:39 beach I guess my GC case is a special case of thread preemption.

5:41:18 Bike we actually have problems in clasp with thread interrupts. you can only interrupt threads at safe/yield points, which for now are allocations. So if a thread is in a loop that doesn't allocate, you can't interrupt it.

5:41:35 beach Oh, yes, that's a problem.

5:41:47 Bike i suppose cleavir is probably well enough developed now that i could whip up an insertion pass. i know you already wrote some loop detection code

5:41:52 beach You need at least back arcs and function calls.

5:42:38 beach With loop unrolling back arcs can be made less frequent.

5:44:04 beach Hmm. I am thinking that one could have a counter on back arcs, so only test the yieldpoint every 100 times or so.

5:44:58 beach ... or replace 100 by whatever number makes the yieldpoint test insignificant.
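
The counter idea could look roughly like the following sketch, assuming a hypothetical `check_every` parameter in place of the "100 or so". The real yieldpoint test is replaced by a counter increment so the effect can be observed.

```python
def run_with_sparse_yieldpoints(iterations, check_every=100):
    """Count how often the yieldpoint test would run in a loop.

    The countdown is decremented on every back arc; the (stand-in)
    yieldpoint test only runs when it reaches zero.
    """
    checks = 0
    countdown = check_every
    for _ in range(iterations):
        # ... loop body would go here ...
        countdown -= 1
        if countdown == 0:
            countdown = check_every
            checks += 1  # stands in for the real yieldpoint test
    return checks
```

Note that the countdown is itself a decrement and a branch per iteration, so this presumably only pays off when the yieldpoint test costs more than that, for instance when it involves a memory load.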

5:45:37 Bike you could work type inference into it. if the code has (loop for i below n ...) and n is an (integer 0 100) the compiler doesn't bother inserting points.

5:45:52 beach Yes, that's a very good idea.
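
A sketch of the decision such an insertion pass might make, assuming the compiler can hand over an inferred upper bound on the trip count. The function `needs_yieldpoint` and its `threshold` parameter are hypothetical; this is not Cleavir's API.

```python
def needs_yieldpoint(upper_bound, threshold=100):
    """Decide whether a loop's back arc should get a yieldpoint.

    upper_bound is the inferred maximum trip count, or None when type
    inference could not bound it. Unbounded loops always get one.
    """
    if upper_bound is None:
        return True
    return upper_bound > threshold
```

For (loop for i below n ...) with n an (integer 0 100), the bound is 100 and no yieldpoint is inserted. The sketch ignores what the loop body does; a short loop around a long-running body would still need a yieldpoint somewhere inside the body.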

5:48:34 beach Here is another idea. Instead of checking at function calls, check at function returns. Then it can be done by modifying return addresses on the stack.

5:48:46 beach Tail calls would have to be tested though.

5:48:54 beach But that's detectable at compile time.

5:49:51 beach Though, I guess modifying the stack would require some synchronization. So maybe not.

5:54:34 splittist Good morning

5:59:19 ebrasca Morning splittist!

6:10:14 beach Hello splittist.

6:17:01 ebrasca Do you plan to support changing endianness at run time?

6:30:21 no-defun-allowed I take it you can change endianness on PowerPC?

6:31:02 no-defun-allowed Is there a use for it, may I ask?

6:35:37 ebrasca no-defun-allowed: You start in big endian; if you want to change to little endian, you need to run some instructions in big endian.

6:36:53 no-defun-allowed I see. Wouldn't you decide when the executable starts? Generating code for both big- and little-endian to support both seems silly even for #sicl standards.

6:38:01 ebrasca no-defun-allowed: To call OPAL you need to be running in big endian.

6:39:24 no-defun-allowed I take it OPAL is how the operating system calls into firmware?

6:39:51 no-defun-allowed If so, you might just keep running in big endian (assuming nothing else requires little endian).

6:41:17 ebrasca I think you can run everything in big endian.

6:42:45 no-defun-allowed That looks like the easiest approach.

6:43:27 ebrasca What about 32bit mode and 64bit mode?

6:43:48 no-defun-allowed stylewarning told me "no one likes big endian" (when I asked about it while porting a Smalltalk implementation to the Wii, which was stupid enough to not do its own endianness conversion when loading images), but evidently the OPAL people like big endian.

6:44:21 no-defun-allowed I think SICL will be 64-bit only, and I haven't heard of any language implementation which lets you switch between 32-bit and 64-bit code as such.

6:44:49 moon-child nasm lets you do that

6:44:51 moon-child 'bits 32'

6:44:53 no-defun-allowed From memory, one switches between modes on x86-64 through the code segment descriptor, which has a bit for 32/64-bit code, and only the OS can set up those descriptors.

6:44:58 moon-child (maybe that's cheating... :)

6:45:04 ebrasca I don't care if it is little or big.

6:45:22 no-defun-allowed Sure, but an executable with 32-bit and 64-bit code can't use both.

6:46:24 no-defun-allowed From memory, macOS has "fat" binaries which could have both PowerPC and x86, and then AMD64 and AArch64 code, but that selects one architecture at load-time.

6:46:53 ebrasca OK I only need to think 64-bit with big endian.

6:49:03 no-defun-allowed Perhaps I need to modify that Smalltalk to do endianness conversion, because I really dislike that one should have to modify the Xerox image to run it on another computer - wasn't the whole point that it was device independent? But I recall that advice came from Xerox, which makes it harder to argue with.
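
A sketch of the kind of load-time conversion meant here, assuming for illustration that the image is a flat sequence of 16-bit words. The function name `swap_words16` is hypothetical, and a real image loader would also have to leave byte-oriented data (such as strings) unswapped.

```python
import struct


def swap_words16(data: bytes) -> bytes:
    """Convert a buffer of big-endian 16-bit words to little-endian."""
    if len(data) % 2 != 0:
        raise ValueError("buffer length must be a multiple of 2")
    count = len(data) // 2
    # Read every word as big-endian, then write it back as little-endian.
    words = struct.unpack(f">{count}H", data)
    return struct.pack(f"<{count}H", *words)
```

Applying the function twice gives back the original buffer, so the same routine converts in either direction.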

6:49:22 beach I don't imagine switching between endian-ness, and I am not planning to support 32-bit processors.

6:53:38 ebrasca beach: Do you need help with something?

6:54:40 beach I have been preoccupied for the past few days. I'll think of something today I hope.

6:58:09 ebrasca beach:

7:05:54 ebrasca beach: Is it hard to think of some task for others?

7:08:00 beach It depends on the person. I don't think you know enough about the low-level details of implementing Common Lisp to take on something like compiler optimization or generic dispatch. There is lots of very mundane stuff, but that's too trivial for you, so I don't want to give you that. There is not much in the middle.

7:08:51 no-defun-allowed Would there be any other modules left for implementation?

7:09:17 beach Let me check...

7:09:41 no-defun-allowed The first that comes to mind is a random number generator, because I spent a few weeks writing fast RNGs for simulation.

7:09:54 beach Yeah, I was just thinking about that too.

7:10:44 beach Also, chapter 25 Environment.

7:11:14 beach DECODE/ENCODE-UNIVERSAL-TIME for instance.

7:11:24 beach DESCRIBE.

7:12:49 beach Pathnames too I guess.

7:13:29 beach Yeah, chapters 19 and 25 I guess.

7:13:44 beach ebrasca: If you want to take on any of those, please go ahead.
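
For a feel of what DECODE/ENCODE-UNIVERSAL-TIME involves: universal time counts seconds from midnight, January 1, 1900 GMT. The sketch below is in Python for illustration and handles only the first six components in UTC; the Common Lisp functions additionally deal with day of week, daylight saving and time zone. The function names are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Start of universal time: 1900-01-01 00:00:00 GMT.
UNIVERSAL_EPOCH = datetime(1900, 1, 1, tzinfo=timezone.utc)


def decode_universal_time_utc(universal_time):
    """Return (second, minute, hour, date, month, year) in UTC."""
    dt = UNIVERSAL_EPOCH + timedelta(seconds=universal_time)
    return (dt.second, dt.minute, dt.hour, dt.day, dt.month, dt.year)


def encode_universal_time_utc(second, minute, hour, date, month, year):
    """Inverse of decode_universal_time_utc for UTC components."""
    dt = datetime(year, month, date, hour, minute, second, tzinfo=timezone.utc)
    delta = dt - UNIVERSAL_EPOCH
    # Use integer arithmetic rather than total_seconds() to stay exact.
    return delta.days * 86400 + delta.seconds
```

The offset between universal time and Unix time is 2208988800 seconds, which is what the encoder yields for January 1, 1970.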

8:01:51 heisig Good morning!

8:02:16 no-defun-allowed Good morning heisig!

8:02:57 beach Hello heisig.

8:03:05 splittist ho heisig

8:13:42 ebrasca beach: 25 is Environment, I think it is Clostrum.

8:14:35 beach No, as it turns out, what the Common Lisp HyperSpec thinks of as Environment is something different.

8:15:09 beach As you can see in that chapter, there is nothing that resembles Clostrum.

8:17:04 ebrasca I don't know what Clostrum is. I have just read the description.

8:18:33 beach How about you go read the documentation then. That would be quite appropriate for you.

8:26:03 ebrasca I assumed it was, due to the description "First Class Global Environments".

8:27:04 beach If you forget about that chapter for a second, then the word "environment" in the context of software often means "a mapping from names to objects".

8:27:51 beach So Clostrum is about mapping names of functions to functions, names of packages to packages, names of classes to classes, names of method combinations to method combinations, names of types to types, names of SETF expanders to SETF expanders, etc.

8:28:47 beach So you will find functions in Clostrum such as FDEFINITION, FIND-PACKAGE, FIND-CLASS, etc.
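
A toy sketch of "environment as a mapping from names to objects", in Python for illustration. The class and method names below only echo the Common Lisp function names mentioned above; this is not Clostrum's actual API, and the separate dictionaries per namespace are an assumption of the sketch.

```python
class Environment:
    """Hypothetical first-class global environment with separate namespaces."""

    def __init__(self):
        self._functions = {}  # function names -> functions
        self._classes = {}    # class names -> classes
        self._packages = {}   # package names -> packages

    def fdefinition(self, name):
        # Signals an error (KeyError here) when the name is unbound.
        return self._functions[name]

    def set_fdefinition(self, name, function):
        self._functions[name] = function

    def find_class(self, name):
        # Returns None when no class has that name.
        return self._classes.get(name)

    def set_find_class(self, name, cls):
        self._classes[name] = cls

    def find_package(self, name):
        return self._packages.get(name)

    def set_find_package(self, name, package):
        self._packages[name] = package
```

Because the environment is an ordinary object, two of them can coexist with different definitions for the same name, which is the "first class" part.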

8:35:38 ebrasca beach: I am going to check 19 first.

8:36:03 beach OK.

9:07:38 frodef good morning

9:08:11 beach Hello frodef.

9:08:25 frodef This is how far the linux kernel goes to avoid simple indirect calls: https://www.usenix.org/system/files/atc19-amit.pdf

9:10:20 beach Hmm.

9:10:36 frodef Never mind multiple dispatch and method combinations :-)

9:10:43 beach Yeah.

9:20:33 heisig Do I understand correctly that most problems related to Spectre are because people want to execute closed-source software from dubious sources on their machines?

9:21:36 no-defun-allowed You apparently can do it with JavaScript code. It depends if you think the Web is a dubious source...

9:21:38 moon-child I think the concerns are more to do with js than native code

9:22:26 heisig Still, the problem seems to be closed-source JS from parties we don't trust.

9:23:19 no-defun-allowed My policy of disallowing arbitrary network writes with Netfarm hopefully avoids making such an attack useful.

9:24:09 no-defun-allowed Then you'd have to communicate the data you stole by retrieving objects somehow, and you can't create arbitrary references, so how would you encode anything? Maybe there's a way.

9:27:09 heisig I just want to make sure that we don't try to solve a social problem (trustworthiness in the digital age) by technical means (retpoline, ...).

9:28:21 heisig And yes, I consider the web a dubious source. At least while we don't have a reliable infrastructure for managing trustworthiness.

9:28:56 shka_ social problems ARE solved by technical means

9:29:03 shka_ over long periods of time

9:29:31 no-defun-allowed On the other hand, how do you convince a JavaScript engine to generate code to speculatively execute bad code?

9:30:07 no-defun-allowed Those tend to invent a new kind of exploit every year, or can you do it in correct JS?

9:34:13 no-defun-allowed (But my approach is pretty mundane, compared to the emerging "alternative" of making one's computer a troff + VT100 emulator.)

9:34:42 heisig no-defun-allowed: If you allow for arbitrary JavaScript code, the question is more like 'Can you prove that there is no program for which your compiler will ever emit bad code'.

9:35:14 no-defun-allowed Indeed. I'm now wondering if you can do it without exploiting the compiler though.

9:35:15 heisig I'm willing to bet the answer to that will always be zero (Unless your compiler is severely restricted).

9:35:39 heisig zero->no

9:37:02 heisig shka_: If that were the case, our top computer scientists wouldn't be fighting for us to keep using paper ballots.

9:37:29 heisig Our computers are not (yet?) very trustworthy.

9:42:22 beach Especially since the computer that manages the motherboard is apparently totally insecure.

9:42:47 beach And apparently, the big chip manufacturers have an on-chip web server.

9:43:52 beach There was an article in a recent CACM about the motherboard management computer.

9:43:58 beach It was quite scary.

9:44:54 shka_ it really is