freenode/#sicl - IRC Chatlog

3:21:38 aeth_ ** NICK aeth

3:27:30 aeth__ ** NICK aeth

4:12:43 beach Good morning everyone!

4:20:33 no-defun-allowed Good morning beach!

5:11:56 ebrasca Good morning!

5:13:45 beach Hello ebrasca.

5:21:31 drmeister Bike: In the paper that you posted (that I got from Steve Blackburn) what are the different yieldpoint methods in Figure 3 describing?

5:22:20 drmeister I understand the (a) conditional one (I think). But (b) and (c) - I don't see how they work.

5:23:17 no-defun-allowed Some Java virtual machines cause threads to read or write an address, which causes a segfault, which then gets handled.

5:23:30 Bike they're explained under "Trap-Based Polling Yieldpoints". Basically, when the yieldpoint is hit it does a meaningless memory operation on some page. When you want the yieldpoint to activate, you protect the page so that memory operations on it cause the system to trigger an interrupt

5:23:35 Bike yeah.

5:23:50 no-defun-allowed That's a better explanation of it.

5:24:16 drmeister That is faster than a comparison and a branch? I guess the slow path is a lot slower - right?

5:24:43 Bike well, that's what they get numbers for

5:24:59 Bike it doesn't seem too much worse based on these graphs

5:25:46 Bike I think figure 5 is the one you want. sometimes the traps do much worse but not always.

5:26:22 drmeister Ok. I should have read the paper again. I was going off the figure 3 that Steve posted in a Zulip post and I needed the paper to decode it.

5:26:42 Bike though it depends. in the results under "Global Yieldpoints" they get a 2.5% overhead for a conditional, 2.0% for the load trap, and then 36% for the store trap

5:26:47 Bike so that's, like, quite a bit worse

5:27:40 Bike that kind of memory trap stuff isn't something i've dealt with before, so i don't know the ins and outs very well, unfortunately

5:27:57 Bike but my impression is that garbage collectors and stuff have often used these mechanisms

5:28:14 beach I plan to stay away from trap stuff as much as possible.

5:28:55 Bike any particular reason? other than that it looks kind of magical in disassembly

5:29:14 beach Gut feeling. It feels messy to deal with.

5:29:58 beach And it is not clear to me what the performance penalty of invoking the operating system would be.

5:33:17 Bike yeah, fair enough

5:35:00 beach What are all the cases that yieldpoints are used for?

5:36:45 beach I know I need something like that when the global collector requests its "roots" from the nursery collectors.

5:38:17 Bike they mention garbage collection, "user-level thread preemption" so i guess interrupt-thread, code patching, "biased locking" which from a quick google is probably irrelevant whatever it is, and profiling.

5:39:18 Bike the paper is mostly about yieldpoints themselves. doesn't go into the applications too much

5:39:31 beach Sure.

5:39:52 beach Thread preemption is important I suppose.

5:40:39 beach I guess my GC case is a special case of thread preemption.

5:41:18 Bike we actually have problems in clasp with thread interrupts. you can only interrupt threads at safe/yield points, which for now are allocations. So if a thread is in a loop that doesn't allocate, you can't interrupt it.

5:41:35 beach Oh, yes, that's a problem.

5:41:47 Bike i suppose cleavir is probably well enough developed now that i could whip up an insertion pass. i know you already wrote some loop detection code

5:41:52 beach You need at least back arrows and function calls.

5:42:38 beach With loop unrolling back arcs can be made less frequent.

5:44:04 beach Hmm. I am thinking that one could have a counter on back arcs, so only test the yieldpoint every 100 times or so.

5:44:58 beach ... or replace 100 by whatever number makes the yieldpoint test insignificant.

5:45:37 Bike you could work type inference into it. if the code has (loop for i below n ...) and n is an (integer 0 100) the compiler doesn't bother inserting points.

5:45:52 beach Yes, that's a very good idea.

5:48:34 beach Here is another idea. Instead of checking at function calls, check at function returns. Then it can be done by modifying return addresses on the stack.

5:48:46 beach Tail calls would have to be tested though.

5:48:54 beach But that's detectable at compile time.

5:49:51 beach Though, I guess modifying the stack would require some synchronization. So maybe not.

5:54:34 splittist Good morning

5:59:19 ebrasca Morning splittist!

6:10:14 beach Hello splittist.

6:17:01 ebrasca Do you plan to support changing endianes at run time?

6:30:21 no-defun-allowed I take it you can change endianness on PowerPC?

6:31:02 no-defun-allowed Is there a use for it, may I ask?

6:35:37 ebrasca no-defun-allowed: You start in big endian , if you like to change to litle endian you need to run some istructions in big endian.

6:36:53 no-defun-allowed I see. Wouldn't you decide when the executable starts? Generating code for both big- and little-endian to support both seems silly even for #sicl standards.

6:38:01 ebrasca no-defun-allowed: To call OPAL you need to be running in big endian.

6:39:24 no-defun-allowed I take it OPAL is how the operating system calls into firmware?

6:39:51 no-defun-allowed If so, you might just keep running in big endian (assuming nothing else requires little endian).

6:41:17 ebrasca I think you can run everiting in big endian.

6:42:45 no-defun-allowed That looks like the easiest approach.

6:43:27 ebrasca What about 32bit mode and 64bit mode?

6:43:48 no-defun-allowed Although stylewarning told me "no one likes big endian" (when I asked about it while porting a Smalltalk implementation to the Wii, which was stupid enough to not do its own endianness conversion when loading images), but evidently the OPAL people like big endian.

6:44:21 no-defun-allowed I think SICL will be 64-bit only, and I haven't heard of any language implementation which lets you switch between 32-bit and 64-bit code as such.

6:44:49 moon-child nasm lets you do that

6:44:51 moon-child 'bits 32'

6:44:53 no-defun-allowed From memory, one switches between modes on x86-64 by modifying the page table, which has a bit for 32/64-bit code, so clearly only the OS can do that.

6:44:58 moon-child (maybe that's cheating... :)

6:45:04 ebrasca I don't care if it is litle or big.

6:45:22 no-defun-allowed Sure, but an executable with 32-bit and 64-bit code can't use both.

6:46:24 no-defun-allowed From memory, macOS has "fat" binaries which could have both PowerPC and x86, and then AMD64 and AArch64 code, but that selects one architecture at load-time.

6:46:53 ebrasca OK I only need to think 64-bit with big endian.

6:49:03 no-defun-allowed Perhaps I need to modify that Smalltalk to do endianness conversion, because I really dislike that one should have to modify the Xerox image to run it on another computer - wasn't the whole point that it was device independent? But I recall that advice came from Xerox, which makes it harder to argue with.

6:49:22 beach I don't imagine switching between endian-ness, and I am not planning to support 32-bit processors.

6:53:38 ebrasca beach: Do you need help with someting?

6:54:40 beach I have been preoccupied for the past few days. I'll think of something today I hope.

6:58:09 ebrasca beach:

7:05:54 ebrasca beach: Is it hard to think some task for others?

7:08:00 beach It depends on the person. I don't think you know enough about the low-level details of implementing Common Lisp to take on something like compiler optimization or generic dispatch. There is lots of very mundane stuff, but that's too trivial for you, so I don't want to give you that. There is not much in the middle.

7:08:51 no-defun-allowed Would there be any other modules left for implementation?

7:09:17 beach Let me check...

7:09:41 no-defun-allowed The first that comes to mind is a random number generator, because I spent a few weeks writing fast RNGs for simulation.

7:09:54 beach Yeah, I was just thinking about that too.

7:10:44 beach Also, chapter 25 Environment.

7:11:14 beach DECODE/ENCODE-UNIVERSAL-TIME for instance.

7:11:24 beach DESCRIBE.

7:12:49 beach Pathnames too I guess.

7:13:29 beach Yeah, chapters 19 and 25 I guess.

7:13:44 beach ebrasca: If you want to take on any of those, please go ahead.

8:01:51 heisig Good morning!

8:02:16 no-defun-allowed Good morning heisig!

8:02:57 beach Hello heisig.

8:03:05 splittist ho heisig

8:13:42 ebrasca beach: 25 is Environment , I think it is Clostrum.

8:14:35 beach No, as it turns out, what the Common Lisp HyperSpec thinks of as Environment is something different.

8:15:09 beach As you can see in that chapter, there is nothing that resembles Clostrum.

8:17:04 ebrasca I don't khow what Clostrum is. I have just read the description.

8:18:33 beach How about you go read the documentation then. That would be quite appropriate for you.

8:26:03 ebrasca I asumed it is due to the description "First Class Global Environments".

8:27:04 beach If you forget about that chapter for a second, then the word "environment" in the context of software often means "a mapping from names to objects".

8:27:51 beach So Clostrum is about mapping names of functions to functions, names of packages to packages, names of classes to classes, names of method combinations to method combinations, names of types to types, names of SETF expanders to SETF expanders, etc.

8:28:47 beach So you will find functions in Clostrum such as FDEFINITION, FIND-PACKAGE, FIND-CLASS, etc.

8:35:38 ebrasca beach: I am going to check 19 first.

8:36:03 beach OK.

9:07:38 frodef good morning

9:08:11 beach Hello frodef.

9:08:25 frodef This is how far the linux kernel goes to avoid simple indirect calls: https://www.usenix.org/system/files/atc19-amit.pdf

9:10:20 beach Hmm.

9:10:36 frodef Nevermind multiple dispatch and method combinations :-)

9:10:43 beach Yeah.

9:20:33 heisig Do I understand correctly that most problems related to Spectre are because people want to execute closed-source software from dubious sources on their machines?

9:21:36 no-defun-allowed You apparently can do it with JavaScript code. It depends if you think the Web is a dubious source...

9:21:38 moon-child I think the concerns are more to do with js than native code

9:22:26 heisig Still, the problems seems to be closed-source JS from parties we don't trust.

9:23:19 no-defun-allowed My policy of disallowing arbitrary network writes with Netfarm hopefully avoids making such an attack useful.

9:24:09 no-defun-allowed Then you'd have to communicate the data you stole by retrieving objects somehow, and you can't create arbitrary references, so how would you encode anything? Maybe there's a way.

9:27:09 heisig I just want to make sure that we don't try to solve a social problem (trustworthiness in the digital age) by technical means (retpoline, ...).

9:28:21 heisig And yes, I consider the web a dubious source. At least while we don't have a reliable infrastructure for managing trustworthiness.

9:28:56 shka_ social problems ARE solved by the technically means

9:29:03 shka_ over the long periods of time

9:29:31 no-defun-allowed On the other hand, how do you convince a JavaScript engine to generate code to speculatively execute bad code?

9:30:07 no-defun-allowed Those tend to invent a new kind of exploit every year, or can you do it in correct JS?

9:34:13 no-defun-allowed (But my approach is pretty mundane, compared to the emerging "alternative" of making one's computer a troff + VT100 emulator.)

9:34:42 heisig no-defun-allowed: If you allow for arbitrary JavaScript code, the question is more like 'Can you prove that there is no program for which your compiler will ever emit bad code'.

9:35:14 no-defun-allowed Indeed. I'm now wondering if you can do it without exploiting the compiler though.

9:35:15 heisig I'm willing to bet the answer to that will always be zero (Unless your compiler is severely restricted).

9:35:39 heisig zero->no

9:37:02 heisig shka_: If that were the case, our top computer scientists wouldn't be fighting for us to keep using paper ballots.

9:37:29 heisig Our computers are not (yet?) very trustworthy.

9:42:22 beach Especially since the computer that manages the motherboard is apparently totally insecure.

9:42:47 beach And apparently, the big chip manufacturers have an on-chip web server.

9:43:52 beach There was an article in a recent CACM about the motherboard management computer.

9:43:58 beach It was quite scary.

9:44:54 shka_ it really is

9:45:50 shka_ and the Intel Management Engine is some real shady stuff

9:46:31 heisig Mine is disabled. (Thanks Purism)

9:47:38 heisig But the scariest part is that most people are not ashamed of carrying such crap devices in their pockets.

9:48:07 frodef Seems to me that Spectre shows that securely running untrusted code in any shape or form is very very difficult.

9:48:52 no-defun-allowed ACTION likes to live life dangerously. Okay, maybe this is too dangerous.

9:50:43 heisig Anyway, I didn't catch enough sleep, so don't listen to my ramblings :)

9:51:07 shka_ frodef: more generally speaking, i would say that "hardware security guarantees" is a flawed concept

9:51:27 shka_ because it is just as likely to be buggy as software, but more difficult to patch

9:51:49 shka_ MS was right with the singularity

10:00:09 beach I also read an article about how easy it is to install a trap door in a chip immediately before manufacturing, so that not even the chip designer (who will of course outsource the manufacturing to China) knows about it. It would be some simple thing that enables supervisor mode or something like that.

10:01:20 beach Anyway, I think we have made a mess of our computing environment(s), just as we did with the natural one.

10:13:02 ebrasca In ppc64 is save , you can change anything you don't like.

10:14:03 beach That's good.

10:15:27 ebrasca If I recal correctly ppc64 plan to use encription in the cpu to make it more save.

10:18:26 ebrasca beach: if cpu does have encription modules inside , does it help againt "trap doors"?

10:18:51 no-defun-allowed There are those "trusted modules" already, but I always wondered how trustable they are.

10:21:42 jackdaniel I'm sure that hardware companies with roots in USA are very trustworthy, they even have a national agance that ensures the security ;)

10:22:27 jackdaniel (not to mention that they in fact manufacture in China - another country with high moral standards when it comes to transparency)

10:22:45 no-defun-allowed Perhaps I should learn how to fab processors myself.

10:23:22 ebrasca The ppc64 is the most save CPU I khow.

10:23:25 frodef hardware moonshining? Very naughty.

10:23:36 beach ebrasca: I doubt it. There is a bit that tells whether the processor is in supervisor mode. Apparently, it is enough to hook up a capacitor to it. Then it can be enabled by a certain sequence of instructions.

10:26:00 beach ebrasca: Friendly advice: You should work on your English. It is easy to take people less seriously of they make lots of mistakes.

10:26:22 beach ebrasca: "recall", "encryption", "safe".

10:27:27 ebrasca ACTION thinks cl-filenames is good name.

10:28:50 ebrasca beach: Sory!

10:29:20 beach ebrasca: If you write the library, you get to name it.

10:32:06 ebrasca beach: I am interested in this part "A conforming implementation is free to accommodate other file system features in its pathname representation and provides a parser that can process such specifications in namestrings.".

10:33:17 beach What makes you interested in it?

10:36:09 ebrasca I think I need to make easy to accommodate this features.

10:36:46 beach That would certainly be good.

10:42:24 ebrasca beach: I guess I can start with Mezzano filenames as my base.

10:42:34 beach Sure.

11:07:58 frodef The Spectre attack is quite interesting from a runtime perspective. Also the implied focus on indirect branch speed/optimization.

11:12:31 beach By "from a runtime perspective", do you mean "from the perspective of designing a `runtime' for some language implementation"? Or just "runtime" in general?

11:16:37 ebrasca ACTION goes AFK.

11:18:59 frodef the former, I suppose.

11:21:28 pjb I thought that most problems related to Spectre were that they thought that more money was to be made from extortion and blackmailing (along with slicing spies in two), than by honest capitalist means.

11:22:22 pjb https://www.youtube.com/watch?v=jTmXHvGZiSY

11:31:22 frodef from a security standpoint, it reveals a flaw in the idea that one can really isolate code by the semi-virtual machine that is "userspace".

11:32:32 frodef ...although this even appears in pure javascript as soon as one isn't very careful about the primitives being made available.

11:34:16 frodef (the existence of javascript "worker threads" yields high-resolition timers that can be used to extract secret information.)

11:34:30 jackdaniel I had a bad joke, but I've passed on it :)

11:35:47 frodef bad jokes are the best jokes :)

11:36:10 heisig My preferred approach to security is to know the developers that wrote my software, to a degree that I could locate them and hit them with a stick.

11:36:31 jackdaniel OK, then here it goes (sorry heisig!) -- just add arbitrary delays to each operation, i.e (1+ heisig) ; that is 25 nanoseconds

11:36:32 frodef heisig: I suspect you

11:36:33 pjb well, when you see all the side channels that can be exploited for data exfiltration even in air-gapped computers...

11:36:40 frodef 'll have a scalability problem.

11:37:11 pjb you can imagine than for stuff running on the same hardware, anything is trivial.

11:38:05 frodef pjb: that's a thing, too, in data-centers?

11:38:51 pjb frodef: I'm sure those exploits can be used in data-center to spy to other servers…

11:39:30 pjb Look at that for example: https://www.securityweek.com/ram-generated-wi-fi-signals-allow-data-exfiltration-air-gapped-systems

11:39:39 frodef are there other ways to extract information that timing?

11:40:10 pjb There's a GSM version: https://dl.acm.org/doi/10.5555/2831143.2831197

11:44:29 frodef pjb: That seems like a force that could be used for good: Implement wifi without wifi hardware :)

11:59:22 pjb frodef: theorically. But in practice, there's energy management problems. It's good when you have receiver hardware that can detect low power (RAM level) wifi…

12:00:47 pjb The thing in this use case, is to be able to transmit data even weakly, using any (limited) resource at hand, but it doesn't restrict the resources you can use to listen to it. You can have a truck of instruments in the road near the building…

12:01:38 frodef pjb: so you can have a special device in your pocket, run your "special program" on location, and have everything downloading without plugging in a pendrive or anything. Or a truck outside..

12:01:51 frodef downloadED

12:02:09 pjb Yes.

12:22:37 ebrasca beach: Are Filenames urgent or I can take my time?

12:55:25 beach ebrasca: Take your time.