freenode/#sicl - IRC Chatlog

22:45:39 Bike the cst quasiquote makes some stuff in cst-to-ast look nicer, actually. hm.

23:12:13 aeth_ ** NICK aeth

23:47:03 no-defun-allowed I changed the hash function that I gave to Luckless, and it worked again.

23:48:06 no-defun-allowed Shinmera: Instead of it either blowing the stack or going incredibly slowly, would it be a good idea for the hash table to signal an error if it has resized however many times, but still can't get a key into the table?

23:52:45 Bike i feel like there ought to be some way to speed up cst:reconstruct, but since practically any macro is going to insert code that's not in the original form, it seems like we have to run an exhaustive search regardless

23:52:55 Bike maybe i could put in some kind of hook for CST-aware macro functions

23:53:02 Bike would make saving inline definitions nicer too...

0:05:57 Bike this might be an interesting case for meters. i can put in meters and see which particular macros slow things down

0:06:03 Bike probably ultra common ones like when or cond...

0:18:32 Bike i guess if circular CSTs were marked as such, they could just be searched without consing up a hash table and stuff.

0:18:47 Bike i mean, non circular CSTs could.

0:23:33 no-defun-allowed What was that instruction traversing thing which would benefit from fast hash table misses again?

0:24:21 Bike map-instructions-arbitrary-order, but as i said, in cleavir now that's no longer an issue.

0:26:05 no-defun-allowed It would only have to check if instructions with multiple predecessors were already mapped, so the table would be used very infrequently. But if it's not an issue, then no problem.

0:26:36 Bike oh, that's true enough. we didn't do that optimization unfortunately.

0:30:29 no-defun-allowed Oh, another CHT thing: should we use the same test function for keys and values? In my case, EQ works fine for testing values when doing conditional updates, as I share structure for everything.

0:31:03 Bike why do you need to test values?

0:31:53 no-defun-allowed We test values for conditional updates.

0:32:36 no-defun-allowed e.g there's a TRY-REMHASH which removes an entry iff the value is some expected value. It's kind of like CASing an entry.

0:33:18 Bike like, this is an extension? or do you actually need it to operate the table?

0:33:48 no-defun-allowed I would say it's necessary for concurrent operation without locking.

0:34:29 Bike but i mean is this an exported interface or no

0:34:44 no-defun-allowed It is part of the concurrent-hash-tables and luckless protocols.

0:34:56 Bike so yes. okay.

0:35:28 Bike well, i don't see any reason the keys and values should necessarily use the same test function

0:35:55 Bike for example, say you have a hash table from function names to functions. then the names have a test function of EQUAL for setf names, but the values can be compared with EQ fine.

0:36:08 no-defun-allowed I have one table mappings strings to lists of source information objects, so the keys — yeah, same thing.

0:42:29 Bike the other day i decided a "real" lisp CAS should have a comparator argument, but i can't think of much reason to do fancy stuff except compare integers via EQL instead of EQ

0:43:01 Bike you can build up fancier comparisons from an EQ one if you don't mind looping, anyway

0:44:25 no-defun-allowed That's basically what we end up doing.

3:04:48 Bike no-defun-allowed: https://github.com/no-defun-allowed/luckless/blob/master/hashtable.lisp#L251 if you had an unconditional atomic store, it could be used here, right?

3:04:52 Bike i see the java does this loop too...

3:07:21 no-defun-allowed Yes.

3:10:11 Bike right.

3:10:28 Bike there was a bit like that in Clasp's CLOS until i added a store. the loop annoyed me

4:02:37 beach Good morning everyone!

4:02:38 Colleen beach: kpoeck said 8 hours, 7 minutes ago: I will change clasp wrt. importing uninterned symbols

4:02:50 no-defun-allowed Good morning beach!

4:07:51 Bike good morning

4:13:24 beach Gnuxie[m]: I have not given any thought to protection against timing attacks. Sorry.

4:14:27 beach Bike: What is CST quasiquote?

4:14:31 beach Did you do that?

4:15:00 Bike yes. i mean, just in this branch.

4:15:12 Bike it's a macro that's like a quasiquote macro, except with CSTs.

4:15:18 beach I see. And it does make a lot of sense to have something like that.

4:19:27 beach no-defun-allowed: That's a clever optimization trick, i.e. checking only instructions with multiple predecessors, and none of us thought about it. While I am still using the SICL-specific Cleavir, I might put it in. But I am hoping to convert to s-expressionists Cleavir some day.

4:51:49 no-defun-allowed If I want to learn about register allocation, what kind of allocators would I start with? I am guessing the one you are writing about would take a while to understand (if the reader doesn't know about register allocation).

4:53:18 Bike i've been flipping through https://www.cs.utexas.edu/~mckinley/380C/lecs/briggs-thesis-1992.pdf. it's old enough that it explains stuff pretty simply.

4:53:29 no-defun-allowed Thanks!

4:53:38 beach The one based on graph coloring is easy to understand, including the fairly clever heuristic to get around the intrinsic intractability of the problem.

4:56:32 beach Bike: Nice reference.

4:59:06 beach no-defun-allowed: My idea is actually much simpler than graph coloring.

4:59:43 beach The stuff I wrote yesterday is complicated, but not because of the algorithm, but because of the stupid x86 restrictions.

4:59:50 no-defun-allowed Right, okay.

5:02:02 beach The idea is very simple: You find yourself at a particular program point with a particular assignment of locations to lexical variables, some on the stack, some in registers, and some in both. You now need a lexical variable that is not in a register, and you are out of registers. The idea is to spill (store it on the stack) the variable that is going to be used the furthest in the future.

5:03:21 no-defun-allowed Gotcha.

5:03:32 beach Doesn't sound complicated, does it?

5:04:33 no-defun-allowed No. It reminds me of reading about linear register allocation, of which one implementation spills the least recently used register.

5:05:13 beach Yes, LRU and OPT are two page-replacement algorithms. Mine is based on OPT.

5:06:09 no-defun-allowed Right.

5:06:48 beach LRU is used because with page replacement, we can't predict the future.

5:07:04 beach But since we know the instruction graph, we actually can.

6:26:35 splittist Good morning

6:26:59 no-defun-allowed Good morning splittist.

6:27:57 beach Hello splittist.

7:27:23 beach With RISC-V, register allocation would have been so much easier.

7:32:31 Shinmera no-defun-allowed: I'm not sure.

7:32:43 ebrasca Hi

7:33:13 beach Hello ebrasca.

7:33:57 ebrasca beach: I don't know what else todo with cl-filenames.

7:34:12 ebrasca I hope it is not bad.

7:35:02 beach Then leave it as it is, and when I need it for SICL, I'll have a closer look.

7:35:49 beach ... or others are of course welcome to have a look before that.

7:35:52 ebrasca beach: If not thanks to Mezzano code I am not sure if I manage it.

7:36:09 beach I see.

7:36:58 ebrasca It is strange to think how it is going to fit in more than 1 implementation.

7:37:39 ebrasca beach: Thanks for helping me to make projents.

7:37:50 beach Yes, I think I understand what you mean. And that aspect is absolutely essential to the SICL project.

7:37:59 beach Pleasure.

7:38:36 ebrasca I think being more constant can help me but not sure how to do that.

7:38:59 beach What do you mean by being "constant"?

7:39:40 ebrasca Like every day at x hour or someting.

7:39:53 beach Ah, yes. I think you are right.

7:40:06 beach Also, reading papers, books, documentation, code.

7:40:43 ebrasca Making documentaition is scary.

7:41:23 beach Sure, and you have an additional problem in that you need to practice your English before writing something in that language.

7:42:30 beach It is interesting that in a typical teaching program for English (that my (admittedly small) family took when she was a student), the students are supposed to read a book per week or so. In a typical software engineering teaching program, students are supposed to read no code whatsoever.

7:43:26 ebrasca book per week , is that posibol?

7:43:34 beach Yep.

7:43:57 beach In addition to all the other homework, that is.

7:44:02 ebrasca I am not the fastest reading.

7:44:21 beach It is a skill that can be practiced, if that is what you need to do.

7:44:50 beach I was trained as an engineer, and I am a very slow reader, because I was trained to understand every detail of every sentence.

7:44:56 beach Different skills.

7:45:33 ebrasca I think undestanding is mone inportant than reading speed.

7:45:47 beach Not if you study English literature.

7:46:06 beach But for writing software, sure.

7:46:22 ebrasca How I am supose to memorize so many books?

7:46:39 beach The point is that you don't read them in order to memorize.

7:47:00 beach You read them to try to understand the difference between good writing and bad writing.

7:47:19 ebrasca I am asperger...

7:47:24 beach We should have software-engineering students do the same for code.

7:47:43 beach ... because even after 5 years of study, their code is crap.

7:48:06 beach ebrasca: That certainly changes what you can do and how you do it.

7:48:18 ebrasca Do you have some recomendation for reading?

7:48:26 beach Code?

7:48:43 beach I am fairly proud of Cluffer.

7:48:44 ebrasca programing related at least

7:49:01 beach It has good tests and reasonable documentation too.

7:49:13 beach And the domain is well defined.

7:50:11 ebrasca I have big problems with patarn maching. ( Like strings )

7:50:32 beach What problems do you have with it?

7:52:38 ebrasca beach: Don't understand ppcre and decoding binary structures is very ugly how I do it.

7:53:18 beach Regular expressions are almost always the wrong answer to a problem.

7:54:01 beach I guess it is the fault of Unix that they are so popular, since every data interchange has to be done in the form of streams of bytes.

7:54:02 ebrasca Here 1 example (cl-ppcre:scan-to-strings "(?i)from:<(.*)>" data)

7:54:23 beach Yes, regular expressions are largely "write only".

7:55:14 ebrasca I am not sure how to continue with my email server in cl.

7:55:34 beach I didn't know you were writing one.

7:56:02 ebrasca I have some part done.

7:57:39 ebrasca My email server is wip

7:58:22 beach Yes, I see.

8:01:59 no-defun-allowed CL-PPCRE (and my own engine, in its own way) allow for writing regular expressions using list structure, but I forget what the former uses.

8:03:16 no-defun-allowed ACTION notes that her accidental approach of the user numbering submatch groups is bad, and makes composition hard. To do: don't do that, number them somewhere in the compiler.

8:05:12 no-defun-allowed Ideally (or as ideal as you get while having to serialize, and use regular expressions) the regular expression would be (let ((address (group (kleene non-whitespace)))) (join "From: <" address ">")).

8:07:07 beach I guess lists are slightly better.

8:09:22 ebrasca no-defun-allowed: Can you validate input with it?

8:09:25 no-defun-allowed See the example code after <http://edicl.github.io/cl-ppcre/#create-scanner2>; I have constructor functions. But in either case, you could try to document by binding subexpressions to variables.

8:09:54 ebrasca subexpressions?

8:11:08 no-defun-allowed e.g. (defvar *word* (kleene letter)) (defvar *whitespace* (plus (either #\Space #\Newline ...))) (defvar *three-words* (join *word* *whitespace* *word* *whitespace* *word*))

8:12:44 no-defun-allowed You could almost see that like the approximate-BNF "word ::= letter*; whitespace ::= (<space> | <newline>)+; three-words ::= word whitespace word whitespace word" with some imagination.

8:14:03 ebrasca Probably I can steal the regular expresions for email.

8:15:30 ebrasca no-defun-allowed: Thanks!

8:17:41 ebrasca no-defun-allowed: Can you decode more than 1 thing at the same time?

8:24:44 no-defun-allowed I don't know, I haven't parsed any email stuff before.

8:53:10 heisig Good morning!

8:53:20 no-defun-allowed Good morning heisig!

8:54:49 beach Hello heisig.

8:56:45 ebrasca Morning heisig!

8:56:56 heisig About the performance of cst:reconstruct - scymtym had this idea of generating custom variants of all functions that are used during macroexpansion.

8:57:25 heisig These variants could automatically propagate ownership information for each datum.

9:00:27 heisig Now that I think about it, there are several problems with this idea. But it might be possible to overcome those.

9:01:08 beach What do you mean by "all functions that are used during macroexpansion"?

9:03:43 heisig All functions called in the dynamic extent of macroexpand-1. Which may be a lot.

9:05:49 beach I don't get it. If I define a macro M then the macro function of M is going to be called during macroexpansion. How do you propose writing a custom version of that macro function automatically?

9:07:53 heisig The custom version would be identical to the regular one, except that it would internally use CSTs instead of objects.

9:08:05 heisig So it can be generated automatically.

9:09:26 heisig One of the problems is when such a CSTified object is placed in a closure or a global data structure.

9:09:37 beach So if my macro definition of M calls, say, REVERSE, a CST version of REVERSE would automatically be generated as well?

9:10:37 heisig Right. I suggest this translation is made lazily, as soon as a function is called the first time in a macroexpansion context.

9:11:08 beach Hmm.

9:11:35 heisig Now that I think about it some more, there definitely needs to be a way to 'give up' this special processing and to fall back to the usual one.

9:11:49 beach Yes.

9:12:14 beach Because otherwise we would have to keep all ASTs around, including those of proprietary application code.

9:12:51 heisig Heh, I thought we don't care about proprietary application code.

9:13:06 beach We could make such a decision, sure.

9:14:21 beach But I smell some decidability issues here. Like what objects are a CSTs and what objects aren't.

9:15:10 beach And what about the code for CST itself? Are the objects it manipulates CSTs or not in this context?

9:18:32 heisig I would say every object is CSTified unconditionally. And the fact that all objects are actually CSTs is only visible to the special macroexpansion code.

9:19:03 heisig As soon as there is some confusion (like when storing a CSTified object in a global, specialized array), we give up.

9:19:52 beach Like I said, I smell decidability issues.

9:20:10 splittist Regexs can be documented. Not a great example, but one I have to hand: https://github.com/splittist/docxdjula/blob/master/regex.lisp

9:20:59 heisig You may be right. But I haven't found them, yet. If an actual CST is handled by the macroexpansion version of a function, it would be stored as a CST whose value is that CST.

9:23:43 heisig Anyway, introducing such a feature is not urgent. I just wanted to write it down for the logs.

9:23:44 beach Suppose someone who knows nothing about this special expansion idea writes a macro that contains a CST as a constant. If the macro expander treats that constant as a CST it would change the meaning compared to what the macro author intended.

9:26:43 heisig Those two CSTs would be on different planes of existence (I hope) and couldn't be confused. Think of the CSTs used when executing such a macro as a special object representation, where objects are represented as (actual-object origin) pairs.

9:27:22 beach I have to take your word for that.

9:28:43 heisig So the only places where a confusion can happen is when such a special macroexpansion function is invoked, and when such a magic object is stored in a shared place.

9:29:27 heisig The former case is easy, the latter one worries me.

9:46:08 beach I am perfectly happy with leaving the details to you.

9:46:27 beach Meanwhile, I need to figure out a better way of explaining the register-allocation algorithm.

9:46:35 beach The current description is incomprehensible.

9:46:52 beach Some factoring is needed, so that I can reduce the number of cases.

9:53:09 heisig One thing I haven't understood yet about register allocation in SICL - why don't we start by defining a register allocation interface and providing a simple default implementation?

9:53:24 heisig Or are you working on such an interface right now?

9:55:16 beach That's pretty much what I am doing, but for x86 only. I have abstracted out EDU, and "allocate a new register". The rest is imposed by the x86 constraints.

9:58:13 heisig Could these constraints be formulated in a way that is independent of the architecture?

10:00:08 beach That is also what I am doing. I am currently translating MIR instructions such as c <- a op b into r <- r op s where a and b are lexical locations or immediate inputs, c is a lexical location, and r and s are register.

10:00:47 beach So the architecture-independent description is that the destination must be the same as the first operand.

10:03:01 beach Er, s can also be an immediate of course. But I am currently simplifying so that s can not be a stack location.

10:03:13 beach It is complicated enough as it is.

10:05:31 beach Is that abstract enough for you?

10:09:48 heisig I was thinking about using a domain-specific logic programming language. (But I admit I don't have much experience with logic programming)