freenode/#clim - IRC Chatlog

3:28:22 beach Good morning everyone!

3:38:37 nyef Hello beach.

3:58:36 beach I recently saw some very real evidence of the result of Kahneman as reported in his book "Thinking, fast and slow", i.e. that the fast module is lousy with things like orders of magnitude and statistics, and the slow module is lazy so it tends to believe the fast module. The evidence I saw makes me very convinced that a large number of wrong decisions are made with respect to software development.

3:58:44 beach In this case, I mentioned that I plan to track source information in SICL by keeping a string containing the text of each source file in memory with the code, and I got a pretty violent reaction.

3:58:46 beach After some verification, it turns out that the entire source code for the Lisp machine is around 50MB, which represents 20 cents of RAM. For the SICL source code, it's 6.4MB or around 3 cents. The fast module had obviously estimated several orders of magnitude more and the fast module didn't bother to check.

3:59:11 beach ... the SLOW module didn't bother to check.
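
The arithmetic behind those figures can be checked with a short sketch. The price per megabyte is not stated in the log; it is an assumption derived here from the "50MB is about 20 cents" figure, and the names below are made up for illustration.

```python
# Figures quoted in the log: about 50 MB of Lisp machine source
# corresponds to about 20 cents of RAM.
LISP_MACHINE_SOURCE_MB = 50
LISP_MACHINE_COST_CENTS = 20

# Implied price of RAM (assumption: derived from the two figures above,
# not a price given in the log).
CENTS_PER_MB = LISP_MACHINE_COST_CENTS / LISP_MACHINE_SOURCE_MB


def ram_cost_cents(size_mb):
    """Cost in cents of keeping size_mb megabytes of source text in RAM."""
    return size_mb * CENTS_PER_MB


# SICL source size quoted in the log: 6.4 MB.
sicl_cost = ram_cost_cents(6.4)
print(f"SICL source in RAM: about {sicl_cost:.2f} cents")
```

At the implied price of 0.4 cents per megabyte, the SICL figure rounds to the 3 cents mentioned above.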

4:01:12 beach The advantage of keeping the source code with the executable code is, of course, that even if the source file was not delivered or installed, or if it was modified a posteriori, I can still have precise error reporting with respect to source location.

4:03:16 beach I am intimately convinced that many wrong decisions about what optimizations to implement are based on a hunch delivered by the fast module, and that if the slow module had bothered to check, it might turn out that the potential gain is very small and that it would be better to spend that energy on more profitable work.

4:05:39 beach Hmm, I might get a chapter in my book about the sorry state of the software industry out of this kind of material.

4:20:24 nyef Note that storing the source code with the running image is one short step from treating the version of the source code in the image as the master copy, which we already discussed as being a Bad Idea.

4:20:59 nyef (I'm not saying "don't store the source code with the running image", I'm saying "here's a line that almost certainly shouldn't be crossed".)

4:21:23 beach It will be stored with each function, so that it will be the true version that the executable code was compiled from. That's the only usage.

4:21:36 beach Yes, I see what you mean.

4:22:53 beach When EDIT is called, the in-memory copy should not be modified. Instead, the file should be looked for, compared to what is in memory (and warnings issued if they are not the same), and then the file should be loaded into an editor buffer.
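
The EDIT behaviour described here can be sketched roughly as follows. This is an illustration in Python, not SICL code: `buffer_text_for_edit` and its arguments are hypothetical names, and returning nothing when the file is missing is an assumption, since the log only says the file should be looked for.

```python
import warnings
from pathlib import Path


def buffer_text_for_edit(stored_source, path):
    """Return the text to load into an editor buffer.

    stored_source is the read-only copy kept with the compiled code.
    It is only compared against, never modified and never edited.
    """
    source_file = Path(path)
    if not source_file.is_file():
        # Assumption: with no file to edit, warn and return nothing.
        warnings.warn(f"source file {path} not found")
        return None

    on_disk = source_file.read_text(encoding="utf-8")
    if on_disk != stored_source:
        # The file changed after compilation: warn, but still edit the file.
        warnings.warn(f"{path} differs from the source stored with the code")
    return on_disk
```

The editor buffer always gets the file's contents, so the in-memory copy stays the exact text that the executable code was compiled from.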

4:24:50 beach And this scenario should be respected for a LispOS as well. There are no "files" there perhaps, but there is an editable object for the source and there is this read-only copy of it that gets associated with the executable code.

4:25:24 nyef Yes.

4:25:52 nyef I'm not convinced about "no files", but I'm happy to entertain "files are not necessarily bags of bytes".

4:26:01 beach So storing the source code with the executable is just a very cheap way of improving the debugging experience.

4:27:27 beach I didn't say "no files" I said no "files". :)

4:27:32 nyef There's still a scenario where total "memory" usage is an important consideration. And that's low-bandwidth data transfer.

4:27:36 nyef Heh.

4:27:44 nyef "No files, use rasps instead."

4:27:52 beach Heh.

4:29:13 beach Well, my main point with this rant was not about memory usage, but about how the fast module can get it wrong by several orders of magnitude. The ultimate decision about memory usage depends on situations like the ones you mentioned, and whether the creator of the software cares about such situations.

4:29:36 nyef Agreed, then.

4:48:03 beach Another one of my favorite situations where the fast module gets it wrong is (but I have mentioned this situation several times in the past) when a project decides to use C++ "because we need all the speed we can get", but they then do not take into account the huge amount of additional developer time that it will take for them to actually get that kind of speed.

4:49:12 beach Again, I am not saying that C++ is always the wrong decision. I am saying that the decision is often made by the fast module, without taking important considerations such as developer time into account.

4:56:00 loke You might not say that, but I say that C++ is pretty much always the wrong decision, regardless of your requirements.

4:57:16 nyef When is C++ not the wrong decision? When the requirement is "interfaces with C++". A better decision, however, is "renegotiate the requirements".

4:59:14 slyrus I thought the Kahneman Fast and Slow thing failed to reproduce

4:59:22 loke ACTION wonders how often the requirement is "interfaces with C++" and at the same time it can't be solved with a C++ shim.

5:00:09 slyrus ACTION shows up late to the party

5:03:12 beach slyrus: What do you mean by "failed to reproduce"? The research results? I don't think so. They have been quoted time and again, and they have been verified by other researchers.

5:04:11 jackdaniel hello o/

5:04:17 beach Hello jackdaniel.

5:05:14 jackdaniel beach: I very much like the idea of bundling the source code with the image

5:05:24 beach slyrus: Kahneman even got the "Swedish national bank's economics prize in memory of Alfred Nobel" (usually incorrectly referred to as the "Nobel prize in economics") for his work.

5:05:32 jackdaniel although I'd keep the possibility built-in to not bundle it

5:05:43 jackdaniel (preferably as non-default behavior)

5:06:18 jackdaniel on the other hand, even if developers find it desirable, the possibility to hide code mostly encourages bad hacks

5:08:32 beach jackdaniel: There is only one situation I can imagine where it is worth not saving the source, namely when the Common Lisp system is used for commercial development so that the source code should not be distributed. But nyef cited some other considerations that are not high-priority for SICL.

5:09:01 loke Memory-constrained systems perhaps?

5:09:24 loke Having used ECL running on the LEGO mindstorms controller, I know that very well :-)

5:09:39 jackdaniel I don't think SICL aims for that as well

5:09:54 loke Fair enough.

5:09:54 beach loke: In such a system, it would be way more efficient to work on code size.

5:10:18 beach ... i.e. size of compiled code. Like perhaps use a bytecode interpreter.

5:13:50 slyrus beach: http://retractionwatch.com/2017/02/20/placed-much-faith-underpowered-studies-nobel-prize-winner-admits-mistakes/

5:14:59 beach slyrus: Interesting.

5:19:24 beach loke, jackdaniel: My point can be made like this: The SBCL default heap size is 1GB. The total amount of source code in an SBCL distribution seems to be 0.5MB, but to make it simple, let's say 1MB. That is 0.1% of the heap size. If anyone has a small memory to be concerned about, they had better do something else, like remove less important modules.

5:21:48 jackdaniel I agree with that

5:21:55 loke beach: My example was referring to the LEGO thing

5:22:04 loke it has 256 MB of RAM I think

5:22:17 loke Or was it 128?

5:22:18 beach loke: So for that you would use SBCL without the source in memory?

5:22:27 loke beach: I don't use SBCL with it. I use ECL

5:22:50 loke It took many hours to compile ECL on it

5:23:23 jackdaniel loke: hint: you may strip ECL modules even further with unix `strip' command (for native compilation)

5:24:11 loke jackdaniel: That only saves disk space though, right? I mean, as far as I know the link loader doesn't map the symbol and debug segments when you run a program.

5:24:46 jackdaniel hm, maybe, I thought it loads the whole module into the memory, but you may be right

5:25:16 jackdaniel probably I should add it to the compiler, that if optimize space equals zero, strip is called

5:26:56 beach slyrus: Thanks for pointing that out to me. The contents of his book are still confirmed by my own observations, even though they may not be scientifically verifiable. But I should be more careful citing his work.

5:27:13 jackdaniel loke: in any case I advise against using sstrip (super-strip), because it breaks the interactive development of such modules

5:29:01 loke jackdaniel: You mean optimise space is _not_ zero, right? Doesn't zero mean "I don't care about size", and 3 mean "I really, really care about the size"?

5:29:34 jackdaniel right, I mean for space equals 3

5:30:24 jackdaniel so if the user really cares about the space (probably space=3 ^ debug<=1)

8:33:38 flip214 think of a sudoku GUI.... if I have some constant defined to 3 (the typical sudoku case),

8:33:59 flip214 the GUI should then have 9x9 panes (and a few others)

8:34:47 flip214 should I do these from a tabular layout? or have them created via #. in the (:panes ...) of define-application-frame?

8:35:35 flip214 have a single pane there that uses my own function to draw? this has the disadvantage that I'd have to receive and parse mouse clicks, and determine the correct cell from the coordinates, etc.

8:36:23 flip214 not sure what the best way would be ... right now I'm thinking about #. in (:panes ...) and (:layouts ...)

8:36:30 flip214 ideas and suggestions welcome

8:37:09 beach I would use a grid of presentations.

8:37:39 beach Each presentation would then have an underlying object that would be recovered automatically when clicked upon.

8:39:07 flip214 beach: thanks for the hint... grepping for grid gives me Documentation/Guided-Tour/simple-spreadsheet.lisp, which sounds good.

8:39:12 flip214 thanks a lot, will investigate!

8:40:07 beach By "grid", I meant the generic term. Let me look up the real term.

8:40:29 jackdaniel flip214: you may check out the puzzle demo in Examples/

8:41:25 beach flip214: Table Formatting, chapter 17.

8:42:48 flip214 beach: yeah, I already saw that. I'm not sure whether I want to put buttons, input fields, or something else in each cell location...

8:43:08 beach I recommend a presentation.

8:43:12 flip214 I'll need a (blinking?) cursor, so the input sounds like an easy first match

8:43:15 flip214 okay.

8:43:31 flip214 will investigate, thanks for the pointers!

8:45:36 beach I have a minimalist Sudoku. Do you want me to give it to you, or do you want to figure this out for yourself?

8:45:52 beach I should put mine up on GitHub either way I guess.

8:48:58 flip214 well, of course I'd like to play around myself!

8:49:04 beach Sure.

8:49:29 flip214 but since the space of possible solutions is so big, and personal time so limited, having others provide suggestions is very valuable...

8:49:31 beach And I had a group of students work on an algorithm for constructing grids. It was amusing, because it would use the various strategies that humans use in order to make sure the grid could be solved using such strategies. Different sets of strategies gave different levels of difficulty.

8:49:38 flip214 that's what I'm here for, anyway!

8:49:46 flip214 so any example might be a good thing.

8:50:04 beach I'll put it up on GitHub and you can do what you want with it. OK?

8:50:25 flip214 yeah, that's what I'm afraid of... doing such a thing "by hand" involves quite a big solution tree to investigate on its own.

8:50:29 flip214 beach: thanks a lot...

8:51:38 flip214 I'll try to put my solver/explainer into your code, perhaps you're interested in it later on.

8:51:59 beach https://github.com/robert-strandh/Sudoku

8:52:16 flip214 great, thanks!

8:52:23 beach Sure, I am interested in it.

8:52:39 beach So it is a solver? You feed it a published grid and it solves it?

8:53:24 flip214 yeah, right. solves step-by-step and explains why some cell must contain some number.

8:53:39 flip214 ah, I see... your version is hardcoded for 3x3

8:54:00 beach Excellent! If we join forces we have all three aspects: A GUI, a solver/explainer, and a grid creator.

8:54:02 flip214 I've got that currently via hunchentoot, but I'm not that happy about that version

8:55:01 beach I have some additional code in a different place. I should try to merge it into the newly created repository.

8:55:28 beach This one also needs an ASDF system definition, etc.

8:55:32 flip214 there are quite a few different sudoku.lisp in the various subdirectories... is there one that I should take?

8:56:53 beach Oh, I am confused. I must have either failed to create a repository because I had one, or I did something else wrong.

8:57:59 flip214 no problem.... take your time sorting that out, I'll have to leave for Family time anyway.

8:58:09 beach I have NO IDEA what just happened here.

8:58:23 flip214 BLACK MAGIC

8:58:28 flip214 is it full moon already?

9:01:07 beach Oh, I see.

9:01:10 beach *blush*

9:04:42 beach flip214: Anyway, plenty of versions to play with. :)

9:17:04 beach flip214: You know what we should do? We should come up with a protocol for the grid. That way, we can experiment with different ways of implementing its graphic layout, and we can write solvers etc. independently of the way it is ultimately presented.

9:17:50 flip214 beach: is there a version that allows non-quadratic 9-cell ranges, too?

9:18:19 beach I don't know of any such version.

9:18:37 flip214 like http://www.cross-plus-a.com/data/sudoku3.gif

9:19:16 beach Nice. First time I see something like that.

9:19:20 flip214 ie. the total area is still a 9x9, but the ranges are horizontal/vertical, and a connected 9-cell range

9:19:34 beach Yes, I think I understand.

9:20:35 flip214 re protocol for the grid: the minimal input for me is a 9x9 array.

9:20:37 beach A protocol would have to take such layouts into account.

9:21:02 flip214 If I can store more data persistently (like possible values for a cell), the next solving step would be accelerated, of course...

9:21:13 flip214 but for interactive use it doesn't matter

9:21:55 flip214 so, as long as there's a button that calls my function with the 9x9 array, I'll just return the new array and an explaining message

9:22:19 flip214 although it would be nice to be able to colour the cells involved in this step, so that the reason is easier to see

9:22:22 beach Are there layouts where a cell does not have to be connected?

9:22:34 flip214 like "this row needs a 9, and this is the only place left for it"

9:23:06 flip214 I have only seen connected ranges so far.

9:23:14 flip214 oh wait, wrong.

9:23:48 flip214 I saw such as that, too: http://www.sachsentext.de/gif/plum1.gif

9:24:10 beach Nice.

9:24:18 flip214 I guess the colors determine the range, again.

9:24:41 flip214 but this is just a normal layout, where rows and columns are moved around. nothing special.

9:24:56 flip214 you can shift rows and columns back to a "normal" 3x3 => 9x9 layout.

9:25:19 beach Yes, I see.

9:27:40 beach My hunch is that it is possible to come up with solvers etc that only take into account the fact that an elementary cell belongs to three different (say) aggregates.

9:28:11 beach Again, my hunch is that the rest is just a matter of organizing the elementary cells visually.

9:29:37 beach flip214: Thanks for broadening my views. It will give me something to think about when I am bored with my work.

10:09:19 flip214 beach: yeah, exactly. My solver takes the 3 different sets of cells that one cell belongs to, and derives new knowledge from that.

10:46:41 beach And it doesn't care whether the sets are rows, columns, squares, or something else?
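
The idea that a solver only needs to know which aggregates each cell belongs to can be illustrated with a small sketch. It is written in Python for brevity and is not taken from either participant's code: the grid representation (a dict from `(row, column)` to a digit, with 0 for empty) and the names `standard_units`, `candidates`, and `hidden_single` are assumptions made for this example. The single step implemented is the "this row needs a 9, and this is the only place left for it" kind of deduction.

```python
from itertools import product


def standard_units(n=3):
    """Rows, columns and n-by-n boxes, each a frozenset of (row, col) cells."""
    size = n * n
    rows = [frozenset((r, c) for c in range(size)) for r in range(size)]
    cols = [frozenset((r, c) for r in range(size)) for c in range(size)]
    boxes = [
        frozenset((br * n + r, bc * n + c)
                  for r, c in product(range(n), repeat=2))
        for br, bc in product(range(n), repeat=2)
    ]
    return rows + cols + boxes


def candidates(grid, cell, units):
    """Digits not yet used in any unit that contains cell (0 means empty)."""
    size = len(units[0])
    used = {grid[other] for unit in units if cell in unit for other in unit}
    return set(range(1, size + 1)) - used


def hidden_single(grid, units):
    """Return (cell, digit, unit) when digit has only one possible place
    left in unit, or None if no unit allows that deduction."""
    size = len(units[0])
    for unit in units:
        present = {grid[cell] for cell in unit}
        empty = [cell for cell in unit if grid[cell] == 0]
        for digit in range(1, size + 1):
            if digit in present:
                continue
            places = [cell for cell in empty
                      if digit in candidates(grid, cell, units)]
            if len(places) == 1:
                # The unit is returned so a GUI could highlight the reason.
                return places[0], digit, unit
    return None
```

Nothing in `candidates` or `hidden_single` refers to rows, columns, or squares. An irregular layout would only need a different list of units, each still a set of cells, while the visual arrangement stays a separate concern.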

11:12:26 nyef G'morning all.

11:12:36 nyef Sudoku with irregularly-shaped subregions, huh?

11:13:43 nyef I play a variant with irregularly-shaped subregions and instead of the grid being seeded with numbers, there are *more* subregions of varying sizes marked with the total of all the digits inside.