freenode/#lisp - IRC Chatlog

14:01:14 mgsk jackdaniel: interesting. Thanks!

21:06:36 vms14 guys how can I make a list from the input?

21:07:05 vms14 I want symbols, but read only works for one atom or list

21:07:13 vms14 and read-line takes a string

21:07:37 vms14 I have no idea how to convert a string to a list of symbols, I just take the first element

21:08:10 aeth (intern "FOO") => FOO

21:08:15 vms14 How can I read symbols until newline and cons them?

21:08:18 vms14 aeth: tried inter

21:08:20 aeth (intern "foo") => |foo|

21:08:22 vms14 intern*

21:08:32 aeth You need to upcase if you want behavior like regular Common Lisp

21:08:57 vms14 it's nice, but it converts several atoms into one

21:09:00 aeth (intern (string-upcase "foo")) => FOO
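
A minimal sketch of the upcasing point aeth makes above, assuming the default readtable case. The helper name `string->symbol` is hypothetical, not something from the discussion.

```lisp
;; Hypothetical helper: intern NAME the way the default reader would,
;; i.e. after upcasing it, so "foo" becomes FOO rather than |foo|.
(defun string->symbol (name &optional (package *package*))
  (intern (string-upcase name) package))

(string->symbol "foo")            ; the symbol FOO in the current package
(string->symbol "Hello world")    ; still ONE symbol, |HELLO WORLD|
```

The second call shows the problem vms14 runs into next: INTERN never splits its argument, so a whole line becomes a single symbol.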

21:09:23 vms14 so I just get one symbol instead of many

21:09:48 aeth vms14: So the problem is that you're reading in a line into a string, and you're turning "Hello world" into |Hello world| instead of (HELLO WORLD)

21:09:50 vms14 I'd like to read until newline

21:09:51 aeth Correct?

21:09:55 vms14 and get a list of atoms

21:10:12 vms14 aeth: yes

21:11:17 vms14 I'm trying to parse input, I just want to read every symbol from the input until the user presses enter

21:11:52 vms14 I guess I don't need to go for read-char

21:11:58 aeth You're going to need to use LOOP here

21:12:06 vms14 I'm using a loop

21:12:37 vms14 I want to make some kind of parser for the bad html generator I try to do

21:13:01 vms14 I just want to parse input and do stuff like "h1 hi"

21:13:13 vms14 but I prefer symbols instead of strings

21:13:37 aeth vms14: One thing you could do, and it's quite a hack, is (read-from-string (read-line))

21:13:38 pjb vms14: you can use (read-from-string (read-line))

21:13:50 pjb vms14: or rather: (read-from-string (validate (read-line)))

21:13:53 vms14 the thing is usually I won't know how many symbols will be, and the delimiter is the newline

21:14:04 aeth vms14: the second value in read-from-string is where it left off so you can loop on that second value

21:14:24 vms14 what does validate do?

21:14:25 pjb vms14: or: (loop for element = (extract-one-item (read-line)) until (eof-element-p element) collect element)

21:14:33 vms14 I've tried to mix read-from-string with read-line

21:14:40 vms14 maybe I did it wrong

21:15:03 pjb vms14: using READ or READ-FROM-STRING, you allow input to do whatever it wants with your lisp image, by default.

21:15:06 vms14 I like the last one pjb

21:15:11 aeth vms14: (read-from-string (read-line)) for a line "hello world" will return (values HELLO 6)

21:15:37 pjb vms14: consider: (read-from-string "#.(delete-file \"~/your-important-file.txt\")")

21:16:30 aeth vms14: you can then do (read-from-string (read-line) nil nil :start 6) to get (values WORLD 11)

21:16:42 aeth it's really weird to see optional followed by keyword

21:16:53 pjb So you would want to bind *read-eval* to NIL. but other reader macros can be problematic: (read-from-string "#8931289312839012*") for example, could DOS your system by trying to allocate all its RAM. (or just signal a condition, depending on the implementation).
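
Putting the two suggestions together (loop on the second value of READ-FROM-STRING, with *READ-EVAL* bound to NIL) could look roughly like this. The function name `read-all-from-string` is an assumption for illustration, and the sketch only disables `#.`; the other reader macros pjb mentions are still active.

```lisp
(defun read-all-from-string (line)
  "Read every object in LINE with the Lisp reader and return them as a list."
  (let ((*read-eval* nil))               ; #. now signals an error instead of running code
    (loop :with eof := (gensym "EOF")    ; fresh marker that no input can produce
          :with start := 0
          :for (object position) := (multiple-value-list
                                     (read-from-string line nil eof :start start))
          :until (eq object eof)
          :collect object
          ;; The second value is where the reader stopped; resume from there.
          :do (setf start position))))

(read-all-from-string "hello world")     ; a list of the two symbols HELLO and WORLD
```

This still interns every symbol it reads, which is the memory concern raised right after.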

21:17:14 vms14 oh, nice hack

21:17:19 aeth yes, never trust user input.

21:17:19 pjb another thing is that reading symbols will intern them, so if there's a loop, the input could fill your memory with useless symbols.

21:17:47 pjb So you might want to intern the symbols in a throw away package that you can delete-package when you're done.

21:17:47 aeth vms14: The "correct" (safe) way to do things is to parse the string, perhaps with cl-ppcre

21:17:53 pjb (in the loop).
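
A rough sketch of the throw-away package idea, under some assumptions: the function name is hypothetical, and it returns symbol names because the symbols themselves lose their home package once the package is deleted. Input with an explicit package prefix (for example cl-user::foo) would still intern elsewhere, so this is not complete validation.

```lisp
(defun read-symbols-in-scratch-package (line)
  "Return the names of the symbols read from LINE, interning them only temporarily."
  (let ((scratch (make-package (gensym "SCRATCH") :use '())))
    (unwind-protect
         (let ((*package* scratch)       ; unqualified symbols get interned here
               (*read-eval* nil))
           (with-input-from-string (in line)
             ;; The stream itself serves as the end-of-file marker.
             (loop :for object := (read in nil in)
                   :until (eq object in)
                   :when (symbolp object)
                     :collect (symbol-name object))))
      ;; Drop the package, and with it every symbol interned while reading.
      (delete-package scratch))))

(read-symbols-in-scratch-package "h1 hi")   ; => ("H1" "HI")
```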

21:18:11 pjb or keep strings, not symbols. strings are garbage collected when lost.

21:18:15 aeth By the time you add in the validation pjb is talking about, the parse solution probably becomes more concise than the elegant solution that pjb and I both said simultaneously

21:18:25 pjb Exactly! :-)

21:18:57 pjb vms14: (split-sequence #\space (read-line) :remove-empty-subseqs t) is usually all you need.

21:19:09 aeth or split with cl-ppcre

21:19:36 aeth Which to use is debatable. split-sequence is a smaller dependency, but if you're doing additional parsing, you might be using cl-ppcre anyway

21:19:40 pjb (ql:quickload :split-sequence) (use-package :split-sequence) (with-input-from-string (*standard-input* "Hello world! How do you do?") (split-sequence #\space (read-line) :remove-empty-subseqs t)) #| --> ("Hello" "world!" "How" "do" "you" "do?") ; 27 |#

21:20:11 vms14 I prefer symbols rather to strings if possible

21:20:16 vms14 isn't it better choice?

21:20:18 aeth The third option is to split manually with position

21:20:23 pjb vms14: possibly, use uninterned symbols?

21:20:29 pjb Then they can be garbage collected just like strings.
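
For illustration, uninterned symbols can be made with MAKE-SYMBOL. The helper name below is hypothetical. One caveat worth knowing: two uninterned symbols with the same name are not EQ, so they cannot be compared the way interned symbols usually are.

```lisp
;; Hypothetical helper: build a symbol that belongs to no package,
;; so it can be garbage collected once nothing refers to it.
(defun make-uninterned (name)
  (make-symbol (string-upcase name)))

(symbol-package (make-uninterned "foo"))              ; => NIL
(eq (make-uninterned "foo") (make-uninterned "foo"))  ; => NIL, distinct objects
(string= (make-uninterned "foo") "FOO")               ; compare by name instead
```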

21:20:40 vms14 unevaluated symbols

21:20:43 pjb But since symbols have a name that is a string, they're a bigger overhead.

21:20:45 vms14 and convert them to string when needed

21:21:03 pjb So use symbols only if you need their features: interned, plist, value, function, etc.

21:21:03 nirved what is an "evaluated symbol"?

21:21:06 vms14 I was thinking symbols were cheaper than a string

21:21:19 vms14 nirved: an unquoted one

21:21:20 pjb Nope.

21:21:23 aeth If you wanted "absolutely 0" overhead, you can get that. Well, not quite 0, you'd have to track start and end positions for each substring. String/sequence functions take in start and end so you can just work like that.

21:21:36 vms14 so then just go for strings?

21:21:42 vms14 and read-line

21:22:02 aeth vms14: symbols are cheaper when they're already there

21:22:11 aeth so writing :hello and :world right in your code

21:22:24 nirved vms14: unquoted symbol can be many things at the same time

21:22:50 pjb (com.informatimago.common-lisp.cesarum.array:positions #\space "Hello world! How do you do?") #| --> (5 12 16 19 23) |#

21:24:43 vms14 thanks for the hints

21:24:52 vms14 I'll just do it with strings

21:24:58 pjb (let ((string "Hello world! How do you do?")) (loop :for start := 0 :then (1+ end) :for end :in (com.informatimago.common-lisp.cesarum.array:positions #\space string) :collect (cons start end) :into result :finally (return (nconc result (list (cons start (length string))))))) #| --> ((0 . 5) (6 . 12) (13 . 16) (17 . 19) (20 . 23) (24 . 27)) |#

21:25:07 aeth You could store positions in an array with the :element-type alexandria:array-index, which will probably round up to fixnum or "unsigned fixnum" (it will show up as some strange looking unsigned-byte size like (unsigned-byte 62)) or (in 64-bit implementations) (unsigned-byte 64)

21:25:43 pjb And then you can use (foo string :start (car pos) :end (cdr pos)) with most sequence functions to process the substrings. Or (subseq string (car pos) (cdr pos)) when you need to extract it.

21:25:52 pjb (which you may not have to).
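
As a small sketch of working with bounds instead of substrings: the two helper names are made up for this example, and BOUNDS is assumed to be a (start . end) pair like the ones in the list above.

```lisp
(defun substring= (word string bounds)
  "True when WORD equals the part of STRING delimited by BOUNDS."
  ;; STRING= compares in place; no substring is allocated.
  (string= word string :start2 (car bounds) :end2 (cdr bounds)))

(defun substring-position (char string bounds)
  "Index in STRING of CHAR, searching only inside BOUNDS."
  (position char string :start (car bounds) :end (cdr bounds)))

(let ((string "Hello world! How do you do?")
      (pos '(6 . 12)))                       ; the bounds of "world!"
  (list (substring-position #\! string pos)  ; 11, an index into the whole string
        (subseq string (car pos) (cdr pos)))) ; "world!", copied only when needed
```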

21:28:18 aeth You could also do that as two vectors or two lists, one for start position and one for end position. (I think to make the vector, the best solution would be to walk the string twice, first to get the length for the allocated vectors and then to set the elements)

21:28:21 pjb vms14: Notice that displaced arrays just abstract those (car pos) (cdr pos) bounds. So instead of subseq, you can use (make-array (- (cdr pos) (car pos)) :element-type (array-element-type string) :displaced-to string :displaced-index-offset (car pos))
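
A short sketch of the displaced-array approach, using the standard MAKE-ARRAY keyword :displaced-index-offset. The function name `displaced-substring` is an assumption for the example.

```lisp
(defun displaced-substring (string start end)
  "Return a string that shares storage with STRING between START and END."
  (make-array (- end start)
              :element-type (array-element-type string)
              :displaced-to string
              :displaced-index-offset start))

;; COPY-SEQ gives a fresh string, since modifying a literal is undefined.
(let* ((line (copy-seq "Hello world!"))
       (word (displaced-substring line 6 12)))
  (setf (char line 6) #\W)   ; change the underlying string...
  word)                      ; ...and the view reflects it: "World!"
```

Because nothing is copied, the view stays valid only as long as the original string is left in place, which is the trade-off against SUBSEQ.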

21:28:43 aeth Two lists or two vectors means you could e.g. use map with a lambda of two inputs.
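
For example, with the start and end positions kept in two parallel lists, MAP can walk both at once. The function name is hypothetical, and the positions are the ones for the example string used earlier.

```lisp
(defun substrings (string starts ends)
  "Extract one substring per (start, end) pair taken from two parallel sequences."
  (map 'list
       (lambda (start end) (subseq string start end))
       starts ends))

(substrings "Hello world! How do you do?"
            '(0 6 13 17 20 24)
            '(5 12 16 19 23 27))
;; => ("Hello" "world!" "How" "do" "you" "do?")
```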

21:29:57 vms14 there was a function to separate strings using a delimiter right?

21:30:07 vms14 like spaces

21:30:27 aeth there's two, one is split-sequence:split-sequence the other is cl-ppcre:split

21:30:34 aeth well, two popular ones

21:30:55 vms14 I should start soon using ppcre

21:31:14 vms14 regex are very important stuff, and I've read ppcre is a nice library

21:31:21 aeth the alternative is to allocate a list or vector of positions, or, as I recently noticed, two sequences instead of one

21:31:56 aeth s/cl-ppcre:split/ppcre:split/

21:33:27 vms14 Package SPLIT-SEQUENCE does not exist.

21:33:37 vms14 this is sbcl

21:34:06 aeth (ql:quickload :split-sequence)

21:34:07 vms14 just tried (split-sequence:SPLIT-SEQUENCE #\Space "A stitch in time saves nine.")

21:34:22 vms14 oh, I thought it was a standard function

21:34:27 vms14 tnx xD

21:34:57 aeth splitting isn't the standard way to think about things, the standard way to think about things is with positions, which is why every built-in (and every well-behaved library) has start/end or start1/end1/start2/end2

21:35:03 aeth (either as optional or keyword)

21:35:22 aeth s/every built-in/every sequence built-in/

21:35:39 aeth At least for strings.

21:35:55 vms14 I guess I'll end using read-char

21:36:55 aeth the easiest no-library way to do it is probably read-line and do position tracking, but read-char will probably be the most efficient solution
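
A no-library version along those lines might look like the sketch below: take a line (for instance from READ-LINE) and track positions with POSITION. The name `split-on-space` is made up here, and only the space character is treated as a delimiter.

```lisp
(defun split-on-space (line)
  "Split LINE on spaces, dropping empty pieces, using only standard functions."
  (let ((words '())
        (start 0))
    (loop
      ;; Find the next character that is not a space.
      (let ((word-start (position #\Space line :start start :test #'char/=)))
        (when (null word-start)
          (return (nreverse words)))
        ;; The word runs up to the next space, or to the end of the line.
        (let ((word-end (or (position #\Space line :start word-start)
                            (length line))))
          (push (subseq line word-start word-end) words)
          (setf start word-end))))))

(split-on-space "  h1   hi there ")   ; => ("h1" "hi" "there")
```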

21:37:41 pjb Even cl-ppcre:scan actually returns positions (any regexp library does).

21:38:38 aeth Thinking about lists can be done with splitting without a library, but only in one direction, splitting the front parts off and keeping the tail.

21:39:27 pjb Depending on the size of the string and the substrings, displaced arrays may spare a lot of RAM. However, if the substrings are short, then subseq will be more efficient both in time and space. (eg. on a 64-bit system, we can assume that strings up to 8 or 16 bytes (2-4 unicode characters) are better created rather than (list* string start end) or displaced arrays.)

21:42:07 aeth vms14 might not need a subseq/displacement at all, if it's about determining what to do based on user commands.

21:42:56 aeth e.g. a state machine could be used on read-char
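
A hand-written two-state machine over READ-CHAR could be sketched as follows. The function name and the state names :between and :in-word are assumptions for the example, and end of file is simply treated like a newline.

```lisp
(defun read-words (&optional (stream *standard-input*))
  "Read one line from STREAM character by character and return its words."
  (let ((words '())
        (current (make-string-output-stream))
        (state :between))                    ; either :between or :in-word
    (flet ((finish-word ()
             (when (eq state :in-word)
               ;; GET-OUTPUT-STREAM-STRING also empties the buffer.
               (push (get-output-stream-string current) words)
               (setf state :between))))
      (loop :for char := (read-char stream nil #\Newline)
            :until (char= char #\Newline)
            :do (cond ((char= char #\Space) (finish-word))
                      (t (write-char char current)
                         (setf state :in-word))))
      (finish-word)
      (nreverse words))))

(with-input-from-string (in (format nil "h1 hello  world~%next line"))
  (read-words in))
;; => ("h1" "hello" "world")
```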

21:43:19 vms14 it will be a bad html generator taking short strings

21:43:23 aeth ah, html

21:43:26 pjb But don't write the state machine by hand! Write a state machine compiler from a high level description!

21:43:36 aeth you said that once already but I missed it, my bad

21:44:07 vms14 yeah, I want to make a transpiler to c, starting with easy stuff like create a variable, output the value, etc

21:44:11 aeth at least you're generating html, not parsing it.

21:44:26 aeth it's easy to write valid html, and hard to accept all valid html

21:44:33 vms14 but first I want learn lisp and make a html generator as exercise

21:45:47 vms14 did you make your toy language?

21:45:56 pjb vms14: or you may have a look at: https://github.com/informatimago/lisp/tree/master/common-lisp/html-generator

21:46:09 pjb Have a look at https://github.com/informatimago/lisp/blob/master/common-lisp/html-generator/html-generators-in-lisp.txt

21:46:20 vms14 should be hard to do stuff like scoping and worse in more advanced stuff

21:46:23 aeth I have a partially complete GLSL generator so I can already essentially transpile to C if I spent a few weeks on it. Very similar syntax.

21:46:38 vms14 pjb: nah, I want to learn lisp by making this generator

21:46:42 aeth Generally, people avoid the parsing problem altogether when generating another language and just work directly in s-expressions

21:46:46 vms14 if not, I guess I would be using cl-who

21:46:52 pjb Sure but read it to choose its design!

21:47:07 pjb Well, no, you wouldn't use cl-who. Read the document!

21:47:27 vms14 aeth: right, lisp being so mutable, it's easy to make a dialect

21:47:43 vms14 but I guess it's also one of the best languages to make a parser

21:47:59 vms14 someone said prolog is better

21:48:42 aeth vms14: the problem is that 90% of the cases where you'd need parsers in other languages, people just avoid them altogether in Lisps and start with s-expressions, so there's probably less work on parsers than you might expect

21:48:51 aeth s/that 90%/that in 90%/

21:49:57 vms14 yeah, but I want to make a transpiler

21:50:13 vms14 well, I want to try it

21:50:15 vms14 xD

21:50:59 vms14 also I'm sure lisp is nice for making an IDE

21:51:10 pjb Yep.

21:51:38 vms14 atm I just want to start with html, and maybe later add stuff like some variables

21:51:39 aeth vms14: Almost every "transpiler" in Common Lisp starts with s-expressions. If you don't want to start with s-expressions, you should probably act like you're doing the exact same thing as the normal transpilers and use this as the intermediate format.

21:52:01 aeth Start with s-expression->target then do input->s-expression as the next step

21:52:42 vms14 what I had is a function wrapping the input from read-line with parens using concatenate 'string xD

21:52:48 aeth Lisp itself was written in this way. m-expressions were the next step. https://en.wikipedia.org/wiki/M-expression

21:53:00 vms14 and evaluating those expressions

21:53:24 vms14 but what I want to do with lisp atm is some sort of lex generated program

21:56:56 aeth This sort of thing in Lisp is always done in at least two stages, where the first stage parses to s-expressions and the last stage turns a direct (or near-direct) s-expression mapping into strings like (:+ 1 2 3) into "(1 + 2) + 3"

21:59:05 aeth The last stage is usually written first because it is pretty trivial.

22:00:43 aeth In fact, + is probably one of the harder ones. Mostly you just go (:foo 1 2 3) to "foo(1, 2, 3)" with the only real difficulty being the way to generate the names (e.g. does foo-bar become "fooBar"?)
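
A toy version of that last stage, as a sketch only: the function name `emit` is hypothetical, :+ is the only operator handled and is assumed to have at least two arguments, and names are just downcased (so the foo-bar question is left open).

```lisp
(defun emit (form)
  "Turn a small prefix expression into a C-like string."
  (cond ((symbolp form) (format nil "~(~a~)" form))   ; names are downcased
        ((atom form) (format nil "~a" form))          ; numbers and other literals
        ((eq (first form) :+)
         ;; Fold left: (:+ 1 2 3) becomes "(1 + 2) + 3".
         (let ((args (mapcar #'emit (rest form))))
           (reduce (lambda (left right) (format nil "(~a) + ~a" left right))
                   (cddr args)
                   :initial-value (format nil "~a + ~a"
                                          (first args) (second args)))))
        ;; Everything else is treated as a function call.
        (t (format nil "~(~a~)(~{~a~^, ~})"
                   (first form)
                   (mapcar #'emit (rest form))))))

(emit '(:+ 1 2 3))     ; => "(1 + 2) + 3"
(emit '(:foo 1 2 3))   ; => "foo(1, 2, 3)"
```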

22:29:44 grewal /scrollback goto -100

22:29:55 grewal Not again :(

22:30:13 vms14 (read-delimited-list #\Newline)

22:30:22 vms14 does not work with newline

22:30:28 t58 ACTION feels grewal's pain from here

22:30:30 vms14 ACTION cries

22:51:25 grewal vms14: Wouldn't (read-from-string (read-line)) do what you want (read-delimited-list #\Newline) to do?

22:52:00 vms14 grewal: that will just return the first atom

22:55:48 vms14 (loop while (not (char= #\Newline (peek-char))) do (push (read) input))

22:55:56 vms14 tried this, but it needs to newlines

22:56:29 vms14 two*

23:04:05 pjb vms14: you could make read-delimited-list #\newline work. For this, you need to copy the character syntax from #\) to #\newline.

23:06:55 pjb Theoretically. It still doesn't work :-( (let ((*readtable* (copy-readtable))) (set-syntax-from-char #\newline (character ")")) (with-input-from-string (*standard-input* (format nil "hello world~%How do you do~%")) (values (read-delimited-list #\newline) (read-delimited-list #\newline)))) #| ERROR: Unexpected end of file on #<string-input-stream :closed #x3020025DED1D> |#

23:12:10 vms14 https://plaster.tymoon.eu/view/1309#1309

23:12:13 vms14 seems to work

23:12:31 vms14 but it's very dirty

23:14:12 grewal Why doesn't a split function approach work?

23:14:20 vms14 it works

23:14:28 vms14 but wanted to do it without libraries

23:15:46 grewal https://pastebin.com/38HYRr1w

23:16:03 grewal Does that do what you want?

23:16:05 vms14 now I just need to change it in order to just intern the first word

23:17:05 grewal I should have used collect instead of push

23:18:06 vms14 grewal: it does not work

23:18:10 vms14 it works on your machine?

23:18:49 vms14 oh, yeah it does

23:19:01 grewal It runs, and it gives me what I expect. I don't know if it's what you expect

23:19:14 vms14 yeah, it does what I want

23:20:43 vms14 (test (read-line)) but seems to yield strange results with this

23:21:20 vms14 lol, nvm it works well

23:21:22 vms14 xD

23:21:24 vms14 tnx

23:25:40 vms14 I love how format lets you write in fill-pointer strings
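
For reference, a small sketch of what that looks like: FORMAT accepts a string with a fill pointer as its destination and appends to it. The function name `render-tag` is made up, loosely in the spirit of the "h1 hi" input mentioned earlier.

```lisp
(defun render-tag (name text)
  "Build <name>text</name> by appending to one adjustable string."
  (let ((buffer (make-array 0 :element-type 'character
                              :adjustable t
                              :fill-pointer 0)))
    ;; Each call adds its output at the fill pointer, growing the string.
    (format buffer "<~a>" name)
    (format buffer "~a" text)
    (format buffer "</~a>" name)
    buffer))

(render-tag "h1" "hi")   ; => "<h1>hi</h1>"
```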

23:26:28 vms14 and there are more things I'm missing about format, I need to practice a bit with things like ~:* and so on

23:30:14 pjb vms14: your last paste is crazy.

23:30:55 vms14 pjb: that's my code and also why I cannot say I'm a programmer

23:30:57 vms14 xD

23:31:19 vms14 but usually this happens when I'm testing stuff and patching things over and over

23:31:39 vms14 anyway it's not an excuse, I do not know yet how to write nice code

23:32:15 vms14 and my code usually is slow, walking around because I'm noob and idk the direct way

23:32:54 pjb vms14: https://plaster.tymoon.eu/view/1309#1311

23:33:00 pjb and it doesn't seem to work that well.

23:33:52 vms14 xD what I want is the input var

23:34:09 vms14 inputstring it's useless, and I shouldn't be using it

23:34:33 vms14 also I should be using let

23:35:59 vms14 I shouldn't be coding yet, but I want to get used to lisp, and the best way is coding

23:36:18 vms14 hope with the time your eyes stop bleeding with my code xD

23:36:28 grewal What do you mean by "I shouldn't be coding yet"?

23:36:51 vms14 grewal: I mean I should be reading and doing test stuff and wait a bit to make this program

23:38:03 vms14 also I still think the On Lisp book should teach me nice things, but I need to understand lisp better before this book, or I'll miss some important stuff

23:39:01 vms14 and have the PAIP waiting too

23:47:25 pjb vms14: https://plaster.tymoon.eu/view/1309#1312

23:49:30 pjb vms14: loop is nice because it's versatile. Instead of having loops for, while, until, etc, loop does everything. (loop :while … :do …) (loop :do … :until …) (loop :for i :from 0 :to 10 :do …) and other variants: (loop :while … :do … :until …) (loop :do … :while … :do …) etc.

23:50:45 pjb vms14: note that the :finally clause is jumped to as soon as one terminating clause is validated. So (loop … :until … :do … :finally …) doesn't evaluate :do when the :until condition is true.
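
A tiny sketch of that behavior, with a made-up function name: once the :until test is true, the :do clause after it is skipped and control goes straight to :finally.

```lisp
(defun collect-until-negative (numbers)
  "Return the elements of NUMBERS that come before the first negative one."
  (let ((seen '()))
    (loop :for n :in numbers
          :until (minusp n)        ; terminating clause
          :do (push n seen)        ; not run for the element that stopped the loop
          :finally (return (nreverse seen)))))

(collect-until-negative '(1 2 -3 4))   ; => (1 2)
```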

23:52:02 aeth vms14: what do you want your input to look like?

23:54:09 aeth What makes LOOP good for reading is its behavior for :for ... := ... is different than DO's behavior when you do not have an iteration step. With LOOP, it will do the thing initially and then repeat it, with DO it will only do it once so you wind up having to repeat yourself twice (once for the initial value and once for the step) unless you abstract over this with a custom macro.

23:55:39 aeth So even if you're primarily using DO and/or DO* in your coding style, this is one of those good exceptions where you should use LOOP

23:56:40 aeth (correction for the nitpickers, you repeat yourself once, which is writing the same code twice, you don't "repeat yourself twice")
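
To make the comparison concrete, here are two hypothetical functions that collect lines from a stream. With LOOP the read form is written once; with DO it appears twice, as the initial value and as the step.

```lisp
(defun read-lines-with-loop (stream)
  ;; :for ... := without :then re-evaluates the form on every iteration.
  (loop :for line := (read-line stream nil nil)
        :while line
        :collect line))

(defun read-lines-with-do (stream)
  ;; DO needs the same form as both the init and the step expression.
  (do ((line (read-line stream nil nil) (read-line stream nil nil))
       (lines '()))
      ((null line) (nreverse lines))
    (push line lines)))

(with-input-from-string (in (format nil "h1 hi~%p hello~%"))
  (read-lines-with-loop in))
;; => ("h1 hi" "p hello")
```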

23:57:10 vms14 pjb: nice code, I'll save it

23:57:17 pjb or you can write (do ((i #1=(read) #1#)) ((null i)) (print i))

23:57:38 vms14 aeth: idk, I guess what I really want is just a list of atoms being read from input

23:57:55 vms14 but I need to learn more

23:58:57 pjb And it's safe: (with-input-from-string (input " #.(delete-file \"~/.bashrc\")") (read-token-list input)) #| --> ("#.(delete-file" "\"~/.bashrc\")") |#

23:59:27 aeth vms14: Imo, you shouldn't think in terms of "list of atoms being read from input". That's eval()-style behavior (CL's EVAL is different, and eval("1 + 1") in other languages is closer to (eval (read-from-string "(+ 1 1)")) in CL)

23:59:45 aeth vms14: You should be thinking in terms of what kind of syntax you want to support, and parsing that syntax.

0:00:16 aeth None of this that we've been talking about is strictly necessary with a sufficiently restrictive syntax

0:01:53 aeth e.g. you could require the user write things like "foo 42\nbar 43\n" (replace \n with newlines in your head; IRC is limited to one-line-per-message) in which case you don't technically need any intermediate strings.
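
With a syntax that restrictive, a line can be handled in place. This is a sketch under assumptions: the function name `run-command` and the result lists are invented for illustration, and only the commands foo and bar from the example are recognized.

```lisp
(defun run-command (line)
  "Dispatch on a line such as \"foo 42\" without building intermediate strings."
  (let* ((space (or (position #\Space line) (length line)))
         ;; PARSE-INTEGER skips leading whitespace; with :JUNK-ALLOWED it
         ;; returns NIL instead of signaling when no number is present.
         (argument (parse-integer line :start space :junk-allowed t)))
    (cond ((string= "foo" line :end2 space) (list :foo argument))
          ((string= "bar" line :end2 space) (list :bar argument))
          (t (list :unknown line)))))

(run-command "foo 42")   ; => (:FOO 42)
(run-command "bar 43")   ; => (:BAR 43)
```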

0:02:03 vms14 well I'll go for strings then, and study that code later

0:02:09 vms14 thanks for the help

0:02:16 vms14 see you guys

0:02:23 vms14 <3

0:02:25 aeth bye

0:05:30 aeth (I guess reading wouldn't be unsafe if CL's read wasn't so powerful.)

0:07:19 pjb and macros wouldn't be unsafe (and worse, unhygienic) if CL's macros weren't so powerful.

0:07:33 pjb and eval wouldn't be unsafe if CL's eval wasn't so powerful.

0:07:41 pjb CL is too powerful! :-)

0:10:39 aeth I guess my point is that for untrusted user input you don't want power, so you wind up having to write your own (or use a library) functionality. Shortcuts here are bad.

0:12:22 aeth read-line vs. read-char is up to you (unless you *need* to not hang, then you have to use read-char-no-hang)

0:13:25 aeth read-char/read-char-no-hang will probably use a FSM.