freenode/#lisp - IRC Chatlog

0:51:36 mobile_c is it true that parsing grammar can be done easily in lisp

0:51:53 didi mobile_c: Specially if its lisp.

0:51:59 didi it's*

0:52:17 mobile_c o.o

0:52:19 mobile_c how

0:52:25 didi (read)

0:52:33 mobile_c ?

0:52:38 didi Not _you_ read, but the function READ.

0:52:47 mobile_c ._.

0:52:51 didi It's that simple.

0:53:14 didi Really, I'm not messing with you.

0:53:24 malice You're just cheating

0:53:26 mobile_c how tf does read parse grammar ._.

0:53:41 malice didi lied to you(a bit)

0:53:41 didi malice: Indeed, I'm using lisp.

0:53:47 didi I did not.

0:53:49 malice by lisp he meant Common Lisp

0:53:55 didi Indeed.

0:54:06 elderK mobile_c: Read itself converts the characters and stuff into a tree of lists and things. READ is fundamental.

0:54:18 malice (read) won't parse Scheme

0:54:22 elderK If you're used to parsing like, things from another language like C, you will have to roll that mechanism yourself.

0:54:25 malice (read) will accept Common Lisp code though

0:54:26 elderK And like they say, this only parses Lisp.

0:55:03 malice Is it easy to parse any grammar though?

0:55:22 malice What would you use for that?

0:55:24 mobile_c how can i parse something like this https://paste.pound-python.org/show/K4QGpELlMYw0VWUjbl37/ using lisp

0:56:10 elderK mobile_c: If you'd like to learn about how the reader works, you can find out here: http://clhs.lisp.se/Body/02_.htm

0:56:34 didi I think we are dismissing READ too quickly. It's amazing that we can do it. We should praise it more.

0:56:58 aeth mobile_c: (loop :for line := (read-line file) :while line :do ...)

0:57:00 elderK didi: I agree :) And it's very flexible :D

0:57:17 aeth mobile_c: You could also do it character-by-character, but that's harder

0:57:19 malice mobile_c: why do you want to parse it? what is your expected return value?

0:57:52 aeth oh sorry, it's (read-line file nil)

0:57:54 elderK mobile_c: To parse something like that, you'll probably do a lot of stuff you're used to doing in other languages if you've parsed by hand. Or, you can learn to use one of the many parser-generator libraries available for Lisp.

0:59:05 malice mobile_c: also are there any properties that you'd expect your parser to have?

0:59:10 mobile_c as i want to parse it like a parser grammar (since technically at the moment it is very similar to one)

0:59:43 mobile_c eg REG = 0000|0001|0002

0:59:45 mobile_c REG

1:01:31 malice With problem definition like this, I'd take a look at parser generators: https://www.cliki.net/parser%20generator

1:02:47 mobile_c as the main problem is figuring how how to parse it like a rule definition/rule expamsion

1:03:10 mobile_c or rather implement rule definition and rule expansion and identification

1:03:18 malice ?

1:03:22 mobile_c identification and expansion*

1:03:27 elderK mobile_c: If you want to do it by hand, you could just write a simple lexer and recursive descent parser :)

1:03:39 elderK But like malice said, check out the libraries.

1:03:54 mobile_c as at the moment im parsing it using a parser

1:03:59 malice (also note that the site could use updating; some of the entries are 404 and there are probaly a couple of new ones not listed)

1:05:34 malice mobile_c: I'm afraid I don't understand the problem well enough to suggest an optimal solution. One of the things I do not understand is the need for Lisp parser

1:05:40 malice but that might be just curiosity

1:06:14 malice then I do not understand the goal - do we want any parser, some specific parser, what representation of AST should we produce, how do we handle the errors, etc.

1:07:06 mobile_c idk, a friend said this would be easy with lisp

1:07:38 malice you can ask your friend! ;)

1:08:59 malice although writing your own parser won't be much different from other languages, I guess.

1:10:02 malice also wow, the parser generator section sucks

1:10:08 malice half of the links are dead

1:10:16 malice and no really great solutions there

1:11:22 rpg malice: are you looking at cliki?

1:11:43 malice yes

1:12:02 malice rpg: I pasted the link few lines above

1:12:06 aeth keep in mind that cliki is probably 15 years old, and not as popular as random github pages like https://github.com/CodyReichert/awesome-cl these days

1:12:11 aeth So it will be somewhat stale

1:12:14 rpg malice: just re-logged in

1:12:32 malice rpg: sure. I meant this page: https://www.cliki.net/parser%20generator

1:12:52 malice aeth: good note. I also keep forgetting about those random github pages

1:12:54 rpg I've used cl-yacc -- it wasn't a great experience.

1:13:13 rpg Not horrible, but not great.

1:14:56 malice mobile_c: there's also rdp generator here: http://www.informatimago.com/develop/lisp/index.html

1:16:11 aeth if it's on informatimago it's probably AGPL so keep that in mind

1:24:26 elderK Guys, what is a good way to test reader macros?

1:24:36 elderK Like, to check they correctly expand to what I expect? :)

1:25:31 rocx what all would you use a reader macro for?

1:26:08 didi Lambdas! #L(if (oddp _) (1+ _) _)

1:26:09 elderK I'm writing a basic quasiquote expander for learning purposes.

1:26:26 elderK So I want to have my own "short terms" for ` and , and ,@

1:26:34 elderK So that they hook into my expander, rather than the CL one.

1:27:15 elderK As far as I am aware, I can't really macro-expand a reader macro?

1:29:39 aeth What happens if you put a ' in front of a reader macro?

1:30:18 elderK Nothing?

1:30:38 elderK Like, '`a gives an error :)

1:30:46 aeth strange

1:31:00 aeth For read-eval you can do this: '#.(list 1 (list 2 3) 3)

1:31:17 elderK My bad, REPL, `'a works

1:31:20 elderK Gives me `a

1:33:11 aeth I think that for a reader macro as long as it returns (turns into?) one thing you can just quote it, but I could be wrongly generalizing from read-eval.

2:46:31 antonv I have a dilemma

2:48:15 antonv a library (an ASDf system) shoudl chose a dependency (another ASDF system) based on what OS / distro it runs on

2:48:52 antonv simply speaking, depending on OpenSSL version installed, we should choose an FFI wrapper to load

2:49:45 antonv HHow to describe that in ASDF?

3:00:05 elderK thanks aeth

3:41:58 fiddlerwoaroof antonv: (:component "foo" :if-feature :darwin)

3:43:35 fiddlerwoaroof however, if it's something like "which openssl version is installed", you might have to do a bit of work to get the features setup appropriately.

3:44:06 fiddlerwoaroof ... I guess he's left

3:44:14 fiddlerwoaroof minion: memo for antonv: (:component "foo" :if-feature :darwin)

3:44:14 minion Remembered. I'll tell antonv when he/she/it next speaks.

5:33:30 shka_ good morning

5:51:30 elderK Moin shka_

6:02:17 LdBeth Good evening

6:13:22 fiddlerwoaroof ** NICK hhgbot

6:14:18 hhgbot ** NICK fiddlerwoaroof

8:03:35 beach Good morning everyone!

8:16:34 esrse good morning

8:30:38 beach esrse: Are you new here? I don't recognize your nick.

8:31:19 shka__ beach: tell me if you have shorcut in emacs to say exactly the above? ;-)

8:31:37 beach Heh. I do not. Maybe I should.

8:31:47 shka__ :-)

8:41:47 fiddlerwoaroof morning beach

9:09:42 hhdave_ ** NICK hhdave

9:45:07 p_l 'morning

9:52:46 beach Hello p_l.

9:52:53 ogamita elderK: to test reader macros easily, you can use read-from-string: (read-from-string "`(foo \"string\" ,x)") #| --> (list* 'foo (list* "string" (list x))) ; 18 |# be sure to escape double-quotes and backslashes!

9:55:29 ogamita aeth: when you prefix a reader macro by a quote, this prevents what is read to be evaluated. So it should print what has been read. Unfortunately, the pretty printer, and even the printer, will often print some objects in a special way. For example: (prin1-to-string '(function foo)) #| --> "#'foo" |# instead of printing as a normal (function foo) list. You can use your own printing function to avoid this caveat, eg. (print-conses

9:55:29 ogamita '(function foo)) #| (function . (foo . ())) --> #'foo |# ; notice how the result after --> is printed by cl:print.

10:11:24 elderK ogamita: Hey! Thanks! I discovered that on my own ;)

10:11:27 elderK * :)

10:23:22 ogamita elderK: another trick when you are developping a reader macro is to use 'my-reader-macro instead of #'my-reader-macro in set-macro-character or set-dispatch-macro-character.

10:23:41 ogamita elderK: with ' when you redefine the reader macro, it's taken into account immediately.

10:40:17 elderK Ah nice, so the symbol my-reader-macro is coerced to a function?

10:40:33 elderK symbol-function is called, is that it?

10:41:43 no-defun-allowed Yep.

10:42:14 no-defun-allowed The symbol adds redirection so the name is looked up during a funcall instead of being handed the old function object.

10:42:39 ogamita it's used with apply or funcall, so a symbol denotes the global function of same name. (actually, symbol-function is used).

10:48:43 elderK Handy :)

10:49:01 elderK Is there any particular reason to say #'name rather than just 'name for say, apply or reduce or whatever?

10:50:20 jackdaniel elderK: if you have (flet ((name () "foo")) …) then 'name will refer to a global function definition, while #'name will refer to the local one

10:51:28 jackdaniel also #'foo gives you a function itself, so if you (let ((foo #'foo)) (loop (funcall foo))), it will always call the same function when looping (even if you redefine it in a different thread)

10:53:57 elderK jackdaniel: Thanks :) That's good info :)

11:03:25 jmercouris how might I go by sorting a list of strings alphabetically?

11:03:53 elderK :D make a higher order sort function that accepts an ordering predicate? :D

11:04:19 jmercouris I'm not looking to reinvent the wheel

11:04:22 jmercouris I'm sure this has been done before

11:04:54 elderK I guess you might want to look at Alexandria. Maybe it includes a sorting function.

11:05:00 elderK Also, HI jmercouris! :)

11:05:15 jmercouris elderK: hello!

11:06:08 jmercouris I did look in Alexandria, its possible I missed something though

11:08:00 jackdaniel try this:

11:08:00 jackdaniel (sort (list "abc" "aab" "cab" "baa")

11:08:00 jackdaniel #'(lambda (seq1 seq2)

11:08:00 jackdaniel (uiop:lexicographic< #'char<

11:08:01 jackdaniel (coerce seq1 'list)

11:08:03 jackdaniel (coerce seq2 'list))))

11:08:46 jackdaniel you probably want to replace lambda and uiop piggyback with your own function working on strings

11:14:35 elderK I should've known it was in the language :)

11:14:43 elderK jmercouris: You might want to check out Zeal

11:14:53 elderK It's a documentation viewer but like, super-searchable.

11:15:00 elderK It's made grappling with the CLHS much easier for me.

11:15:14 elderK Searching lisp:sort for instance, shows sort and stable-sort in the CLHS :)

11:15:26 elderK If you have trouble like, navigating the CLHS as it is online, Zeal could really help :)

11:15:32 jmercouris elderK: UIOP is not part of the language

11:15:43 elderK sort is

11:15:49 elderK And so is stable-sort.

11:15:51 jmercouris I navigate the CLHS locally, and UIOP is not at all aprt of the language

11:15:54 jmercouris not even "sort of"

11:16:09 jmercouris jackdaniel: thanks, I didn't know about uiop:lexicographic!

11:16:11 jmercouris very useful!

11:16:14 elderK http://clhs.lisp.se/Body/f_sort_.htm

11:16:51 jmercouris I know about sort, did know how to write the predicate to compare two strings and see which one has alphabetical precedence

11:16:57 jmercouris s/did/didn't

11:17:40 White_Flame (sort list #'string<=)

11:18:06 elderK White_Flame: Is that case sensitive?

11:18:09 jmercouris even better!

11:18:28 jmercouris I wonder, why does uiop:lexicographic exist then?

11:18:29 jackdaniel ACTION is embarassed now :) 1+ White_Flame

11:18:31 White_Flame clhs string<=

11:18:31 specbot http://www.lispworks.com/reference/HyperSpec/Body/f_stgeq_.htm

11:18:33 jmercouris is string<= platform dependent?

11:19:05 jmercouris I can't think of why else fare would include such a function in uiop

11:19:06 jackdaniel jmercouris: lexicographic in uiop is there for lists, that's why I did coerce them

11:19:11 White_Flame there are some forms that ignore case

11:19:29 jackdaniel i.e versions (1 2 23) (1 4 14)

11:19:57 jmercouris I see

11:20:00 elderK ACTION nods

11:20:07 White_Flame elderK: string< is case sensitive, string-lessp is case insensitive

11:20:17 elderK White_Flame: Thanks :)

11:20:31 elderK Just like char= vs char-equalp

11:21:28 White_Flame the inequality tests are based on char< etc, which do defer to implementation specifics

11:21:51 jmercouris indeed, so implementation specifc, that is okay for my use case though

11:24:02 elderK jmercouris: string<= doesn't seem to be implementation-specific. Although I guess like, if you like, want to compare unicode strings in an ASCII-only Lisp, I guess would hit trouble.

11:24:39 White_Flame it relies on character codes, which used to not be very standardized, and thus implementation specific

11:24:57 White_Flame but now with unicode, they'll tend to be compared by unicode codes

11:25:15 White_Flame but technically free to declare whatever codes they want

11:25:51 White_Flame (with a few constraints on a..z, A..Z, 0..9, etc)

11:26:10 White_Flame constraints on their relative order, not on their specific code

11:28:01 elderK White_Flame: Right, so it's "super portable" only if you stick to like, the "standard ASCII" stuff, right?

11:28:25 White_Flame if you're on modern platforms, you'll tend to be okay

11:28:29 elderK As soon as you say, have any kind of international characters - you need to either use an implementation that supports Unicode for its stuff, or implement your own predicates, right?

11:28:43 elderK Say, if you were adding unicode support to a CL that didn't support it natively.

11:29:16 White_Flame all the major CL implementations support unicode natively, afaik

11:29:36 zmv ** NICK notzmv

11:30:09 White_Flame if they didn't, I dont' believe you'd be able to use character types to represent unicode characters. (but I wouldn't bet my life on it)

11:31:04 White_Flame ah, there's CHAR-CODE-LIMIT, which things will refuse to work with if you go outside fo

11:31:49 elderK White_Flame: Right. So, much like in other languages that don't natively support Unicode strings, you'd have to implement your own stuff.

11:32:09 elderK That's no major issue though. At least, not if you just want codepoints. Unicode, while annoying, is easy enough to decode.

11:32:14 elderK :P It's the other stuff to do with it that is hard.

11:32:36 elderK Like, normalization or mapping between cases and stuff.

11:32:39 elderK Thanks for the info, White_Flame.

11:32:46 Bike you could use flexi streams or the like to manipulate things to some extent, if there were no characters past ascii or anything. it would suck though.

11:33:49 elderK Bike: How do you mean? :) I'm unfamiliar with flexi-streams.

11:34:07 elderK I imagine as long as you can read binary, you can read Unicode. Just to varying degrees of "annoying."

11:34:08 elderK :)

11:34:23 Bike well, right, it takes care of that. but you wouldn't be able to manipulate the result as lisp strings.

11:35:27 elderK How would you add support if you so wanted? Would you have to go as far as creating like, your own types and predicates and everything?

11:35:46 Bike you'd have to redo a lot of the standard library. it would be silly.

11:35:56 Bike the set of characters is determined by the implementation and can't be extended by users.

11:35:56 elderK I guess you would. Maybe implement a code-point type, create predicates for that, then define "unicode strings" on that.

11:36:04 elderK ACTION nods

11:36:29 White_Flame basically, dig into the implementation and send a pull request when you're done :-P

11:36:47 elderK It kind of seems like an oversight, not allowing some way for users to extend this.

11:36:52 elderK I guess it makes sense. But, still.

11:36:59 jmercouris Oversight? not extendable?

11:37:01 White_Flame it was defined by the OS platform in the past

11:37:04 jmercouris are we talking about the same language?

11:37:27 elderK jmercouris: As far as I am concerned, at least compared to the languages I usually deal with, CL is pretty well built and is quite flexible.

11:37:35 elderK So yeah, to me, it seems like a surprising oversight.

11:38:01 elderK Then again, it does not surprise me.

11:38:13 jmercouris maybe I'm just in an argumentative mood, but it doesn't seem like an oversight to me

11:38:44 elderK Taken in context, it doesn't. I mean, shit, C's support for strings of any kind is kind of crap :P

11:38:53 elderK It doesn't support Unicode that well either unless you roll your own abstractions.

11:39:13 elderK And White_Flame is right: Back then, well, it was by the platform. Maybe you had code-pages or something, maybe not.

11:39:20 elderK So perhaps you're right, jmercouris.

11:39:27 elderK Maybe it only seems like an oversight in hindsight :D

11:39:50 White_Flame and as Bike listed, this decision on what the character encoding is is wound up in all sorts of ways in character & string handling

11:40:16 jmercouris hindsight is always 20/20 and, as far as I understand, it is mostly up to the implementation to decide

11:40:21 jackdaniel for instance extending valid character set would require implementations to allow specializing stream encoders and decoders

11:40:35 jmercouris which makes it hardly something the language design imposes upon its usage

11:40:40 jackdaniel and that would requier standarizing a lot of things which are not

11:40:40 White_Flame the ability to define new character code ranges would mess with the low level byte representation of internal characters/strings, which is beyond the recompilation sensibilities of the day

11:40:53 elderK True.

11:41:14 elderK Still, I guess it only really matters if you're reading stuff that you intend to evaluate or something anyway, right?

11:41:23 elderK If it's just program data you're moving around, it's not really an issue.

11:41:36 elderK Like, if it's just strictly read in, do stuff, write out. You can deal with Unicode yourself.

11:41:38 jmercouris I am evaluating things with this data, so it is a bit of a problem for me ;)

11:41:42 White_Flame anyting you specifically want as a character or string would require the characters' codes to be within range

11:41:44 jackdaniel elderK: you have arrays for that

11:41:57 elderK jackdaniel: Yeah, exactly what I'm speaking about.

11:42:04 jmercouris I mean, the range is pretty high, it's a massive number

11:42:11 jmercouris I think it was 1 million something on SBCL on MacOS

11:42:29 elderK Well, that AFAIK is roughly the Unicode cap at current.

11:42:41 elderK 0x10ff00 or something like that?

11:42:48 White_Flame 10ffff

11:42:48 jmercouris if we would just remove emojis, I'm sure we could drop that number tremendously

11:42:49 elderK Around about that area.

11:42:57 White_Flame which is 1,114,111 dec

11:42:59 elderK White_Flame: You got it! :)

11:43:15 White_Flame jmercouris: I concur and wish to subscribe to your newsletter

11:43:26 jmercouris :D

11:43:41 White_Flame but in actuality that char code limit was established long before the emoji craze

11:43:52 elderK But still, just to get an answer: It is really an issue if and only if you're actually say, relying on CL's native string stuff, right? If you aren't say, intending on having the user give you stuff to read-from-string, or if you aren't reading "text streams", it's not really an issue, right?

11:44:27 White_Flame again, anything you specifically want as a character or string type in the runtime at some point would require the characters' codes to be within range

11:44:33 elderK jmercouris: I too see little point in emojis being in Unicode

11:44:50 White_Flame if you're reading from a byte stream, it will decode in whatever ways it supports

11:44:55 jackdaniel while emoji does seems silly at first sight, adding pictograms to unicode doesn't anymore

11:45:15 jackdaniel also 1644 characters isn't a big number

11:45:16 White_Flame it might even decode those bytes into something character-like

11:45:34 White_Flame although maybe not what you want

11:45:34 jmercouris emoji seems silly even at second, or third glance

11:45:41 jmercouris its been a few years now, and I am still not happy about it

11:45:52 jmercouris the only outcome has been people putting poop emojis in their github readmes

11:46:00 jmercouris because that's somehow hilarious

11:46:03 White_Flame emoji is not creating a record of existing glyphs, it's inventing new ones for the purpose

11:46:06 jackdaniel jmercouris: you don't see any utility in simple pictograms being part of the charaset?

11:46:06 elderK White_Flame: I mean like, if you're reading as binary, not as text, straight bytes. Uninterpreted or altered. And you, yourself, perform decoding and implemenet your own predicates, etc. As long as you aren't then saying: Yo, CL, read this <bunch or raw stuff>, it's not going to matter.

11:46:29 jmercouris jackdaniel: no, they just irritate me

11:46:40 White_Flame READ this is going to require the ability for it to be characters

11:46:42 elderK jackdaniel: I would rather useful pictograms be added. Not things like poop or cats.

11:46:46 jackdaniel such "signs" are universally recognizable disregarding written language knowledge

11:47:06 elderK White_Flame: Right. So, I'm saying if that isn't necessary and you aren't using string<= and stuff, then not having native Unicode support is not necessarily a killer.

11:47:09 White_Flame and unicode still doesn't have petscii or klingon

11:47:13 White_Flame (last I checked)

11:47:35 White_Flame elderK: depends on what you mean by "killer". It means you can't use any string, character, READ, etc utilities

11:48:01 jmercouris jackdaniel: ok, that's a pretty convincing argument actually, however let's say I disagree about WHICH emojis are necessary or not

11:48:02 White_Flame it would probably make sense to marshall such a thing into some custom escaped string representation

11:48:20 jackdaniel this list http://unicode.org/emoji/charts/full-emoji-list.html doesn't seem half bad

11:48:28 Bike the hieroglyph for poop looks more like a pear

11:48:29 jmercouris jackdaniel: for example, I can imagine a pictogram indicating "restroom" would be very useful

11:48:34 Bike that seems like an odd decision, egypt

11:48:39 Bike guess it wad distorted over time

11:48:48 jackdaniel jmercouris: there is emoji indicating restroom, yes

11:49:01 jmercouris right, but then you have one indicating cowboy smiling

11:49:02 shka__ aaaaaaaaaaaand now we are discussing emojis

11:49:10 jmercouris I can't imagine the practical application of such a sign

11:49:15 Bike elderK: i mean it's like implementing arithmetic yourself. you can do it but it's not great.

11:50:07 jackdaniel you disagree with about, say, 256 characters in million, I'm sure someone found cowboy character useful if its there

11:50:27 elderK Bike: Aye. A couple years back I implemented a whole set of such encoding / decoding libraries in C for a project I was doing.

11:50:31 White_Flame aren't there a ton of combining forms for them as well as skin color modifiers and other meta stuff adding to implementation woes?

11:50:41 elderK I learned a lot. And it was just pure code-point decoding, no normalization or... you know, the harder stuff

11:50:44 Bike well, as you mentioned, C's built in string support is kind of crap anyway

11:50:50 Bike so you're not missing much