freenode/lisp - IRC Chatlog

9:24:26 beach p_l: Yes, but many data sets now fit in the RAM of a reasonable PC. For instance a complete social-security database for a country, the register of all automobiles in a country, the bank transactions of all customer of a particular bank, etc.

9:25:54 beach p_l: But I often see inexperienced project leaders and developers that are unable (or too lazy) to make this back-of-the-envelope calculation, so they will pay way too much money for something like Oracle without actually needing many of its features.

9:26:57 beach p_l: Furthermore, when I do the calculation for them, they get *really* angry, as if, deep down, they didn't want to know the truth.

9:26:59 p_l beach: dunno about other countries, but in many cases the issue is not "will it fit in naive memory implementation" but "will it not eat data, is tested to hell and back, can survive direct aircraft strike (not joking)

9:27:42 beach Sure. The cases I have seen were all non-critical applications.

9:28:02 p_l but you just mentioned datasets which are all critical

9:28:06 TMA beach: that's not entirely accurate. Given that there are several billion transactions even in a small bank, just storing their ID would eat a good chunk of RAM

9:28:40 p_l UV3000 is not a "reasonable PC", and one of its marked use cases is running single-instance DB for single ERP instance

9:29:19 p_l said DB is in-memory store

9:29:22 beach TMA: "several billion" is no longer that much.

9:30:27 Cymew That sentence is so scary.

9:30:41 beach p_l: I guess we have different criteria for "critical".

9:31:17 TMA beach: if you consider that each transaction is several hundred bytes worth of data, that makes the total several hundred gigs -- that's not a "reasonable pc" anymore

9:31:36 beach For a bank? Come on!

9:31:49 p_l yeah, UV300H is marketed exactly at banks

9:31:54 p_l among other things

9:32:02 beach They spend more than that on their Oracle license.

9:32:09 p_l they generally fit one application

9:32:23 p_l beach: many banks don't use Oracle RBDMS for accounts

9:32:38 beach Good. What do they use instead?

9:32:52 p_l there are specialized banking DBs used for decades, often with data storage systems tailored for particular bank

9:33:09 beach I see.

9:33:24 p_l Oracle, SAP HANA, etc. etc. shows up in analysis applications or various non-account parts

9:34:16 p_l for example, in a project I work with, we use distributed in-memory store from Oracle (it was acquisition) for various "funds" data (it's called even "General Funds Manager")

9:34:42 beach Despite what shka says, if I ever get the time, I should read up on all these issues and then form an opinion about the current state of things.

9:35:11 shka beach: you will loose your faith in the humanity

9:35:20 p_l some of the most costly (in licensing and hw) systems are on the other hand low-memory (comparatively) but very highly available and more concerned with huge I/O rates as they talk all over the world

9:35:26 beach shka: Ah, that happened a long time ago. :)

9:36:19 beach p_l: That sounds like a very valid concern.

9:36:29 p_l beach: generally, there's a lot of in-memory stores in play in OLAP, while many OLTP tasks are low-memory intensive but need disk for persistence and I/O for comms

9:36:49 p_l meanwhile with OLAP it's all about "load as much data as you can so we can analyse it in all directions"

9:37:10 shka one those not loose faith until he sees mongo + node.js stack

9:37:18 p_l It's usually only big customers that can afford in-memory, though

9:37:43 shka well, it changes though

9:38:06 p_l shka: well, yes, but I am not talking about lazy programmers who go with "what is easiest without considering drawbacks"

9:38:20 p_l I'm talking about design tradeoffs that might be taken by serious people :)

9:38:48 shka what could be done and should be done is distributed, P2P, transactional, in-memory storage

9:39:18 p_l shka: guess what we use... from Oracle

9:39:32 shka so I people could stack large numbers of unexpensive machines together

9:39:55 shka p_l: oh, so oracle is already doing it?

9:40:41 shka anyway, just let me finish my thought

9:41:19 shka it is EASIER to build relaible system out of large number of crappy parts

9:42:52 shka anyway, single machine with 1TB of RAM is just huge single point of failure

9:43:01 phoe it is EASIER to build reliable software out of large number of crappy developers*

9:43:03 phoe FTFY

9:43:07 phoe ACTION ducks

9:43:13 shka and nowdays, almost everything is supposed to run 24/7

9:43:32 shka phoe: yes, because they are exchangable :P

9:43:38 shka that's how java works :D

9:44:18 phoe shka: where are our 1000s of crappy lispers then!?

9:45:04 shka phoe: lisp is not exactly popular, dunno if you noticed

9:45:15 phoe shka: I noticed, yes.

9:47:10 shka well, until big business will be able to swap developers in their lisp project, you won't see lisp projects in big business

9:47:19 shka sad but true

9:53:19 ebrasca shka: I am not sure about it.

9:53:36 shka that's good i guess

9:53:51 shka you don't have to see it all bleak

10:12:23 p_l shka: Oracle is a lot of products, even if you discount the ones they bought

10:12:38 p_l I think they have two or three in-memory offerings

10:13:00 p_l the one we have here even in its name suggests what it is about (Oracle Coherence)

10:16:02 p_l heh, then one hits the prices that show up for in-memory hw platforms

10:16:29 p_l 3m USD for just memory, for example

10:17:17 p_l or 1m usd for certified parts and 750k for "from the street" memory

10:25:25 shka lol

10:26:01 shka p_l: honestly, those prices are crazy

10:26:51 phoe they ain't crazy

10:26:54 phoe they're "enterprise"

10:26:58 p_l shka: now consider that some of those machines require 2x the memory vs. what you can use, due to RAIM in mirror mode

10:27:25 p_l phoe: "on the street" is the pricing without "enterprise markup" (actually an extra guarantee among other things, important when dealing with that kind of money)

10:27:52 shka there is room for competition

10:28:38 p_l and if you deal with things like z-series mainframes you have a machine that has 2.5 the amount of memory you can maximally spec, because mirror RAIM and that .5 for internal use by machine itself

10:31:10 p_l I heard anecdotes of a big z-Series losing whole I/O drawer under max load and it having not affected the application

10:33:47 flip214 well, we've got a customer who didn't notice that his RAID controller was dead for two weeks, because the data was accessible via the software-mirror (DRBD)

10:35:23 p_l flip214: this is more a case where the I/O of the system, which was under load (including things like mirroring to another machine) was unaffected

10:36:06 p_l IO drawer is where things like Ethernet, Infiniband and FC (plus occassional ancient ESCOM) cards sti

10:36:07 p_l *sit

10:36:20 shka p_l: well, thing is that mainframes are expensive

10:36:37 shka it would be better to figure out how to use commodity machines

10:37:31 p_l reliable in-memory systems tend to be expensive as well, especially when you don't want to directly deal with disk (either huge expensive NUMA systems, or RDMA)

10:53:52 shka p_l: RDMA is getting cheaper, and besides, even without RDMA you can build reasonable system

10:56:16 p_l shka: yes, but the latencies make it very far from "let's just think of memory instead of multiple completely disparate systems"

10:57:00 p_l and in my experience, companies balk when I suggest "ok, let me hit ebay, I'll have the 56GBit low-latency fabric ordered by tomorrow on the cheap" ;)

10:58:11 shka hehe

10:59:19 shka anyway, those would be different beast, i agree, but this approach has some advantage over ultra expensive huge IO mainframes

10:59:56 p_l most companies that lease mainframes have very specific reasons for doing so (combination of RAS requirements, power, software base, etc.)

11:00:18 p_l yet companies still develop on them

11:00:37 flip214 shka: rdma is already cheap.

11:00:40 p_l (it would be interesting to have CL on z/OS ...)

11:01:01 p_l flip214: if you're willing to hit ebay, it's cheaper than 40/10GbE

11:01:05 shka flip214: *cheaper

11:01:05 flip214 you can get 56gbit IB, 2 dual-port adapters and 2 cables (so two machines can talk 112gbps) for < €1000

11:01:35 shka you can have rdma over ethernet nowdays

11:01:40 shka btw :P

11:01:42 flip214 and that's new, with guarantee etc.

11:01:47 flip214 shka: yes, ROCE.

11:01:51 p_l shka: which is a) expensive b) not as good

11:01:58 flip214 even over virtual network links...

11:02:07 flip214 and the latency is much worse than with eg. IB

11:02:15 shka it should get better

11:02:17 p_l you can of course emulate RDMA, you always could

11:02:18 shka over time

11:02:25 p_l shka: it won't because ethernet is ethernet

11:02:49 p_l the changes necessary to make it reach the same perf as IB are fundamental

11:02:50 flip214 shka: ethernet resp. UDP have the checksum at the front of the packet.

11:02:59 p_l there's also flow control

11:03:05 flip214 so all of the data needs to have been seen before the first data byte can be transmitted

11:03:10 flip214 so latency is worse than with eg. IB

11:03:29 shka uhm

11:03:37 shka i guess i don't understand it

11:03:54 shka i thought that UDP has nothing to do with ethernet on it's own

11:04:16 flip214 shka: ROCE is doing RDMA over ethernet, by encapsulating the data in UDP

11:04:25 p_l shka: there are some low-level details that impact how fast you can go with Ethernet without changing it into non-Ethernet, including how it's designed for Store-and-Forward data

11:04:28 flip214 RDMA hardware like IB doesn't need that

11:04:40 p_l flip214: and combining it with the weird stack for loss-less ethernet

11:04:57 shka ok

11:05:19 shka well, i can't be huge fan of ROCE i guess

11:05:25 p_l shka: IB doesn't have store-and-forward inside subnet (most sites anyone sees are single subnet)

11:05:39 flip214 ROCE is still better than TCP, for some use-cases at least.

11:06:00 p_l and has better flow control (token system) with actually working SDN ;)

11:06:02 flip214 and yes, ROCE can be routed... IB cannot (TTBOMK)

11:06:21 p_l flip214: DoD has continent-spanning routed IB network

11:06:36 p_l IB on purpose uses IPv6 addressing to make it easier

11:07:20 shka p_l: what for?

11:07:26 shka radars?

11:07:38 p_l shka: or possibly linking supercomputers

11:07:48 shka right

11:08:05 flip214 IB uses IPv6? I thought low-level addressing is the 20byte address?

11:08:17 p_l Netherlands used Myrinet on mixed Myrinet/10GbE links, in poland I think we did something shitty with plain IP... ;)

11:08:45 p_l flip214: IB uses IPv6 *addressing*, and intra-subnet packets have 16bit address for wormhole routing

11:09:32 p_l the 16bit addresses are internal affair of the subnet that software stack is not supposed to bother itself with (outside of management software for subnet itself)

11:11:54 p_l it's much harder to do wormhole switching when you have to match complex 128bit addressing to ports ;)

11:52:55 [X-Scale] ** NICK X-Scale

17:34:29 Posterdati hi

17:37:10 shrdlu68 Posterdati: hi

19:27:33 rpg I suspect that CL-DOT cannot handle subgraphs, but I'm not 100% sure. Can anyone confirm?

20:47:21 Ven ** NICK Guest27801

20:56:42 Guest27801 ** NICK Ven```