Linked by Thom Holwerda on Mon 30th Jan 2012 20:39 UTC
General Unix Finally something really interesting to talk about. If you've used UNIX or any of its derivatives, you've probably wondered why there's /bin, /sbin, /usr/bin, /usr/sbin in the file system. You may even have a rationalisation for the existence of each and every one of these directories. The thing is, though - all these rationalisations were thought up after these directories were created. As it turns out, the real reasoning is pretty damn straightforward.
Permalink for comment 505655
To read all comments associated with this story, please click here.
RE[5]: We are stuck in the past.
by jabjoe on Thu 2nd Feb 2012 13:20 UTC in reply to "RE[4]: We are stuck in the past."
Member since:

Size. Author. Title. Date. Compression rate. Encoding rate. Decoding rate. Etc. There are many other attributes to query.

As I said before, metadata. Or maybe media information from something like 'mediainfo'.

But it is put there by specialized software. Querying for metadata is not a standard feature of most filesystems, as is, let's say, the POSIX file interface.

Nor should it be. Wrong level to have it.

But it is not relational.

Not quite, but it's not really hierarchical either, not with links, it's more of a network. Still easy to argue is a database of a form.

These tools fail to return structured data, especially from non-text formats.

The case in question is meta data, the tool 'extract' sticks the meta data in a set structure out to stdout. With that, find and grep, you could write a "query" that find all jpegs that where taken with a certain camera. Of course this is a sequential search and a indexed one would be better. Plenty of programs that build custom indexs/database of meta data for this kind of purpose, but you don't want to index everything in every way, just in case.

Nope, it's the right tool for the job. The various development problems we are having today are due to the lack of databases in a large degree.

Example please. Windows WMI certainly hasn't convinced me.

These GBs that you speak of would be broken down to their individual parts, if stored in a database, and they will be indexable, and queriable, discoverable by any program

What parts exactly? Not all fileformats are like that, many are just blobs. What about a video, once you dismiss the metadata stuff, how are you going to store the stream? Binary blob block/chunks, just like a filesystem? On their own one is likely meaningless, and won't have anything in it you can search. Then why bother? Creates lots of needless big vacuuming for no gain.

they would support transactions, and they would allow programs to be notified of changes in the data store. All these capabilities are absent, more or less, from today's data storage systems.

Many filesystem do offer transactions, and even with those that don't, you can still do it. The SQLite site has some a doc on how they do it generically:

Pretty much every OS has a notification system for file or folder changes.
Linux's is inotify, or even just the "select" call.

Found this, sounds like what you want:

Reply Parent Score: 2