Sql – Simple, fast SQL queries for flat files

flat-filelinuxsortingsql

Does anyone know of any tools to provide simple, fast queries of flat files using a SQL-like declarative query language? I'd rather not pay the overhead of loading the file into a DB since the input data is typically thrown out almost immediately after the query is run.

Consider the data file, "animals.txt":

dog 15
cat 20
dog 10
cat 30
dog 5
cat 40

Suppose I want to extract the highest value for each unique animal. I would like to write something like:

cat animals.txt | foo "select $1, max(convert($2 using decimal)) group by $1"

I can get nearly the same result using sort:

cat animals.txt | sort -t " " -k1,1 -k2,2nr

And I can always drop into awk from there, but this all feels a bit awkward (couldn't resist) when a SQL-like language would seem to solve the problem so cleanly.

I've considered writing a wrapper for SQLite that would automatically create a table based on the input data, and I've looked into using Hive in single-processor mode, but I can't help but feel this problem has been solved before. Am I missing something? Is this functionality already implemented by another standard tool?

Halp!

Best Answer

I wrote TxtSushi mostly to do SQL selects on flat files. Here is the command chain for your example (all of these commands are from TxtSushi):

tabtocsv animals.txt | namecolumns - | tssql -table animals - \
'select col1, max(as_int(col2)) from animals group by col1'

namecolumns is only required because animals.txt doesn't have a header row. You can get a quick sense of what is possible by looking through the example scripts. There are also links to similar tools on the bottom of the main page.

Related Solutions

Sql – Should I use != or <> for not equal in T-SQL

Most databases support != (popular programming languages) and <> (ANSI).

Databases that support both != and <>:

MySQL 5.1: != and <>
PostgreSQL 8.3: != and <>
SQLite: != and <>
Oracle 10g: != and <>
Microsoft SQL Server 2000/2005/2008/2012/2016: != and <>
IBM Informix Dynamic Server 10: != and <>
InterBase/Firebird: != and <>
Apache Derby 10.6: != and <>
Sybase Adaptive Server Enterprise 11.0: != and <>

Databases that support the ANSI standard operator, exclusively:

IBM DB2 UDB 9.5: <>
Microsoft Access 2010: <>

Linux – How to change permissions for a folder and its subfolders/files in one step

The other answers are correct, in that chmod -R 755 will set these permissions to all files and subfolders in the tree. But why on earth would you want to? It might make sense for the directories, but why set the execute bit on all the files?

I suspect what you really want to do is set the directories to 755 and either leave the files alone or set them to 644. For this, you can use the find command. For example:

To change all the directories to 755 (drwxr-xr-x):

find /opt/lampp/htdocs -type d -exec chmod 755 {} \;

To change all the files to 644 (-rw-r--r--):

find /opt/lampp/htdocs -type f -exec chmod 644 {} \;

Some splainin': ^{(thanks @tobbez)}

chmod 755 {} specifies the command that will be executed by find for each directory
chmod 644 {} specifies the command that will be executed by find for each file
{} is replaced by the path
; the semicolon tells find that this is the end of the command it's supposed to execute
\; the semicolon is escaped, otherwise it would be interpreted by the shell instead of find

Best Answer

Related Solutions

Sql – Should I use != or <> for not equal in T-SQL

Linux – How to change permissions for a folder and its subfolders/files in one step

Related Topic