R – Large/complex dataset for ORM validation

data modelingdatabaseorm

I'm looking for freely available sample data to do some ORM validation and performance testing on. My main requirements are:

  • Discriminated types: it needs to include some kind of inheritance eg Party -> Person and Organisation;
  • Reasonably complex data model;
  • Some reasonably large tables (in the hundreds of thousands of rows at least);
  • Multi-platform: at a minimum Oracle, MySQL and SQL Server;
  • Readily understandable. For example a CRM application is what I call readily understandable. People understand clients, customers and so forth.

Can anyone point me to anything like that?

Best Answer

The SQL Server AdventureWorks database fits most of your criteria. It has around 69 tables and a fair amount of data. It comes scripted, so with a bit of editing it could be made to work with Oracle and MySQL. There are business scenarios and use cases.

Not sure it has discriminated types though.

Related Topic