One thing to keep in mind: there is no distributed filesystem (HDFS in Hadoop's case). So depending on your workload, you may need to choose which DFS to use alongside Disco.
1) I would recommend creating a scratch account to test the import with. You can always nuke that account and retry the import process.
2) You will want to include more fields than the sample does. Definitely date. Maybe things like tags, though their tag support for reading is pretty limited at present.
Cool. Yours looks more like something with long-term developer value. Mine was just a quick tool designed to be used once by people who don't want to code.