Using BitTorrent to share very large files

bittorrent, file-transfer

At our institute, we will soon need to routinely share large volumes of data (multi-terabyte range).

  1. Would it make sense to use BitTorrent for this task?

  2. How much CPU/memory overhead should we expect, compared to a common FTP server?

  3. Is it possible to achieve speeds matching a direct FTP transfer when copying from exactly one BitTorrent peer (the original storage server) to another?

Thank you very much.

Best Answer

  1. I would think so. Be careful about the piece (block) size you choose, as it will need to be larger than standard for such a large amount of data.
  2. Not significant during the transfer: your bandwidth will be the bottleneck, not your CPU. Generating the torrent metafile in the first place (which involves hashing every block of the data set) will take quite some time, as will the final hash check on the client after the transfer has completed (see the sketch after this list).
  3. Yes, unless your connectivity provider, the client's provider, or a network in between is selectively shaping P2P traffic.
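As a rough illustration of points 1 and 2, here is a minimal sketch of the per-piece hashing a torrent creator has to do; the directory path and the 16 MiB piece size are assumptions to adjust for your data, and the piece count it prints is roughly what the metafile must store:

```python
import hashlib
from pathlib import Path

# Assumptions: the data lives under /data/export and 16 MiB pieces are used.
DATA_DIR = Path("/data/export")
PIECE_SIZE = 16 * 1024 * 1024  # deliberately larger than typical client defaults

def piece_hashes(paths, piece_size=PIECE_SIZE):
    """Yield the SHA-1 digest of each fixed-size piece, reading the files
    back-to-back the way the BitTorrent v1 format concatenates them."""
    buf = bytearray()
    for path in paths:
        with open(path, "rb") as fh:
            while chunk := fh.read(1024 * 1024):
                buf.extend(chunk)
                while len(buf) >= piece_size:
                    yield hashlib.sha1(buf[:piece_size]).digest()
                    del buf[:piece_size]
    if buf:  # final, possibly shorter, piece
        yield hashlib.sha1(bytes(buf)).digest()

if __name__ == "__main__":
    files = sorted(p for p in DATA_DIR.rglob("*") if p.is_file())
    pieces = sum(1 for _ in piece_hashes(files))
    print(f"{pieces} pieces of {PIECE_SIZE // 2**20} MiB each to hash and store")
```

Running this over a multi-terabyte data set gives you a feel for how long metafile generation (and the client's final hash check) will take on your hardware.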

To mitigate the issues in points 1 and 2: if you can split the data into smaller chunks and create a separate torrent for each chunk, you may find the volume of data easier to handle (a simple way to plan such chunks is sketched below).
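For example, one could group the files into batches of a target size and build one torrent per batch; the directory path and the 500 GiB target below are assumptions:

```python
from pathlib import Path

# Assumptions: same data directory, aiming at roughly 500 GiB of data per torrent.
DATA_DIR = Path("/data/export")
CHUNK_TARGET = 500 * 1024**3

def chunk_files(paths, target=CHUNK_TARGET):
    """Group files into batches of roughly `target` bytes;
    each batch would then get its own torrent metafile."""
    batch, size = [], 0
    for path in paths:
        batch.append(path)
        size += path.stat().st_size
        if size >= target:
            yield batch
            batch, size = [], 0
    if batch:
        yield batch

if __name__ == "__main__":
    files = sorted(p for p in DATA_DIR.rglob("*") if p.is_file())
    for i, batch in enumerate(chunk_files(files), start=1):
        total = sum(p.stat().st_size for p in batch)
        print(f"torrent {i}: {len(batch)} files, {total / 1024**4:.2f} TiB")
```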

Also note that you will need to regenerate the torrent metafiles if any data in the file(s) they cover is updated. If small parts of the data change while the rest stays the same, you will probably find rsync to be a much more efficient solution.

How large are the files in the dataset, and what is the distribution like (a few multi-gigabyte files, many smaller ones, ...)?