TARO Database schema

TARO records about 100Mb of data per single query. Deployed TARO database size is 13 Gb. Database dump size 1.2 Gb. Table sizes are:

  1. 4.8Gb - pipe_display_main
  2. 4.3Gb - pipe_annotation
  3. 1.9Gb - pipe_sequence_homology
  4. 0.76Gb - pipe_sequence_statistics
  5. 0.59Gb - pipe_rps_blast_top_hit
  6. 0.34Gb - pipe_targetdb_status
  7. <0.6Gb - other tables

Suggested Improvements

Several foreing key constraints appears to be missing. The database consistency would have benefited if they are added. Unused tables may be safely dropped (pipe_astral_blast_top_hit)

TARO database scheme part 1

TARO database scheme part 2