When making annotation transfers using RATT, you will get a report about the general transfer stats, e.g., number of gene models being correctly transferred, and those not transferred.
But you might want to know about more details. One simple way is to align the sequences before and after transfer using pairwise sequence alignmnet. The Associative Array in Bash can simply perform this job with just the following files:
- a list containing id pairs (e.g., id-pairs.txt)
- single aa/nt sequence files for each gene
And the bash script could look like this:
Once you have finished the blast job, you need to parse the results and get top hits. I used the Perl script ncbiblastparser.pl, which works quite well.