Additional content

Additional content#

This content is an additional resource for interested students, who want to learn about a wider variety of topics and commands used by scientists working on the command line on a daily basis.

Annotating other features#

Other than genes, there are techniques and software for annotating an array of other features. Many of them use similar methods as we have discussed above - build a statistical model of a feature based on example sequences and use that model to find similar features. This is typical for promoters, binding sites and other sequence motifs.

There are various different annotation tasks you might want to perform on a novel sequence, for example:

  • Gene prediction

  • Searching for a gene with a particular function

  • Promoter and protein binding site prediction

  • The prediction of various non-messenger RNAs

  • The prediction of CRISPR arrays

  • Finding signal peptide features in coding sequences