Mastering BLAST tutorial
Mastering BLAST tutorial
BLAST, which stands for Basic Local Alignment Search Tool, is a powerful
bioinformatics program used for comparing an input sequence (query) against a
database of sequences. It identifies regions of local similarity, helping researchers
infer functional and evolutionary relationships between sequences and identify
members of gene families. Developed by Altschul et al. in 1990, BLAST has
become one of the most widely used tools in molecular biology and genetics.
BLAST comes in several flavors, each tailored for specific types of sequence
comparisons:
Each program serves specific research needs, from identifying homologous genes
to studying evolutionary patterns.
Before running a BLAST search, it's crucial to prepare your query sequence and
choose the appropriate database and parameters. Here are the key steps:
3. **BLAST Program**: Select the appropriate BLAST program based on the type
of your query and the database.
Most BLAST searches are conducted online through the NCBI BLAST web
interface. Here’s a step-by-step guide to running a BLAST search:
3. **Input the Query Sequence**: Paste your sequence into the query box or
upload a file in FASTA format.
4. **Select the Database**: Choose the database you want to search against.
6. **Submit the Search**: Click the “BLAST” button to initiate the search.
7. **Review Results**: Once the search is complete, results will be displayed in a
format that includes:
- **Summary Table**: Lists significant alignments with scores, E-values, and
descriptions.
- **Graphical Overview**: Visual representation of alignments along the query
sequence.
- **Alignments**: Detailed alignments showing the query and subject sequences
with matching regions highlighted.
1. **Score and Bit Score**: Indicate the quality of the alignment. Higher scores
suggest better alignments.
4. **Query Coverage**: Percentage of the query sequence that is aligned with the
subject sequence.
5. **Alignment**: Shows the actual alignment between the query and subject
sequences, highlighting matches, mismatches, and gaps.
#### Conclusion