Skip to content

Alignments Tutorial#29

Draft
danielle-pinto wants to merge 2 commits intomainfrom
2026-03-08-alignment
Draft

Alignments Tutorial#29
danielle-pinto wants to merge 2 commits intomainfrom
2026-03-08-alignment

Conversation

@danielle-pinto
Copy link
Collaborator

Explaining how to use BioAlignments.jl and providing some examples!

@danielle-pinto danielle-pinto requested a review from kescobo March 9, 2026 01:48
@danielle-pinto
Copy link
Collaborator Author

As I've been diving further into this package, I am realizing that this isn't exactly what I thought it was. Bioalignments.jl is for aligning two sequences in string format. Do we have any julia packages for aligning fastx files together? We could convert the fastx files to strings and then use Bioalignments, but I wasn't sure if the package was optimized for working with such big strings. What have you done in the past? Do you usually just use a command line tool in a bash script?

@github-actions
Copy link

github-actions bot commented Mar 9, 2026

@github-actions
Copy link

@danielle-pinto
Copy link
Collaborator Author

@kescobo I have finished the rough draft of my overview of the BioAlignments package. However, before wrapping up the tutorial, I would love some insight on the above comment/the best example to display (whether that's an alignment between some strings, or actual fastx files).

In a pairwise alignment, there is one reference sequence, and one query sequence,
Pairwise alignment differs from multiple sequence alignment (MSA) because
it only aligns two sequences, while MSAs align three or more.
In a pairwise alignment, there is one reference sequence and one query sequence,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While true - this is really an implementation detail. Most of the time, there's no difference if one thing is used as reference vs the other. The same is not true for MSA

- GlobalAlignment: global-to-global alignment
- `GlobalAlignment`: global-to-global alignment
- Aligns sequences end-to-end
- Best for sequences that are already very similar
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be nice to give examples here - an example here might be you have a particular gene from two closely related bacteria, or you're comparing alleles of a gene between two individuals.

It imposes an affine gap penalty for insertions and deletions,
which means that it penalizes the opening of a gap more than a gap extending.
This aligns (pun intended!!) with the biological principle that creating a gap is a rare event,
while extending an already existing gap is less so.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would word this difficulty - "creating a gap" and "extending" it, are analysis terms, not biological ones. Maybe something like "Deletions are rare mutations, but if there's a deletion, the length of the deletion is variable. Longer deletions are less likely than short ones only because they change the structure of the encoded protein more"

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants