Track topics on Twitter Track topics that are important to you
While mutation rates can vary markedly over the residues of a protein, multiple sequence alignment tools typically use the same values for their scoring-function parameters across a protein's entire length. We present a new approach, called adaptive local realignment, that in contrast automatically adapts to the diversity of mutation rates along protein sequences. This builds upon a recent technique known as parameter advising, which finds global parameter settings for an aligner, to now adaptively find local settings. Our approach in essence identifies local regions with low estimated accuracy, constructs a set of candidate realignments using a carefully-chosen collection of parameter settings, and replaces the region if a realignment has higher estimated accuracy. This new method of local parameter advising, when combined with prior methods for global advising, boosts alignment accuracy as much as 26% over the best default setting on hard-to-align protein benchmarks, and by 6.4% over global advising alone. Adaptive local realignment has been implemented within the Opal aligner using the Facet accuracy estimator.
This article was published in the following journal.
Name: Journal of computational biology : a journal of computational molecular cell biology
Machine-learning models trained on protein sequences and their measured functions can infer biological properties of unseen sequences without requiring an understanding of the underlying physical or b...
Protein thermostability engineering is a powerful tool to improve resistance of proteins against high temperatures and thereafter broaden their applications. For efficient protein thermostability engi...
Protein-protein interactions (PPIs) play crucial roles in almost all cellular processes. Although a large amount of PPIs have been verified by high-throughput techniques in the past decades, currently...
Advances in sequencing technologies led to rapid increase in the number and diversity of biological sequences, which facilitated development in the sequence research. In this paper, we present a new m...
Intrinsically disordered proteins lack stable 3-dimensional structure and play a crucial role in performing various biological functions. Key to their biological function are the molecular recognition...
The overall objective of this 3-year project is to determine whether the provision of realignment therapy in patients with medial knee osteoarthritis (OA) relieves knee pain and improves f...
It is prospective randomized comparative clinical trial comparing primary urethral realignment vs. suprapubic cystostomy in initial management of pediatric pelvic fracture urethral injury ...
Soft tissue realignment of the tibial insertion of the patella tendon is a simple operative technique for treating dislocation of the patella in childhood and adolescence. It is performed ...
To test and validate newly developed magnetic resonance (MR) pulse sequences for their ability to enhance the collection of morphological, biomedical, and functional information from the h...
This study aims to use a type of radiation (adaptive radiotherapy) to deliver curative-intent treatment to patients with non-small cell lung cancer, whose tumors would otherwise be too lar...
The excision of in-frame internal protein sequences (INTEINS) of a precursor protein, coupled with ligation of the flanking sequences (EXTEINS). Protein splicing is an autocatalytic reaction and results in the production of two proteins from a single primary translation product: the intein and the mature protein.
The systematic study of annotated genomic information to global protein expression in order to determine the relationship between genomic sequences and both expressed proteins and predicted protein sequences.
Repetitive nucleic acid sequences that are principal components of the archaeal and bacterial CRISPR-CAS SYSTEMS, which function as adaptive antiviral defense systems.
Transforming glycoprotein coded by the fms oncogene from the Susan McDonough strain of feline sarcoma virus (SM-FeSV). The oncogene protein v-fms lacks sequences, which, in the highly homologous proto-oncogene protein c-fms (CSF-1 receptor), normally serve to regulate its tyrosine kinase activity. The missing sequences in v-fms mimic the effect of ligand and lead to constitutive cell growth. The protein gp120(v-fms) is post-translationally modified to generate gp140(v-fms).
A form of GENE LIBRARY containing the complete DNA sequences present in the genome of a given organism. It contrasts with a cDNA library which contains only sequences utilized in protein coding (lacking introns).
Within medicine, nutrition (the study of food and the effect of its components on the body) has many different roles. Appropriate nutrition can help prevent certain diseases, or treat others. In critically ill patients, artificial feeding by tubes need t...