Bioinformatics Search Tools Creation. Best Practices of Russian offshore IT outsourcing companies. RUSSOFT

Project Title: Bioinformatics Search Tools Creation

Company: Azati Software Corporation
Client (Country): Biotechnology Corporation
Duration, months: Confidential
Total Effort, person/months: Confidential

The Challenge

Biological scientists searching for complementarity determining regions (CDRs), fusion/chimeric constructs, recombinant plasmid constructs and bispecific antibodies needed a reliable tool that could search multiple nucleotide (or protein) queries against a nucleotide (or protein) database. The challenge lay in the feasibility of the project itself: at that time, no tool for searching with several query sequences at once had been introduced.

The Solution

The Azati team designed, developed and released a general-purpose Multiple Sequence Search (MSS) tool that can accept and search on up to six query sequences.

The MSS tool can be used to find and list documents that contain CDR sequences of interest. It can also search simultaneously for multiple CDRs contained within the same patent document, taking into account that the CDRs might appear in different claims. The tool uses our enhanced version of the Smith-Waterman algorithm to produce results that are accurate and comprehensive, and to do so in considerably less time (30 to 50 times faster than the standard Smith-Waterman algorithm). The interface makes it easy to see whether one or more similar sequences might have been claimed within a single document. An advanced scoring system assigns a higher score to documents that contain a greater number of CDRs matching the search query, which produces more relevant results. The system also provides an option to show a combined alignment. In addition, we implemented report generation (in four different file formats) based on the search results, so that users can clearly identify which query sequences are aligned to which subject sequences.
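To make the scoring idea concrete, here is a minimal C++ sketch of a textbook Smith-Waterman local alignment score combined with a simple per-document ranking rule. It is not the enhanced algorithm described above, which is not published in this case study. The names Scoring, smithWatermanScore, DocumentHit, scoreDocument and ranksHigher, the match/mismatch/gap values and the threshold parameter are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical scoring parameters; the case study does not state the real ones.
struct Scoring {
    int match = 2;      // reward for identical residues
    int mismatch = -1;  // penalty for differing residues
    int gap = -2;       // linear gap penalty (assumption)
};

// Textbook Smith-Waterman: best local alignment score of query against subject.
// Only two rows of the dynamic-programming matrix are kept in memory.
int smithWatermanScore(const std::string& query, const std::string& subject,
                       const Scoring& s = Scoring()) {
    std::vector<int> prev(subject.size() + 1, 0);
    std::vector<int> curr(subject.size() + 1, 0);
    int best = 0;
    for (std::size_t i = 1; i <= query.size(); ++i) {
        curr[0] = 0;
        for (std::size_t j = 1; j <= subject.size(); ++j) {
            const int diag = prev[j - 1] +
                (query[i - 1] == subject[j - 1] ? s.match : s.mismatch);
            const int up = prev[j] + s.gap;        // gap in the subject
            const int left = curr[j - 1] + s.gap;  // gap in the query
            curr[j] = std::max({0, diag, up, left});  // local: never below zero
            best = std::max(best, curr[j]);
        }
        std::swap(prev, curr);
    }
    return best;
}

// Per-document summary used for ranking (hypothetical structure).
struct DocumentHit {
    int matchedQueries;  // queries whose best score reaches the threshold
    int totalScore;      // sum of the best scores of the matched queries
};

// A document is modeled as a list of sequences, e.g. taken from different claims.
DocumentHit scoreDocument(const std::vector<std::string>& queries,
                          const std::vector<std::string>& documentSequences,
                          int threshold) {
    DocumentHit hit{0, 0};
    for (const std::string& q : queries) {
        int bestForQuery = 0;
        for (const std::string& seq : documentSequences) {
            bestForQuery = std::max(bestForQuery, smithWatermanScore(q, seq));
        }
        if (bestForQuery >= threshold) {
            ++hit.matchedQueries;
            hit.totalScore += bestForQuery;
        }
    }
    return hit;
}

// Ranking rule: more matched queries first, then the higher total score.
bool ranksHigher(const DocumentHit& a, const DocumentHit& b) {
    if (a.matchedQueries != b.matchedQueries) {
        return a.matchedQueries > b.matchedQueries;
    }
    return a.totalScore > b.totalScore;
}
```

With these assumed parameters, a document whose sequences contain both of two queries ranks above a document that contains only one of them, whatever the individual alignment scores. This mirrors the stated goal of favoring documents that match a greater number of CDRs.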

The resulting Multiple Sequence Search algorithm fully satisfies the stated requirements and gives researchers a more sophisticated way to search sequence data.

Tools and Technologies

● C/C++
● NVIDIA® CUDA® Toolkit
