Formal Concept Analysis for Image Classification and Machine Learning Models for Anti-CRISPR Protein Discovery in Bioinformatics

Minal Khatri, University of Nebraska - Lincoln

Abstract

This study investigates two critical areas in bioinformatics: enhancing transparency in medical image analysis and advancing the discovery of Anti-CRISPR (Acr) proteins, which have potential for developing more precise and controlled CRISPR-Cas gene editing tools. While convolutional neural networks (CNNs) are increasingly applied in critical fields like medical diagnosis, understanding their decision-making process remains a challenge. Although visualization techniques like saliency maps offer insights into a CNN’s decision-making for individual images, they do not explicitly establish a relationship between the high-level features learned by CNNs and the class labels across a dataset. To bridge this gap, the Formal Concept Analysis (FCA) framework is leveraged as an image classification model, establishing a novel method for understanding the relationship between abstract features and class labels in medical imaging. The model’s performance is validated on datasets ranging from the simpler MNIST to more complex histopathological image datasets like Warwick-QU and BreakHIS. Simultaneously, the study explores the genomic context of Acr genes and the 3D structure of Acr proteins for Acr discovery, two aspects that current bioinformatics tools do not explore extensively. By leveraging genomic context, we overcome data scarcity and develop a machine learning model capable of discovering new Anti-CRISPRs. Additionally, the 3D structure analysis aids in developing machine learning classifiers that classify proteins by Acr type and CRISPR-Cas system. Overall, this research contributes to the field of bioinformatics by developing robust methodologies, enhancing our understanding of medical image analysis, and advancing Acr protein discovery.

Subject Area

Computer science|Medical imaging|Bioinformatics

Recommended Citation

Khatri, Minal, "Formal Concept Analysis for Image Classification and Machine Learning Models for Anti-CRISPR Protein Discovery in Bioinformatics" (2023). ETD collection for University of Nebraska-Lincoln. AAI30813682.
https://digitalcommons.unl.edu/dissertations/AAI30813682
