Deep Learning for Proteomics Data for Feature Selection and Classification

Sahar Iravani; Tim Conrad

doi:10.1007/978-3-030-29726-8_19

Conference Papers Year : 2019

Sahar Iravani (1), Tim Conrad (1, 2)

1. Zuse Institute Berlin
2. Free University of Berlin

Abstract

Today's high-throughput molecular profiling technologies make it routine to create large datasets that provide detailed information about a given biological sample, e.g. the concentrations of the thousands of proteins it contains. A standard task in precision medicine is to identify a set of biomarkers (e.g. proteins) from these datasets that can be used for disease diagnosis, prognosis, or monitoring treatment response. However, finding good biomarker sets remains challenging because of the high dimensionality and complexity of the data and the often high noise level.

In this work, we present an approach to this problem based on deep neural networks (DNNs) and a transfer learning strategy using simulation data. To allow interpretation of the results, we compare different approaches for analyzing the learned DNN. Based on these interpretation approaches, we describe how to extract biomarker sets.

Compared to a state-of-the-art $\ell_1$-support vector machine ($\ell_1$-SVM) approach, our method finds better biomarker sets for classification when a small number of features is desired.
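As a rough illustration of what selecting a small biomarker set with an $\ell_1$ penalty can look like, the sketch below fits an $\ell_1$-penalized linear classifier to simulated two-class data and keeps the features with the largest non-zero weights. It is not the authors' DNN method, and it is not their $\ell_1$-SVM baseline either: for simplicity it uses a logistic loss instead of the SVM hinge loss, optimized by proximal gradient descent. All function names, the simulation setup, and the parameter values are assumptions made for this example.

```python
import math
import random


def simulate(n_samples, n_features, informative, shift, seed):
    """Simulate two-class data in which only the `informative` features differ by class."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2  # alternate labels so both classes are balanced
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        if label == 1:
            for j in informative:
                row[j] += shift  # class 1 has a raised mean on informative features
        X.append(row)
        y.append(label)
    return X, y


def soft_threshold(value, threshold):
    """Proximal operator of the l1 norm: shrink a value toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def fit_l1_logistic(X, y, penalty, step=0.1, n_iter=200):
    """Fit l1-penalized logistic regression by proximal gradient descent (ISTA)."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * p, 0.0
        for row, label in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, row))
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            err = 1.0 / (1.0 + math.exp(-z)) - label
            grad_b += err / n
            for j in range(p):
                grad_w[j] += err * row[j] / n
        b -= step * grad_b  # the intercept is not penalized
        w = [soft_threshold(wj - step * gj, step * penalty)
             for wj, gj in zip(w, grad_w)]
    return w, b


def select_features(w, k):
    """Return up to k feature indices with the largest non-zero absolute weights."""
    nonzero = [j for j, wj in enumerate(w) if wj != 0.0]
    return sorted(nonzero, key=lambda j: -abs(w[j]))[:k]


if __name__ == "__main__":
    X, y = simulate(n_samples=120, n_features=30,
                    informative=[2, 11, 23], shift=2.0, seed=0)
    w, b = fit_l1_logistic(X, y, penalty=0.1)
    print("selected features:", sorted(select_features(w, k=3)))
```

Increasing `penalty` drives more weights to exactly zero and so yields smaller candidate sets, which is the regime the abstract refers to when it speaks of a small number of features.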

Domains

Computer Science [cs]


https://inria.hal.science/hal-02520063

Submitted on : Thursday, March 26, 2020, 1:52:32 PM

Last modification on : Friday, June 30, 2023, 10:24:04 AM

Long-term archiving on : Saturday, June 27, 2020, 2:41:55 PM

Dates and versions

hal-02520063 , version 1 (26-03-2020)

Licence

Attribution

Identifiers

HAL Id : hal-02520063 , version 1
DOI : 10.1007/978-3-030-29726-8_19

Cite

Sahar Iravani, Tim Conrad. Deep Learning for Proteomics Data for Feature Selection and Classification. 3rd International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2019, Canterbury, United Kingdom. pp.301-316, ⟨10.1007/978-3-030-29726-8_19⟩. ⟨hal-02520063⟩

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC5 IFIP-WG IFIP-TC12 IFIP-WG8-4 IFIP-WG8-9 IFIP-CD-MAKE IFIP-WG12-9 IFIP-LNCS-11713
