RNDr. Petr Somol, Ph.D.
research fellow

Department:
Pattern Recognition Department
Research interests:
security, feature selection, pattern recognition, machine learning
Biography
Petr Somol earned his master's degree in 1995 from the Faculty of Mathematics and Physics at Charles University in Prague. In 2000, he was awarded the academic titles RNDr. and Ph.D. by the same faculty, in cooperation with the Institute of Information Theory and Automation of the Academy of Sciences of the Czech Republic.
His research focuses on feature selection, pattern recognition, machine learning, and computer security. He has been involved in numerous Czech and European research projects and holds four U.S. patents in the field of computer security.
He has also supervised Ph.D. students from the Faculty of Electrical Engineering and the Faculty of Nuclear Sciences and Physical Engineering at the Czech Technical University in Prague.
Books and chapters
- A Statistical Review of the MNIST Benchmark Data Problem, Advances in Pattern Recognition Research, p. 172-193, Eds: Lu T., Chao T.H. Download [2018] :
- Průmysl 4.0 Výzva pro Českou republiku [Industry 4.0: A Challenge for the Czech Republic], Management Press (Praha, 2016) Download [2016] :
- Moderní metody výběru příznaků ve statistickém rozpoznávání [Modern feature selection methods in statistical pattern recognition], Umělá inteligence 6, p. 424-468, Eds: Mařík V., Štěpánková O., Lažanský J. Download [2013] :
- Efficient Feature Subset Selection and Subset Size Optimization, Pattern Recognition, Recent Advances, p. 75-98 Download [2010] :
- Recent feature selection methods in statistical pattern recognition, Pattern Recognition and String Matching, p. 1-51, Eds: Chen D., Cheng X., Kluwer (Dordrecht, 2003) [2003] :
Journal articles
- Efficient anomaly detection through surrogate neural networks, Neural Computing & Applications 34 23 (2022), p. 20491-20505 Download DOI: 10.1007/s00521-022-07506-9 [2022] :
- Improving feature selection process resistance to failures caused by curse-of-dimensionality effects, Kybernetika 47 3 (2011), p. 401-425 Download [2011] :
- Feature Selection Software to Improve Accuracy and Reduce Cost in Automated Recognition Systems, ERCIM News 2011 84 (2011), p. 54-54 Download [2011] :
- Statistical Model of the 2001 Czech Census for Interactive Presentation, Journal of Official Statistics 4 (2010), p. 1-23 Download [2010] :
- Evaluating Stability and Comparing Output of Feature Selectors that Optimize Feature Subset Cardinality, IEEE Transactions on Pattern Analysis and Machine Intelligence 32 11 (2010), p. 1921-1939 Download DOI: 10.1109/TPAMI.2010.34 [2010] :
- Interaktivní statistický model dat ze sčítání lidu v České republice v r. 2001 [Interactive statistical model of data from the 2001 census in the Czech Republic], Statistika: Statistics and Economy Journal 89 4 (2009), p. 285-299 Download [2009] :
- Computer-Aided Evaluation of Screening Mammograms Based on Local Texture Models, IEEE Transactions on Image Processing 18 4 (2009), p. 765-773 DOI: 10.1109/TIP.2008.2011168 [2009] :
- Evaluating the Stability of Feature Selectors that Optimize Feature Subset Cardinality, Lecture Notes in Computer Science 2008 5342 (2008), p. 956-966 Download DOI: 10.1007/978-3-540-89689-0 [2008] :
- Identifying the most informative variables for decision-making problems – a survey of recent approaches and accompanying problems, Acta Oeconomica Pragensia 16 4 (2008), p. 37-55 Download [2008] :
- Notes on the evolution of feature selection methodology, Kybernetika 43 5 (2007), p. 713-730 [2007] :
- Conditional Mutual Information Based Feature Selection for Classification Task, Lecture Notes in Computer Science 45 4756 (2007), p. 417-426 [2007] :
- Color Texture Segmentation by Decomposition of Gaussian Mixture Model, Lecture Notes in Computer Science 19 4225 (2006), p. 287-296 Download [2006] :
- Feature Selection Based on Mutual Correlation, Lecture Notes in Computer Science 19 4225 (2006), p. 569-577 Download [2006] :
- Oscillating feature subset search algorithm for text categorization, Lecture Notes in Computer Science 44 4225 (2006), p. 578-587 [2006] :
- Flexible-hybrid sequential floating search in statistical feature selection, Lecture Notes in Computer Science 44 4109 (2006), p. 623-639 Download [2006] :
- Filter- versus wrapper-based feature selection for credit scoring, International Journal of Intelligent Systems 20 10 (2005), p. 985-999 Download [2005] :
- Probabilistic neural network playing and learning Tic-Tac-Toe, Pattern Recognition Letters 26 12 (2005), p. 1866-1873 [2005] :
- Fast Branch & Bound algorithms for optimal feature selection, IEEE Transactions on Pattern Analysis and Machine Intelligence 26 7 (2004), p. 900-912 [2004] :
- Feature selection toolbox, Pattern Recognition 35 12 (2002), p. 2749-2759 Download [2002] :
- Multiple classifier fusion in probabilistic neural networks, Pattern Analysis and Applications 5 7 (2002), p. 221-233 [2002] :
- Feature selection toolbox software package, Pattern Recognition Letters 23 4 (2002), p. 487-492 [2002] :
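- Branch & Bound algoritmus s částečným řazením uzlů výpočetního stromu [Branch & Bound algorithm with partial ordering of computation tree nodes], Acta Oeconomica Pragensia 8 2 (2000), p. 33-40 [2000] :
- Oscilační algoritmy pro vyhledávání příznaků [Oscillating algorithms for feature search], Acta Oeconomica Pragensia 8 2 (2000), p. 25-32 [2000] :
- Výběr nejinformativnějších faktorů při akvizici podniků pomocí metod rozpoznávání obrazů [Selection of the most informative factors in company acquisitions using pattern recognition methods], Acta Oeconomica Pragensia 8 2 (2000), p. 143-159 [2000] :
- Znalostní přístup k výběru nejinformativnějších proměnných pro rozhodovací problémy klasifikačního typu [Knowledge-based approach to selecting the most informative variables for decision-making problems of classification type], Acta Oeconomica Pragensia 8 2 (2000), p. 11-24 [2000] :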
- Probabilistic information retrieval from census data based on distribution mixtures, Acta Oeconomica Pragensia 8 2 (2000), p. 41-47 [2000] :
- Road sign classification using Laplace kernel classifier, Pattern Recognition Letters 21, p. 1165-1173 [2000] :
- Adaptive floating search methods in feature selection, Pattern Recognition Letters 20, p. 1157-1163 Download [1999] :
- Conceptual base of feature selection consulting system, Kybernetika 34 4 (1998), p. 451-460 [1998] :
Other publications
- Multiple Instance Learning with Bag-Level Randomized Trees, Machine Learning and Knowledge Discovery in Databases, p. 259-272, Eds: Berlingerio M., Bonchi F., Gärtner T., Hurley N., Ifrim G. Download DOI: 10.1007/978-3-030-10925-7_16 [2019] :
- Density-Approximating Neural Network Models for Anomaly Detection, ACM SIGKDD 2018 Workshop, p. 1-8 Download [2018] :
- End-node Fingerprinting for Malware Detection on HTTPS Data, Proceedings of the 12th International Conference on Availability, Reliability and Security (ARES'17), p. 1-7 Download DOI: 10.1145/3098954.3107007 [2017] :
- Discriminative models for multi-instance problems with tree-structure, Proceedings of the 2016 ACM Workshop on Artificial Intelligence and Security (AISec'16), p. 83-91 Download DOI: 10.1145/2996758.2996761 [2016] :
- Finding New Malicious Domains Using Variational Bayes on Large-Scale Computer Network Data, NIPS Workshop: Advances in Approximate Bayesian Inference, p. 1-10 Download [2015] :
- Materials Classification using Sparse Gray-Scale Bidirectional Reflectance Measurements, Computer Analysis of Images and Patterns - CAIP 2015, p. 289-299, Eds: Azzopardi George, Petkov Nicolai Download DOI: 10.1007/978-3-319-23117-4_25 [2015] :
- On Stopping Rules in Dependency-Aware Feature Ranking, Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, p. 286-293 Download DOI: 10.1007/978-3-642-41822-8_36 [2013] :
- Predikce hospitalizační mortality u akutního infarktu myokardu [Prediction of in-hospital mortality in acute myocardial infarction], Sborník příspěvků MEDSOFT 2011, p. 128-138 Download [2011] :
- Fast Dependency-Aware Feature Selection in Very-High-Dimensional Pattern Recognition, Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (IEEE SMC 2011), p. 502-509 Download DOI: 10.1109/ICSMC.2011.6083733 [2011] :
- Fast Dependency-Aware Feature Selection in Very-High-Dimensional Pattern Recognition Problems, ÚTIA AV ČR, v.v.i (Praha, 2011) Download [2011] :
- Introduction to Feature Selection Toolbox 3 – The C++ Library for Subset Search, Data Modeling and Classification, ÚTIA (Praha, 2010) Download [2010] :
- Sequential Retreating Search Methods in Feature Selection, ÚTIA (Praha, 2010) [2010] :
- Feature Selection - A Very Compact Survey Over the Diversity of Existing Approaches, ÚTIA AV ČR, v.v.i (Praha, 2010) [2010] :
- The Problem of Fragile Feature Subset Preference in Feature Selection Methods and A Proposal of Algorithmic Workaround, Proc. 2010 Int. Conf. on Pattern Recognition, p. 4396-4399 Download [2010] :
- Digital Image Forgery Detection by Local Statistical Models, 2010 Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, p. 579-582, Eds: Echizen Isao, Pan Jeng-Shyang, Fellner Dieter, Nouak Alexander, Kuijper Arjan, Jain Lakhmi C. Download DOI: 10.1109/IIHMSP.2010.147 [2010] :
- Improving Sequential Feature Selection Methods Performance by Means of Hybridization, Proc. 6th IASTED Int. Conf. on Advances in Computer Science and Engineering Download [2010] :
- On the Over-Fitting Problem of Complex Feature Selection Methods, Proc. 5th International Computer Engineering Conference - A better Information Society Through the e@, p. 12-17 Download [2009] :
- A New Measure of Feature Selection Algorithms’ Stability, ICDMW '09: Proceedings of the 2009 IEEE International Conference on Data Mining Workshops, p. 382-387 [2009] :
- Criteria Ensembles in Feature Selection, Multiple Classifier Systems, LNCS 5519, p. 304-313, Eds: Benediktsson J.A., Kittler J., Roli F. [2009] :
- Evaluating Stability of Single and Multiple Feature Selectors that Optimize Feature Subset Cardinality, ÚTIA AV ČR (Praha, 2009) [2009] :
- Structural Poisson Mixtures for Classification of Documents, Proceedings of the 19th International Conference on Pattern Recognition, p. 1324-1327 Download [2008] :
- Dynamic Oscillating Search Algorithm for Feature Selection, ICPR 2008 Proceedings (Int. Conf. on Pattern Recognition), p. 2308-2311 Download [2008] :
- Texture Oriented Image Inpainting based on Local Statistical Model, Proc. 10th IASTED Conf. on Signal & Image Processing, SIP 2008, p. 15-20 Download [2008] :
- Facial expression recognition using angle-related information from facial meshes, Proc. 16th European Signal Processing Conference (EUSIPCO-2008), p. 1-5 Download [2008] :
- Diagnostic Enhancement of Screening Mammograms by Means of Local Texture Models, ÚTIA AV ČR (Praha, 2008) [2008] :
- Are Better Feature Selection Methods Actually Better? Discussion, Reasoning and Examples, Proceedings of the International Joint Conference on Biomedical Engineering Systems and Technologies - BIOSTEC 2008, p. 246-253 Download [2008] :
- Does It Make Sense to Develop New Feature Selection Methods?, ÚTIA AV ČR (Praha, 2007) [2007] :
- Methodology of selecting the most informative variables for decision-making problems of classification type, Proceedings 6th Int. Conf. on Information and Management Sciences, p. 1-18, Eds: Lee Tien Sheng, Liu Yankui, Zhao Xiande Download [2007] :
- Selection of Most Informative Variables in Statistical Pattern Recognition, UWB (Plzeň, 2007) [2007] :
- Model-Based Visual Inspection, UWB (Plzeň, 2007) [2007] :
- Statistical Analysis of Medical Images and Its Possible Impact on Medical Practice, International Conference Efficiency, Quality and Consumer Satisfaction in Healthcare and Welfare, p. 1-1 [2007] :
- Introduction to Statistical Pattern Recognition, MATEO - The European Network of Mechatronics Centres and Industrial Controllers 2006, p. 163-170 [2006] :
- Pattern Recognition Based on Multidimensional Models, MATEO - The European Network of Mechatronics Centres and Industrial Controllers 2006, p. 80-86 [2006] :
- Advances in Feature Selection Methodology: an Overview of Recent ÚTIA Results, ÚTIA AV ČR (Praha, 2006) [2006] :
- A subspace approach to texture modelling by using Gaussian mixtures, Proceedings of the 18th Conference on Pattern Recognition. ICPR 2006, p. 235-238, Eds: Haralic B., Ho T. K. [2006] :
- Multi-Subset Selection for Keyword Extraction and Other Prototype Search Tasks Using Feature Selection Algorithms, Proceedings of ICPR 2006 - The 18th International Conference on Pattern Recognition, p. 736-739, Eds: Tang Y.Y., Wang S.P., Yeung D.S., Yan H., Lorette G. Download [2006] :
- Texture Synthesis on Surfaces, ÚTIA AV ČR (Praha, 2005) [2005] :
- Texturing Library - Reference Manual, ÚTIA AV ČR (Praha, 2005) [2005] :
- Novel Path Search Algorithm for Image Stitching and Advanced Texture Tiling, ÚTIA AV ČR (Praha, 2005) [2005] :
- Current feature selection techniques in pattern recognition, Proceedings of the 4th International Conference on Computer Recognition Systems, p. 53-68, Eds: Kurzynski M., Puchala E., Wozniak M., Springer (Heidelberg, 2005) [2005] :
- Novel path search algorithm for image stitching and advanced texture tiling, WSCG'2005. 13th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2005. Proceedings, p. 155-162, Západočeská univerzita (Plzeň, 2005) Download [2005] :
- An overview of feature selection techniques in statistical pattern recognition, Proceedings of the Sixteenth Annual Symposium of the Pattern Recognition Association of South Africa, p. 1-14 [2005] :
- A statistical approach to local evaluation of a single texture image, Proceedings of the Sixteenth Annual Symposium of the Pattern Recognition Association of South Africa, p. 171-176 [2005] :
- Texture Tiling and Patching with a Novel Path Search Algorithm for Quick and Realistic Texture Synthesis, ÚTIA AV ČR (Praha, 2004) [2004] :
- RealReflect Library - Reference Manual, ÚTIA AV ČR (Praha, 2004) [2004] :
- Advances in BTF Modelling, ÚTIA AV ČR (Praha, 2004) [2004] :
- Specification and Prototype Description of Texture Mapping and Synthesis, ÚTIA AV ČR (Praha, 2004) [2004] :
- Information analysis of census data by using statistical models, Proceedings of the International Conference on Statistics - Investment in the Future, p. 1-7 [2004] :
- On prediction mechanisms in Fast Branch & Bound algorithms, Structural, Syntactic, and Statistical Pattern Recognition. Joint IAPR International Workshops SSPR 2004 and SPR 2004. Proceedings, p. 716-724 Download [2004] :
- A Gaussian mixture-based colour texture model, Proceedings of the 17th IAPR International Conference on Pattern Recognition, p. 177-180 [2004] :
- BTF Parametric Database, ÚTIA AV ČR (Praha, 2003) [2003] :
- Probabilistic neural network playing a simple game, Artificial Neural Networks in Pattern Recognition. Proceedings, p. 132-138, Eds: Marinai S., Gori M., University of Florence (Florence, 2003) [2003] :
- Specification and Prototype Description of BTF Database and Model, ÚTIA AV ČR (Praha, 2003) [2003] :
- Specification and Prototype Description of Texture Model, ÚTIA AV ČR (Praha, 2002) [2002] :
- Boosting in probabilistic neural networks, Proceedings of the 16th International Conference on Pattern Recognition, p. 136-139, Eds: Kasturi R., Laurendeau D., Suen C., IEEE Computer Society (Los Alamitos, 2002) Download [2002] :
- Branch & Bound algorithm with partial prediction for use with recursive and non-recursive criterion forms, Advances in Pattern Recognition - ICAPR 2001. Proceedings, p. 425-434, Eds: Singh S., Murshed N., Kropatsch W., Springer (Heidelberg, 2001) Download [2001] :
- Recent advances in methodology of feature selection for statistical pattern recognition, Computer Data Analysis and Modeling. Proceedings of the Sixth International Conference, p. 163-171, Eds: Aivazian S., Kharin Y., Rieder H., Belarussian State University (Minsk, 2001) [2001] :
- Feature selection toolbox as a multi-purpose tool in pattern recognition, Pattern Recognition in Information Systems. Proceedings of the 1st International Workshop on Pattern Recognition in Information Systems, p. 91-102, Eds: Fred A., Jain A. K., ICEIS Press (Setúbal, 2001) Download [2001] :
- Advances in statistical feature selection, Advances in Pattern Recognition - ICAPR 2001. Proceedings, p. 425-434, Eds: Singh S., Murshed N., Kropatsch W., Springer (Heidelberg, 2001) [2001] :
- Information analysis of multiple classifier fusion, Multiple Classifier Systems, p. 168-177, Eds: Kittler J., Roli F., Springer (Berlin, 2001) [2001] :
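- Algoritmy a programová realizace řešení problémů redukce vysoké dimenzionality vstupních dat ve statistickém rozpoznávání obrazů [Algorithms and software implementation for solving problems of high-dimensionality reduction of input data in statistical pattern recognition]. Doctoral dissertation (2000) [2000] :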
- Comparison of classifier-specific feature selection algorithms, Advances in Pattern Recognition, p. 677-686, Eds: Ferri J., Inesta M. J., Amin A., Pudil P., Springer (Berlin, 2000) [2000] :
- Fast Branch & Bound algorithm in feature selection, Proceedings of SCI 2000. The 4th World Multiconference on Systemics, Cybernetics and Informatics, p. 646-651, Eds: Sanchez B., Pineda M. J., Wolfmann J., IIIS (Orlando, 2000) Download [2000] :
- Oscillating search algorithms for feature selection, Proceedings of the 15th International Conference on Pattern Recognition, p. 406-409, Eds: Sanfeliu A., Villanueva J. J., Vanrell M., IEEE Computer Society (Los Alamitos, 2000) Download [2000] :
- Conventional and evolutionary feature selection of SAR data using a filter approach, Proceedings of SCI 2000. The 4th World Multiconference on Systemics, Cybernetics and Informatics, p. 427-433, IIIS (Orlando, 2000) [2000] :
- Improving statistical measures of feature subsets by conventional and evolutionary approaches, Advances in Pattern Recognition. Proceedings, p. 77-86, Eds: Ferri J. F., Inesta M. J., Amin A., Pudil P., Springer (Berlin, 2000) [2000] :
- Multivariate structural Bernoulli mixtures for recognition of handwritten numerals, Proceedings of the 15th International Conference on Pattern Recognition, p. 585-589, Eds: Sanfeliu A., Villanueva J. J., Vanrell M., IEEE Computer Society (Los Alamitos, 2000) [2000] :
- Combining multiple classifiers in probabilistic neural networks, Multiple Classifier Systems, p. 157-166, Eds: Kittler J., Roli F., Springer (Berlin, 2000) [2000] :
- Recognition of handwritten numerals by structural probabilistic neural networks, Proceedings of the Second ICSC Symposium on Neural Computation, p. 528-534, Eds: Bothe H., Rojas R., ICSC (Wetaskiwin, 2000) Download [2000] :
- Road sign classification using the Laplace kernel classifier, Proceedings of the 11th Scandinavian Conference on Image Analysis, p. 275-282, Eds: Bjarne K. E., Peter J., Pattern Recognition Society Denmark (Lyngby, 1999) [1999] :
- Knowledge based feature selection in statistical pattern recognition, International Symposium on Pattern Recognition. "In Memoriam Pierre Devijver", p. 129-134, Royal Military Academy (Brussels, 1999) [1999] :
- Feature selection expert - user oriented approach. Methodology and concept of the system, Advances in Pattern Recognition. Proceedings, p. 573-582, Eds: Amin A., Dori D., Pudil P., Freeman H., Springer (Berlin, 1998) [1998] :
- Initializing normal mixtures of densities, Proceedings of the 14th International Conference on Pattern Recognition, p. 886-890, Eds: Jain A. K., Venkatesh S., Lovell B. C., IEEE (Los Alamitos, 1998) [1998] :
- Vize a koncepce výstupu z projektu "Multidisciplinární přístupy k podpoře rozhodování v ekonomii a managementu" [Vision and concept of the output of the project "Multidisciplinary approaches to decision support in economics and management"], Multidisciplinární přístupy k podpoře rozhodování v ekonomii a managementu. Workshop '97 - Grant VS 96063, p. 41-49, FM JU (Jindřichův Hradec, 1997) [1997] :
- Different approaches to initialization of the EM algorithm for use in Gaussian mixture modelling methods, Multidisciplinární přístupy k podpoře rozhodování v ekonomii a managementu. Workshop '97 grantu VS 96063, p. 85-91, Fakulta managementu JU (Jindřichův Hradec, 1997) [1997] :
- User oriented approach to feature selection in statistical pattern recognition, Multidisciplinární přístupy k podpoře rozhodování v ekonomii a managementu. Workshop '97 grantu VS 96063, p. 15-26, Fakulta managementu JU (Jindřichův Hradec, 1997) [1997] :
- Conceptual base of feature selection consulting system, Proceedings of the 1st IAPR TC1 Workshop on Statistical Techniques in Pattern Recognition, p. 125-134, Eds: Pudil P., Novovičová J., Grim J., ÚTIA AV ČR (Praha, 1997) [1997] :
- Feature selection in statistical pattern recognition via modified model with latent structure, Computer-Intensive Methods in Control and Signal Processing. Preprints of the 2nd European IEEE Workshop CMP'96, p. 217-222, Eds: Berec L., Rojíček J., Kárný M., Warwick K., ÚTIA AV ČR (Praha, 1996) [1996] :