Tomas Kerepecky

2019-2025
Department of Mathematics, Czech Technical University in Prague, Czechia

PhD degree in Mathematical Engineering. PhD thesis on Inverse Problems in Image Restoration.
2021-2022
Washington University in St. Louis, Missouri, USA

Fulbright-Masaryk Award
2018-now
TCM International Institute, Vienna, Austria.

MA student in Practical Theology with a concentration in Leadership Studies.
2015-2019
Department of Physical Electronics, Czech Technical University in Prague, Czechia

M.Sc. degree in Computational Physics. Master's thesis on Inverse Compton scattering by laser-accelerated electrons.

Multidisciplinary leader at the intersection of technology, AI, education, and theology, researching to innovate and inspire.

Implicit Neural Representation for Image Demosaicking

2025 Digital Signal Processing: A Review Journal

We propose a novel approach to enhance image demosaicking algorithms using implicit neural representations (INR). Our method employs a multi-layer perceptron to encode RGB images, combining original Bayer measurements with an initial estimate from existing demosaicking methods to achieve superior reconstructions. A key innovation is the integration of two loss functions: a Bayer loss for fidelity to sensor data and a complementary loss that regularizes reconstruction using interpolated data from the initial estimate. This combination, along with INR's inherent ability to capture fine details, enables high-fidelity reconstructions that incorporate information from both sources. Furthermore, we demonstrate that INR can effectively correct artifacts in state-of-the-art demosaicking methods when input data diverge from the training distribution, such as in cases of noise or blur. This adaptability highlights the transformative potential of INR-based demosaicking, offering a robust solution to this challenging problem.

Read the article
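
As a rough illustration of the two-term objective described in the abstract above, the following NumPy sketch evaluates a Bayer loss on the measured samples and a complementary loss on the remaining samples against the initial estimate. The RGGB layout, the mean-squared-error form, the `weight` parameter, and all function names are assumptions made for this example and are not taken from the article. The multi-layer perceptron that would produce `pred_rgb` is omitted.

```python
import numpy as np


def bayer_mask(height, width):
    """Binary (H, W, 3) mask of measured samples, assuming an RGGB layout."""
    mask = np.zeros((height, width, 3), dtype=np.float64)
    mask[0::2, 0::2, 0] = 1.0  # red: even rows, even columns
    mask[0::2, 1::2, 1] = 1.0  # green: even rows, odd columns
    mask[1::2, 0::2, 1] = 1.0  # green: odd rows, even columns
    mask[1::2, 1::2, 2] = 1.0  # blue: odd rows, odd columns
    return mask


def combined_loss(pred_rgb, bayer_rgb, init_rgb, weight=0.1):
    """Return (total, bayer_loss, complementary_loss).

    pred_rgb  : current reconstruction, shape (H, W, 3)
    bayer_rgb : sensor data placed in its colour channel, zeros elsewhere
    init_rgb  : initial estimate from an existing demosaicking method
    weight    : hypothetical balance between the two terms
    """
    mask = bayer_mask(*pred_rgb.shape[:2])
    # Fidelity to sensor data: error only where a sample was measured.
    bayer_loss = np.sum(mask * (pred_rgb - bayer_rgb) ** 2) / np.sum(mask)
    # Regularization: error against the initial estimate at the
    # unmeasured (interpolated) positions. This split is an assumption.
    comp = 1.0 - mask
    comp_loss = np.sum(comp * (pred_rgb - init_rgb) ** 2) / np.sum(comp)
    return bayer_loss + weight * comp_loss, bayer_loss, comp_loss
```

In a training loop, `pred_rgb` would be the network output on the pixel grid and the total would be minimized by gradient descent; an automatic-differentiation framework would replace NumPy in that setting.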

Automated Actor Recognition in Video Content

2024 Chapter in Book: Data Science in Applications: Towards AI-driven Approaches.

This chapter presents an AI pipeline designed for automated recognition and analysis of actors in video content. The pipeline incorporates advanced methodologies in computer vision, allowing for a comprehensive analysis of actor presence and screen time across various video formats, such as movies, television shows, and surveillance footage. To evaluate the pipeline's performance, we conducted extensive experiments using carefully annotated test videos from a Czech TV show available for download. The evaluation criteria focus on precision, recall, and mean absolute error metrics for actor recognition and screen time calculation under varying conditions. Additionally, we discuss challenges encountered during the pipeline development and consider its potential implications for the future of AI-driven content analysis and security surveillance.

Paper accepted, link soon

Inverse Problems in Image Restoration

2024 Chapter in Book: Extended Abstracts IPMS 2024 conference - "Inverse Problems: Modeling and Simulation"

This work addresses inverse problems in image restoration, focusing on recovering high-quality images from degraded observations, a critical task in fields like microscopy and digital photography. We examine both traditional variational methods and modern deep learning techniques, highlighting hybrid approaches that merge mathematical modeling with data-driven learning. Classical model-based methods use explicit regularization, like total variation, to incorporate prior knowledge and stabilize the inversion process. Meanwhile, deep learning approaches, both supervised and self-supervised, leverage implicit regularization, where network architectures capture and learn prior information from data. We present our recent advancements in this field and discuss the effectiveness of these complementary approaches in solving complex image restoration problems in theory and practice.

Paper accepted, link soon

STAR: Screen Time and Actor Recognition in Video Content

2024 The German Conference on Pattern Recognition (GCPR)

Accurately measuring the duration of actors' presence in videos is a challenging task that goes beyond actor recognition. We propose the STAR pipeline, a new model designed to analyze the time performers appear on screen across diverse video content, including movies and TV shows. The proposed model has been successfully deployed and tested by the Czech TV infrastructure provider. Our pipeline uses machine learning techniques for shot detection, face detection, tracking, and recognition, and introduces a novel shot-based method for calculating screen time. We present extensive experiments demonstrating the robustness and real-time performance of our approach. Alongside the pipeline, we introduce the STAR dataset to address the need for high-quality benchmarks in evaluating screen time models, now available for download.

Paper accepted, link soon

3D Non-separable Moment Invariants and Their Use in Neural Networks

2024 SN Computer Science Journal

Recognition of 3D objects is an important task in many bio-medical and industrial applications. The recognition algorithms should work regardless of the particular orientation of the object in space. In this paper, we introduce new 3D rotation moment invariants, which are composed of non-separable Appell moments. We show that non-separable moments may outperform the separable ones in terms of recognition power and robustness thanks to a better distribution of their zero surfaces over the image space. We test the numerical properties and discrimination power of the proposed invariants on three real datasets: MRI images of the human brain, 3D scans of statues, and confocal microscope images of worms. We show that the robustness to resampling errors improved more than twofold and the recognition rate increased by 2-10 % compared to the most common descriptors. In the last section, we show how these invariants can be used in state-of-the-art neural networks for image recognition. The proposed H-NeXtA architecture improved the recognition rate by 2-5 % over the current networks.

Read the article

NeRD: Neural field-based Demosaicking

2023 IEEE International Conference on Image Processing (ICIP)

We introduce NeRD, a new demosaicking method for generating full-color images from Bayer patterns. Our approach leverages advancements in neural fields to perform demosaicking by representing an image as a coordinate-based neural network with sine activation functions. The inputs to the network are spatial coordinates and a low-resolution Bayer pattern, while the outputs are the corresponding RGB values. An encoder network, which is a blend of ResNet and U-net, enhances the implicit neural representation of the image to improve its quality and ensure spatial consistency through prior learning. Our experimental results demonstrate that NeRD outperforms traditional and state-of-the-art CNN-based methods and significantly closes the gap to transformer-based methods.

Read the article
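
To make the idea of a coordinate-based network with sine activations more concrete, here is a minimal NumPy sketch of a forward pass that maps pixel coordinates to RGB values. It is not the NeRD architecture: the encoder and the conditioning on the Bayer pattern are left out, and the layer sizes, the simplified uniform initialization, the frequency factor `omega` (30 is the value commonly used for sine-activated networks), and the function names are all assumptions for illustration.

```python
import numpy as np


def make_coords(height, width):
    """Pixel-centre coordinates normalized to [-1, 1], shape (H*W, 2)."""
    ys = np.linspace(-1.0, 1.0, height)
    xs = np.linspace(-1.0, 1.0, width)
    grid_y, grid_x = np.meshgrid(ys, xs, indexing="ij")
    return np.stack([grid_x.ravel(), grid_y.ravel()], axis=1)


def init_layers(sizes, rng):
    """Random weights and zero biases for consecutive layer sizes."""
    layers = []
    for fan_in, fan_out in zip(sizes[:-1], sizes[1:]):
        bound = 1.0 / np.sqrt(fan_in)  # simplified init, an assumption
        weights = rng.uniform(-bound, bound, size=(fan_in, fan_out))
        layers.append((weights, np.zeros(fan_out)))
    return layers


def forward(coords, layers, omega=30.0):
    """Sine activation on hidden layers, linear output layer."""
    hidden = coords
    for weights, bias in layers[:-1]:
        hidden = np.sin(omega * (hidden @ weights + bias))
    weights, bias = layers[-1]
    return hidden @ weights + bias  # one RGB triple per coordinate


# Usage: an untrained network evaluated on a 4 x 6 pixel grid.
rng = np.random.default_rng(0)
coords = make_coords(4, 6)
rgb = forward(coords, init_layers([2, 16, 16, 3], rng))
```

Because the image is a function of continuous coordinates, the same network can be queried at any resolution by passing a denser grid to `forward`.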

Real-Time Wheel Detection and Rim Classification in Automotive Production

2023 IEEE International Conference on Image Processing (ICIP)

This paper proposes a novel approach to real-time automatic rim detection, classification, and inspection by combining traditional computer vision and deep learning techniques. At the end of every automotive assembly line, a quality control process is carried out to identify any potential defects in the produced cars. Common yet hazardous defects are related, for example, to incorrectly mounted rims. Routine inspections are mostly conducted by human workers that are negatively affected by factors such as fatigue or distraction. We have designed a new prototype to validate whether all four wheels on a single car match in size and type. Additionally, we present three comprehensive open-source databases, CWD1500, WHEEL22, and RB600, for wheel, rim, and bolt detection, as well as rim classification, which are free-to-use for scientific purposes.

Read the article

Dual-Cycle: Self-Supervised Dual-View Fluorescence Microscopy Image Reconstruction using CycleGAN

2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Three-dimensional fluorescence microscopy often suffers from anisotropy, where the resolution along the axial direction is lower than that within the lateral imaging plane. We address this issue by presenting Dual-Cycle, a new framework for joint deconvolution and fusion of dual-view fluorescence images. Inspired by the recent Neuroclear method, Dual-Cycle is designed as a cycle-consistent generative network trained in a self-supervised fashion by combining a dual-view generator and prior-guided degradation model. We validate Dual-Cycle on both synthetic and real data showing its state-of-the-art performance without any external training data.

Read the article

D3Net: Joint Demosaicking, Deblurring and Deringing

2020 IEEE International Conference on Pattern Recognition (ICPR)

Images acquired with standard digital cameras have Bayer patterns and suffer from lens blur. A demosaicking step is implemented in every digital camera, yet blur often remains unattended due to the computational cost and instability of deblurring algorithms. Linear methods, which are computationally less demanding, produce ringing artifacts in deblurred images. Complex non-linear deblurring methods avoid artifacts; however, their complexity implies offline application after camera demosaicking, which leads to sub-optimal performance. In this work, we propose a joint demosaicking, deblurring, and deringing network with a light-weight architecture inspired by the alternating direction method of multipliers. The proposed network has a transparent and clear interpretation compared to other black-box data-driven approaches. We experimentally validate its superiority over state-of-the-art demosaicking methods with offline deblurring.

Read the article

Iterative Wiener Filtering for Deconvolution with Ringing Artifact Suppression

2019 27th European Signal Processing Conference (EUSIPCO)

Sensor and lens blur degrade images acquired by digital cameras. Simple and fast removal of blur using linear filtering, such as the Wiener filter, produces results that are not acceptable in most cases due to ringing artifacts close to image borders and around edges in the image. More elaborate deconvolution methods with non-smooth regularization, such as total variation, provide superior performance with fewer artifacts, but at the price of increased computational cost. We consider the alternating direction method of multipliers, which is a popular choice for solving such non-smooth convex problems, and show that individual steps of the method can be decomposed into simple filtering and element-wise operations. Filtering is performed with two sets of filters, called restoration and update filters, which are learned for the given type of blur and noise level with two different learning methods. The proposed deconvolution algorithm is implemented in the spatial domain and can be easily extended to include other restoration tasks such as demosaicing and super-resolution. Experiments demonstrate the performance of the algorithm with respect to the size of the learned filters, number of iterations, noise level, and type of blur.

Read the article
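
For context, the classical Wiener filter that the abstract above refers to as the simple linear baseline can be sketched in a few lines of NumPy. This is the textbook frequency-domain form with a constant noise-to-signal ratio, not the learned spatial-domain restoration and update filters proposed in the paper. The circular-convolution boundary model, the `nsr` parameter, and the helper names are assumptions for this example.

```python
import numpy as np


def pad_kernel(kernel, shape):
    """Zero-pad the blur kernel to the image size, centred at the origin."""
    padded = np.zeros(shape, dtype=np.float64)
    kh, kw = kernel.shape
    padded[:kh, :kw] = kernel
    return np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))


def circular_blur(image, kernel):
    """Blur with circular boundary conditions (used to make test data)."""
    transfer = np.fft.fft2(pad_kernel(kernel, image.shape))
    return np.real(np.fft.ifft2(transfer * np.fft.fft2(image)))


def wiener_deconvolve(blurred, kernel, nsr=1e-2):
    """Wiener filter: conj(H) / (|H|^2 + nsr), applied in the Fourier domain."""
    transfer = np.fft.fft2(pad_kernel(kernel, blurred.shape))
    wiener = np.conj(transfer) / (np.abs(transfer) ** 2 + nsr)
    return np.real(np.fft.ifft2(wiener * np.fft.fft2(blurred)))


# Usage: blur a random image with a 3 x 3 box kernel, then restore it.
rng = np.random.default_rng(0)
image = rng.random((16, 16))
kernel = np.ones((3, 3)) / 9.0
blurred = circular_blur(image, kernel)
restored = wiener_deconvolve(blurred, kernel, nsr=1e-8)
```

The example is noise-free and uses the same circular boundary model for blurring and restoration, so the recovery is nearly exact. Real photographs do not satisfy that boundary model and contain noise, which is where the ringing near borders and edges mentioned in the abstract comes from.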

Inverse Compton scattering by laser-accelerated electrons

Czech Technical University in Prague. Computing and Information Centre, 2017

This thesis deals with the study of X- and γ-radiation during the interaction of relativistic electrons with an intense electromagnetic field. This mechanism is called inverse Compton scattering. For the purpose of examining the properties of radiation from inverse Compton scattering, a new code, COCO, has been implemented. The radiation spectrum is computed using the fast Fourier transform of the radiation field of the electrons.

Read the thesis

Tutorial for Machine Learning 2, Czech Technical University in Prague. (2021, 2022, 2023, 2024)

The course covers advanced machine learning methods with a primary focus on deep learning. Apart from the theory of deep learning and optimization of deep neural networks, we present various network architectures and applications.
Lecturers: Doc. Ing. Filip Sroubek, Ph.D., DSc., Prof. Ing. Jan Flusser, DrSc.
Tutorial for Optimization, Washington University in St. Louis, MO, USA. (2022)

Topics include unconstrained and constrained optimization, convex optimization, iterative optimization algorithms, optimality conditions, and duality theory. Algorithms covered include the gradient and accelerated gradient methods, the Newton method, proximal methods, and penalty methods.
Lecturer: Prof. Ulugbek S. Kamilov
Tutorial for Image Processing and Pattern Recognition 1, Czech Technical University in Prague. (2019, 2020)

An introductory course on image processing and pattern recognition. Major attention is paid to image sampling and quantization, image preprocessing (noise removal, contrast stretching, sharpening, deblurring, Wiener filtering, blind deconvolution), edge detection, morphology, and geometric transformations and warping. Numerous applications and experimental results are presented in addition to the theory.
Lecturers: Prof. Ing. Jan Flusser, DrSc., doc. RNDr. Zitova Barbara Ph.D.
Tutorial for Image Processing and Pattern Recognition 2, Czech Technical University in Prague. (2019)

The course is a continuation of Image Processing and Pattern Recognition 1. Major attention is paid to features for shape description and recognition, and to general pattern recognition techniques. Numerous applications and experimental results are presented in addition to the theory.
Lecturers: Prof. Ing. Jan Flusser, DrSc., doc. RNDr. Zitova Barbara Ph.D.
Tutorial for Numerical methods 1, Czech Technical University in Prague. (2018, 2020, 2021)

The course explains the basic principles of numerical mathematics needed to solve problems in physics and technology numerically. The MATLAB integrated computational environment is used as the principal programming language and as a demonstration tool.
Lecturers: Prof. Ing. Jiri Limpouch, CSc. and Ing. Pavel Vachal, Ph.D.

Czech Science Fair 2025 (6/2025)

Presentation on the role of AI in education, exploring practical uses, current trends, and future impact on teaching and learning.
Institute of Physics of the Czech Academy of Sciences (2/2025)

AI presentation for physicists on the practical use of LLMs in research and academia, focusing on text processing, content generation, and data analysis.
PORG School (1/2025)

AI presentation for secondary school teachers - a special lecture on AI progress and prompt engineering, discussing what AI is, how it learns, and the educational risks and opportunities it presents.
Czech Academy of Sciences (12/2024)

AI Presentation for Directors of the Institutes of the Czech Academy of Sciences on AI Progress, Tools, and Effective Communication with AI
CETAV AV CR (6/2024)

AI presentation for employees from The Centre of Administration and Operations of the CAS about AI technology and prompt engineering.
SCS software s.r.o. (3/2024)

AI presentation for software developers - current capabilities of artificial intelligence and prompt engineering.
Institute of Information Theory and Automation, Prague (12/2023)

AI presentation for scientists - how AI is entering our lives (text, audio, image and video in AI)
New PORG School (11/2023)

AI presentation for teachers - examining AI advancements and prompt engineering for educators.
Open Gate School - Science Cafe (11/2023)

AI presentation for students and their parents - exploring the current capabilities of artificial intelligence and prompt engineering.
FTV Prima (10/2023)

AI Workshop for Top Management - understanding the impact and strategic implementation of artificial intelligence in business.
Trnka Dobris Elementary School (5/2023)

AI seminar for students - what is learning, machine learning, artificial intelligence, and how to talk with AI.
Charles University (1/2023)

ML seminar for students - seminar on the basics of neural fields in machine learning.

Co-supervisor to master student at the Czech Technical University in Prague (2023)

Master thesis: Evaluation of advertisement effectiveness using people detection and gaze estimation
Co-supervisor to master student at the Czech Technical University in Prague (2022)

Research project: Detection of people, estimation of age, gender and direction of gaze with and without a mask to evaluate the effectiveness of advertising
Supervisor to bachelor student at the Czech Technical University in Prague (2021)

Bachelor thesis: Convolutional neural network-based human body detection and tracking in the interior

Master student at the Czech Technical University in Prague (2022)

Research project: Comparison of methods for unconstrained face detection
High school student at Stredni prumyslova skola stavebni akademika Stanislava Bechyne, Havlickuv Brod (2022)

Graduation thesis: Graphical application enabling interactive viewing of spatial raster images

Primary and lower secondary education at New Hope Mission School, India. (2017, 2019)

Teaching English, mathematics and science in our elementary school located in one of the poorest states in India, Bihar. We are focusing on improving educational quality. We want to educate and inspire our teachers, too, so that they learn to care about their self-growth.
Physics Teaching at Dolni Brezany Elementary School (2017)

Introductory seminars on the physics of sound and light.

kerepecky@utia.cas.cz +420 266 052 864

Filip Sroubek's Lab,
Department of Image Processing,
Institute of Information Theory and Automation,
Czech Academy of Sciences.


Professional and Research Interests

Image Processing & Computer Vision

Artificial Intelligence

Prompt Engineering

E¹⁰ Leadership

Experiential Education
