Efficient face detection and replacement in the creation of simple fake videos

Sheremet, Oleksii I.; Шеремет, Олексій Іванович; Sadovoi, Oleksandr V.; Садовой, Олександр Валентинович; Harshanov, Denys V.; Гаршанов, Денис Володимирович; Kovalchuk, Oleh S.; Ковальчук, Олег Степанович; Sheremet, Kateryna S.; Шеремет, Катерина Сергіївна; Sokhina, Yuliia V.; Сохіна, Юлія Віталіївна

eONPUIR
→
1. Періодичні видання національного університету "Одеська політехніка"
→
Applied Aspects of Information Technology = Прикладні аспекти інформаційних технологій
→
2023, Vol. 6, № 3
→
Посмотреть элемент

dc.contributor.author	Sheremet, Oleksii I.
dc.contributor.author	Шеремет, Олексій Іванович
dc.contributor.author	Sadovoi, Oleksandr V.
dc.contributor.author	Садовой, Олександр Валентинович
dc.contributor.author	Harshanov, Denys V.
dc.contributor.author	Гаршанов, Денис Володимирович
dc.contributor.author	Kovalchuk, Oleh S.
dc.contributor.author	Ковальчук, Олег Степанович
dc.contributor.author	Sheremet, Kateryna S.
dc.contributor.author	Шеремет, Катерина Сергіївна
dc.contributor.author	Sokhina, Yuliia V.
dc.contributor.author	Сохіна, Юлія Віталіївна
dc.date.accessioned	2023-10-02T22:34:37Z
dc.date.available	2023-10-02T22:34:37Z
dc.date.issued	2023-09-14
dc.identifier.issn	2617-4316
dc.identifier.issn	2663-7723
dc.identifier.uri	http://dspace.opu.ua/jspui/handle/123456789/14086
dc.description.abstract	Face detection and facial recognition technologies are among the most intensively studied topics within the field of computer vision, owing to their vast application potential across a multitude of industries. These technologies have demonstrated practical applicability in varied contexts such as identifying suspicious individuals in crowded urban spaces, real-time recognition of smartphone owners, creating compelling deepfakes for entertainment applications, and specialized applications that modify the movements of facial features such as the lips or eyes. With the current state-of-the-art advancements in hardware and software technology, today's technological infrastructure provides more resources than are necessary for video streaming. As a result, simple face recognition systems can be implemented without the need for high-cost server instances that require specified pre-trained models. This abundance of resources is changing the landscape of face recognition, and the discussion within this paper will revolve around these emerging paradigms. The primary focus of this article is an in-depth analysis of the key concepts of face detection in streaming video data using prominent pre-trained models. The models under discussion include HRNet, RetinaFace, Dlib, MediaPipe, and KeyPoint R-CNN. Each of these models has its strengths and weaknesses, and the article discusses these attributes in the context of real-world case studies. This discussion provides valuable insights into the practical applications of these models and the trade-offs involved in their utilization. Moreover, this paper presents a comprehensive overview of image transformation techniques. It introduces an abstract method for affine image transformation, an important technique in image processing that changes the geometric properties of an image without affecting its pixel intensity. Additionally, the article discusses image transformation operations executed through the OpenCV library, one of the leading libraries in the field of computer vision, providing a highly flexible and efficient toolset for image manipulation. The culmination of this research is presented as a practical standalone system for image replacement in video. This system leverages the RetinaFace model for inference and employs OpenCV for affine transformations, demonstrating the concepts and technologies discussed in the paper. The work outlined in this article thereby advances the field of face detection and recognition, presenting an innovative approach that makes full use of contemporary hardware and software advances.	en
dc.description.abstract	Технології виявлення та розпізнавання обличь є одними з найбільш інтенсивно досліджуваних тем у галузі комп’ютерного зору завдяки їх величезному потенціалу застосування в багатьох галузях. Ці технології продемонстрували практичне застосування в різних контекстах, таких як виявлення підозрілих осіб у багатолюдних міських просторах, розпізнавання власників смартфонів у реальному часі, створення переконливих діпфейків для розважальних додатків і спеціалізованих програм, які змінюють рухи рис обличчя, наприклад губ або очей. Завдяки сучасним досягненням апаратного та програмного забезпечення, сучасна технологічна інфраструктура надає більше ресурсів, ніж необхідно для потокового відео. У результаті прості системи розпізнавання обличь можуть бути реалізовані без використання дорогих серверів, які вимагають певних попередньо навчених моделей. Така велика кількість ресурсів змінює ландшафт розпізнавання обличь, і дискусія в даній статті обертається навколо цих нових парадигм. Основна увага в цій статті – поглиблений аналіз ключових концепцій детектування обличчя в потокових відеоданих за допомогою відомих попередньо навчених моделей. Обговорювані моделі включають HRNet, RetinaFace, Dlib, MediaPipe і KeyPoint R-CNN. Кожна з цих моделей має свої сильні та слабкі сторони, і дана стаття розглядає ці атрибути в контексті практичних прикладів із реального світу. Такий розгляд дає цінну інформацію про практичне застосування цих моделей і компроміси, пов’язані з їх використанням. Крім того, стаття представляє вичерпний огляд методів трансформації зображення. Представлено абстрактний метод афінного перетворення зображення, важливу техніку обробки зображень, яка змінює геометричні властивості зображення, не впливаючи на інтенсивність його пікселів. Крім того, у статті розглядаються операції перетворення зображень, які виконуються за допомогою бібліотеки OpenCV, однієї з провідних бібліотек у галузі комп’ютерного зору, що забезпечує дуже гнучкий і ефективний набір інструментів для маніпулювання зображеннями. Кульмінацією цього дослідження є практична автономна система для заміни зображення у відео. Ця система використовує модель RetinaFace для здійснення висновків і використовує OpenCV для афінних перетворень, демонструючи концепції та технології, які обговорюються в статті. Таким чином, проведена робот просуває сферу виявлення та розпізнавання обличь, представляючи інноваційний підхід, який повною мірою використовує сучасні апаратні та програмні досягнення.	en
dc.language.iso	en	en
dc.publisher	Odessа Polytechnic National University	en
dc.subject	Deepfake	en
dc.subject	affine transformation	en
dc.subject	face detection	en
dc.subject	video processing	en
dc.subject	alpha channel	en
dc.subject	binary masks	en
dc.subject	діпфейк	en
dc.subject	афінна трансформація	en
dc.subject	виявлення обличчя	en
dc.subject	відео обробка	en
dc.subject	альфа-канал	en
dc.subject	бінарні маски	en
dc.title	Efficient face detection and replacement in the creation of simple fake videos	en
dc.title.alternative	Ефективне детектування і заміна облич при створенні простого фейкового відео	en
dc.type	Article	en
opu.citation.journal	Applied Aspects of Information Technology	en
opu.citation.volume	3	en
opu.citation.firstpage	286	en
opu.citation.lastpage	303	en
opu.citation.issue	6	en