Loading...
- Type of Document: M.Sc. Thesis
- Language: Farsi
- Document No: 57678 (19)
- University: Sharif University of Technology
- Department: Computer Engineering
- Advisor(s): Rabiee, Hamid Reza
- Abstract:
- The rapid advance of deepfake generation models has led to the rise of highly realistic synthetic content with manipulation of either or both visual and audio modalities. Although these techniques have opened new frontiers in media production and applications such as video games, animations, and virtual reality, malicious use can pose significant threats to privacy, security, and public trust. Therefore, researchers have developed numerous techniques to detect deepfakes. While these models have performed well on test data from the same distribution as their training data, their performance often drops significantly on unseen data from different distributions. Therefore, generalization is a major challenge in this field. To tackle this issue, we propose a generalized ensemble audio-visual deepfake detection (EAV-DF) model plus a domain adaptation mechanism using a teacher-student architecture to improve the performance and generalization of the model on new domains. Therefore, this work uses the FakeAVCeleb dataset for training models and the DFDC and TIMIT-Deepfake datasets as new domains. Our experiments demonstrate that the proposed framework outperforms many state-of-the-art methods and yields a dynamic deepfake detection model that can adapt to new domain data and interpret the manipulated modality
- Keywords:
- Deep Fake ; Ensemble Learning ; Domain Adaptation ; Generalization ; Teacher-Student Structure ; Multimodal Model ; Deepfake Detection
-
محتواي کتاب
- view