Noisy-Channel Model for Feature Extraction

, Ph.D. Dissertation Sharif University of Technology Hafez Kolahi, Hassan (Author) ; Kasaei, Shohreh (Supervisor) ; Soleymani Baghshah, Mahdieh (Co-Supervisor)

Abstract

One of the approaches used in learning theory is using information theoretic tools. The general idea of this approach is that if we show the algorithm did not memorize the dataset, we could guarantee generalization. Noisy channel model is one of the important concepts in this approach. A noisy channel is a lossy process which maps the data to a compressed format.There are two ways to use noisy channel model in literature: input compression and model compression. One of the main results of this thesis is to show that the input compression methods can not explain the generalization of algorithms (despite previous belief). On the contrast by fixing some of the problems faced in the model...

Deepfake Videos Detection through Deep Analysis of Artifacts of Images

, M.Sc. Thesis Sharif University of Technology Aghababaei Harandi, Ali (Author) ; Ghaemmaghami, Shahrokh (Supervisor) ; Eghlidos, Taraneh (Supervisor)

Abstract

DeepFake is a type of forgery that uses deep learning algorithms to make changes to audio and video content that the audience is unable to detect. Nowadays, due to the threats posed by the use of DeepFake to move people's faces in video, researchers' attention has been drawn to designing methods to detect this type of forgery. Detection methods are usually classified into two types. The first case is the extraction of features to detect forgery distortions, for example, the extraction of facial orientations to detect inconsistencies. The second case is the use of deep learning networks for feature extraction and classification, of which the EfficientNet network is an example. Despite the...

Mathematical Foundations of Deep Learning: a Theoretical Framework for Generalization

, M.Sc. Thesis Sharif University of Technology Babaie, Anahita (Author) ; Alishahi, Kasra (Supervisor) ; Hadji Mirsadeghi, Mir Omid (Co-Supervisor)

Abstract

Deep Neural Networks, are predictive models in Machine Learning, that during the last decade they've had a great success. However being in an over-parametrized and highly non-convex regime, the analytical examinations of these models is quite a challenging task to do. The empirical developments of Neural Networks, and their distinguishing performance in prediction problems, has motivated researchers, to formalize a theoretical foundations for these models and provide us with a framework, in which one can explain and justify their behavior and properties. this framework is of great importance because it would help us to come to a better understanding of how these models work and also enables...

A Generalization method for Video Deepfake Detection

, M.Sc. Thesis Sharif University of Technology Abolhassani, Elham (Author) ; Rabiee, Hamid Reza (Supervisor)

Abstract

The rapid advance of deepfake generation models has led to the rise of highly realistic synthetic content with manipulation of either or both visual and audio modalities. Although these techniques have opened new frontiers in media production and applications such as video games, animations, and virtual reality, malicious use can pose significant threats to privacy, security, and public trust. Therefore, researchers have developed numerous techniques to detect deepfakes. While these models have performed well on test data from the same distribution as their training data, their performance often drops significantly on unseen data from different distributions. Therefore, generalization is a...

Analysis and Improvement of Privacy-Preserving Federated Learning

, M.Sc. Thesis Sharif University of Technology Rahmani, Fatemeh (Author) ; Jafari Sivoshani, Mahdi (Supervisor) ; Rohban, Mohammad Hossein (Supervisor)

Abstract

Membership inference attacks are one of the most important privacy-violating attacks in machine learning, as well as infrastructure of more serious attacks such as data extraction attacks. Since membership inference attack is used as a measure to evaluate the level of privacy protection of machine learning models, different researches have investigated and provided new methods for this attack. However, the accuracy of these attacks has not been investigated on models trained with the latest techniques such as data augmentation and regularization techniques. In this research, we see that the Lira attack, the latest membership inference attack, which has much more power compared to previous...

Many-Class Few-Shot Classification

, M.Sc. Thesis Sharif University of Technology Fereydooni, Mohammad Reza (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)

Abstract

Few-shot learning methods have achieved notable performance in recent years. However, fewshot learning in large-scale settings with hundreds of classes is still challenging. In this dissertation, we tackle the problems of large-scale few-shot learning by taking advantage of pre-trained foundation models. We recast the original problem in two levels with different granularity. At the coarse-grained level, we introduce a novel object recognition approach with robustness to sub-population shifts. At the fine-grained level, generative experts are designed for few-shot learning, specialized for different superclasses. A Bayesian schema is considered to combine coarse-grained information with...

A Survey on Empirical Theory of Deep Learning

, M.Sc. Thesis Sharif University of Technology Motesharei, Erfan (Author) ; Foroughmand Araabi, Mohammad Hadi (Supervisor)

Abstract

The aim of this thesis is to review the theory of deep learning with an experimental approach. In this thesis, we review researches that examine the impact of input selection on outputs in deep learning systems; Inputs we can control (samples, architecture, model size, optimizer, etc.) and outputs we can observe (the performance of the neural network, its test error, its parameters, etc.). Among the reviewed cases are the generalizability of deep learning systems, the effect of model components on its accuracy, interpolation and hyperparameters, as well as new phenomena in this field for which new frameworks have been defined