Loading...
Search for:
soleymani--mohammad-ali
0.145 seconds
Total 7288 records
Conditional Text Generation with Neural Networks
, M.Sc. Thesis Sharif University of Technology ; Soleymani, Mahdieh (Supervisor)
Abstract
By the improvement of machine learning methods specially the Deep Learning in the last decade, there were expanding usage of these methods in Language Modeling task. As the essence of a language model is more basic, recently huge networks are trained with language model objective but fine-tuned on target tasks such as Question Answering, Sentiment Analysis and etc. which is a promising sign of its importance and usage in even other NLP tasks. However, this task still has severe problems. The Teacher Forcing based methods, suffer from the so-called exposure bias problem which is due to the train/test procedure discrepancy. Some solutions such as using Reinforcement Learning which has high...
Design Of a 3 DOF Robotic Exoskeleton With EMG Based Controller fFor Human Shoulder Joint
, M.Sc. Thesis Sharif University of Technology ; Zohoor, Hassan (Supervisor)
Abstract
Most elderly and physically disabled people suffer from lack of functionality and dexterity in their elbow or wrist. These disabilities are due to the damages mostly caused by sport surgery, spinal surgery, or stroke. Therefore, design of an assistive exoskeleton robot for upper limb movements seems necessary. The purpose of this study is to design, fabricate and deliver a control algorithm for an assistive wearable robot. The robot has five degrees of freedom in order to help the flexion/extension and abduction/adduction shoulder. Dynamic and kinematic model of elbow, forearm, and wrist is developed to determine the amounts of torques which are required in the joint actuators mounted on the...
Synthesis of Composite Coating with HA Nanoparticles on Ti by AC Plasma Electrolyte Oxidation
, M.Sc. Thesis Sharif University of Technology ; Ghorbani, Mohammad (Supervisor)
Abstract
In this project, a ceramic biocompatible coating was applied on Titanium which is included Hydroxyapatite by AC Plasma Electrolyte Oxidation method. First, coating was performed in 5 different solutions and various duration in 500mA/Cm2 as a current density. Then, by microscopic and macroscopic images and size of porosity and also thickness of coat, 3 solutions were chosen for adding nanoparticles: Solution 1 which includes NaH2PO4 and Ca(CH3COO)2 , Solution 3 which includes Ca(CH3COO)2 and Na-Beta G and Solution 4 which includes Ca(H2PO4)2, HMP, NA2(EDTA) and Ca(CH3COO)2 . Also the time 10 minutes was chose as a appropriate time. The average of the porosities size in solution 1 was about...
Video Captioning using Deep Recurrent Neural Networks
, M.Sc. Thesis Sharif University of Technology ; Soleymani, Mahdieh (Supervisor)
Abstract
Solving the visual symbol grounding problem has long been a goal of modern aritificial intelligence. Due to recent breakthroughs in deep learning methods for natural language processing and visual interpretation tasks‚ the field now seems to be as near to achieving this goal as it ever was. Also recent progress in using recurrent neural netowrks (RNNs) for image description‚ has motivated the exploration of their application for video description tasks. However, while images remain static‚ interpreting videos require modeling complex dynamic temporal sturctures and then properly integrating that information into a natural language description. Recurrent neural networks can be both used to...
Many-Class Few-Shot Classification
, M.Sc. Thesis Sharif University of Technology ; Soleymani Baghshah, Mahdieh (Supervisor)
Abstract
Few-shot learning methods have achieved notable performance in recent years. However, fewshot learning in large-scale settings with hundreds of classes is still challenging. In this dissertation, we tackle the problems of large-scale few-shot learning by taking advantage of pre-trained foundation models. We recast the original problem in two levels with different granularity. At the coarse-grained level, we introduce a novel object recognition approach with robustness to sub-population shifts. At the fine-grained level, generative experts are designed for few-shot learning, specialized for different superclasses. A Bayesian schema is considered to combine coarse-grained information with...
Continual Learning Algorithms Inspired by Human Learning
, M.Sc. Thesis Sharif University of Technology ; Soleymani Baghshah, Mahdieh (Supervisor)
Abstract
Despite the remarkable success of deep learning algorithms in recent years, it still has a long way to reach the status of human natural intelligence and to acquire the expected self-autonomy. As a result, many researchers in this field have focused on the development of these algorithms while taking inspiration from human cognitive behaviors. One of the disadvantages of current algorithms is the lack of their ability to learn in a continual manner while deployed in the environment. More precisely, deep learning models are not able to gradually gather knowledge from the environment and if they are in a situation of limited access to data, they will suffer from catastrophic forgetting; a...
Deep Probabilistic Models for Continual Learning
, M.Sc. Thesis Sharif University of Technology ; Soleymani Baghshah, Mahdieh (Supervisor)
Abstract
Recent advances in deep neural networks have shown significant potential; however, they still face challenges when it comes to non-stationary environments. Continual learning is related to deep neural networks with limited capacity that should perform well on a sequence of tasks. On the other hand, studies have shown that neural networks are sensitive to covariate shifts. But in many cases, the distribution of data varies with time. Domain Adaptation tries to improve the performance of a model on an unlabeled target domain by using the knowledge of other related labeled data coming from a different distribution. Many studies on domain adaptation have optimistic assumptions that are not...
Modification of carbohydrate polymers via grafting in air. 1. Ceric-Induced synthesis of starch-g-polyacrylonitrile in presence and absence of oxygen [electronic resource]
, Article Starch - Starke ; Volume 54, Issue 3-4, pages 140–147, April 2002 ; Zohurian Mehr, Mohammad J
Abstract
Monomer grafting, a unique technique for polysaccharide modification, is always performed under inert (e.g., N2) atmosphere. This work is the first report related to evaluating the possibility and efficiency of the grafting of acrylonitrile (AN) onto starch in presence of oxygen. Thus, corn starch (in both granular and gelatinized states) as well as soluble starch were grafted by AN using a ceric-carbohydrate redox initiating system. Graft copolymerizations were performed under nitrogen, air, and oxygen atmospheres at similar conditions. Grafting occurrence was verified using chemical and spectral proofs. The polymerization mechanism and kinetics were investigated by recording the...
Modification of carbohydrate polymers via grafting in Air. 2. Ceric-Initiated graft copolymerization of acrylonitrile onto natural and modified polysaccharides [electronic resource]
, Article Starch - Stärke ; Volume 54, Issue 10, pages 482–488, October 2002 ; Zohuriaan-Mehr, Mohammad J
Abstract
Acrylonitrile (AN) was grafted onto various natural and modified polysaccharides (i.e., gum arabic, gum tragacanth, xanthan gum, sodium alginate, chitosan, sodium carboxymethyl cellulose, hydroxyethyl cellulose, methyl cellulose) by using ceric-carbohydrate redox initiating system. After overcoming practical problems, mainly from the high viscosity of the aqueous solutions of the different substrates, the graft copolymerization reactions were run either in air or in N2 atmosphere under similar conditions. Grafting was confirmed using chemical and spectral (FTIR) proofs. The reactions were kinetically investigated using semi-empirical expressions and time-temperature profiles. An anomalous...
Representation Learning by Deep Networks and Information Theory
, M.Sc. Thesis Sharif University of Technology ; Soleymani Baghshah, Mahdieh (Supervisor)
Abstract
Representation learning refers to mapping the input data to another space, usually with lower dimensions than the input space. This task can be helpful in improving the performance of methods in downstream tasks, compression, and improving sample generation in generative models. Representation learning is a problem connected to information theory, and information theory's concepts and quantities are used widely in representation learning models. Besides, the representation learning problem is closely related to latent variable generative models. These models usually learn useful representations in their process of training, implicitly or explicitly. So, the usage of latent variable...
Design and Implementation of a Collision Avoidance Module in Dynamic Environment with Deep Reinforcement Learning on Arash Social Robot
, M.Sc. Thesis Sharif University of Technology ; Meghdari, Ali (Supervisor) ; Taheri, Alireza (Supervisor) ; Soleymani, Mahdieh (Co-Supervisor)
Abstract
Nowadays, one of the challenges in social robotics is to navigate the robot in social environments with moving elements such as humans. The purpose of this study is to navigate the Arash 2 social robot in a dynamic environment autonomously without encountering moving obstacles (humans). The Arash 2 robot was first simulated in the Gazebo simulator environment in this research. The simultaneous location and mapping (SLAM) technique was implemented on the robot using a lidar sensor to obtain an environment map. Then, using the deep reinforcement learning approach, the neural network developed in the simulation environment was trained and implemented on the robot in the real environment. The...
Modeling of in Plane Behavior of Retrofitted Adobe Walls with Finite Element Method under Cyclic Loading
, M.Sc. Thesis Sharif University of Technology ; Bakhshi, Ali (Supervisor) ; Ghannad, Mohammad Ali (Co-Advisor)
Abstract
Statistical studies indicate the fact that in Iran, like many other developing countries, a significant percentage of buildings are made of traditional materials and most of them are adobe buildings especially in rural areas. Furthermore, geological data indicates that Iran is located in alps-Himalayas seismic zone and exposed by destructive earthquakes and adobe buildings suffer the most damages comparing with other type of the structures. According to these explanations, this research deals with adobe buildings behavior and provides solutions for reinforcing them and improving the seismic response of these buildings. These studies include numerical modeling with finite elements methods by...
Silica chloride/wet SiO2 as a novel heterogeneous system for the deprotection of acetals under mild conditions [electronic resource]
, Article Phosphorus, Sulfur, and Silicon and the Related Elements ; Volume 178:2667-2670, Issue 12, 2003 ; Pourjavadi, Ali ; Zolfigol, Mohammad Ali ; Bamoniri, Abdolhamid
Abstract
A combination of silica chloride and wet SiO2 was used as an effective deacetalizating agent for the conversion of acetals to their corresponding carbonyl derivatives under mild and heterogeneous condition
Improvement of Reasoning in Large Language Models
, M.Sc. Thesis Sharif University of Technology ; Soleymani Baghshah, Mahdieh (Supervisor)
Abstract
Ensuring the reliability of Large Language Models (LLMs) in complex reasoning tasks remains a formidable challenge, particularly in scenarios that demand precise mathematical calculations and knowledge-intensive open-domain generation. In this work, we introduce an uncertainty-aware framework designed to enhance the accuracy of LLM responses by systematically incorporating model confidence at critical deci- sion points. We propose an approach that encourages multi-step reasoning in LLMs and quantify the confidence of intermediate answers such as numerical results in math- ematical reasoning and proper nouns in open-domain generation. Then, the overall confidence of each reasoning chain is...
Distributed Fault-tolerant Computation for Massive Data
, M.Sc. Thesis Sharif University of Technology ; Maddah-Ali, Mohammad Ali (Supervisor)
Abstract
In this thesis we consider the problem of distributed computation by many processors.We mainly concentrate on matrix multiplication problem in this thesis because of its importance. A distributed system consists of N worker processors and one master processor. The master processor should distribute the computation between workers and after computation in each of them, collect the results. In this thesis, we are going to mitigate the effect of straggler processors by using coding methods. Straggler processors can cause delays in the computation time.In this thesis, we firstly introduce a method to multiply any number of matrices in each other. The proposed method occurred in one shot without...
Robust Learning to Spurious Correlation without Access to Side Information of the Environment
, M.Sc. Thesis Sharif University of Technology ; Rohban, Mohammad Hossein (Supervisor) ; Soleymani Baghshah, Mahdieh (Supervisor)
Abstract
Traditionally, machine learning models for classification tasks rely on statistical methods to find correlations between patterns in the input data and their correspond- ing labels. However, these correlations are not necessarily consistent across different data partitions and may change at test time. Such unstable correlations are referred to as spurious correlations. When the spurious correlation relied upon during training changes at test time, the model’s accuracy can degrade. To improve robustness to shifts in spurious correlations, most research in this area assumes that group annota- tions based on different values of the spurious attribute are available during training or validation....
Incomplete Reduction of Iron Ore in a Fluidized Bed Reactor
, M.Sc. Thesis Sharif University of Technology ; Halali, Mohammad (Supervisor)
Abstract
The main goal of current investigation was partial reduction of Fe2O3 available in Hematite-rich ore to Magnetite by Fludization Bed method. The reductive gas, CO, was produced by incomplete combustion of Acetylene and Airmixed in inappropriate ratio. X-Ray Diffraction and X-Ray Florescence analyses were implemented in order to conducting qualitative analysis of present phases before and after reduction of the ore. Quantitative analysis of the same phases was also conducted by Titration. The latter analysis revealed the precise magnitude of available iron in each of the oxide containing phases. The chemical composition of the reactor gas was also determined before and after reduction...
Sequential Competitive Facility Location In Continuous Geometric Space
, M.Sc. Thesis Sharif University of Technology ; Abam, Mohammad Ali (Supervisor)
Abstract
Abstract The problem of competetive facility location can be defined as follows: There are a number of customers in the form of points in space, and two players arrange a number of facilities in the form of points in space, given some limitations, respectively. Each customer’s connection to each facility has a cost for the customer and an advantage for the facility, and each customer wants to be connected to at most one of the facilities which has the lowest cost for him. The goal is to find the strategy of placing the facilities and determining the cost which the facility receives from the customer, in such a way that the player’s profit is maximised.In this thesis, we first sought to...
Distributed Verifiable Computing: Algorithms and Analysis
, M.Sc. Thesis Sharif University of Technology ; Maddah Ali, Mohammad Ali (Supervisor)
Abstract
Zero knowledge proofs allow a person (prover) to convince another person (verifier) that he has performed a specific computation on a secret data correctly, and has obtained a true answer, without having to disclose the secret data. QAP (Quadratic Arithmetic Program) based zkSNARKs (zero knowledge Succinct Non-interactive Argument of Knowledge) are a type of zero knowledge proof. They have several properties that make them attractive in practice, e.g. verifier's work is very easy. So they are used in many areas such as Blockchain and cloud computing. But yet prover's work in QAP based zkSNARKs is heavy, therefore, it may not be possible for a prover with limited processing resource to run...
Privacy Preserving Communication Schemes for Light Clients in Blockchain Networks: Algorithms and Analysis
, M.Sc. Thesis Sharif University of Technology ; Pakravan, Mohammad Reza (Supervisor) ; Maddah Ali, Mohammad Ali (Co-Supervisor)
Abstract
Lightweight clients are a type of blockchain users who do not store all the blocks in the blockchain due to limited resources. These users store only a small part of each block and when needed, request transactions from full nodes that store the entire blockchain. These users have no role in block validation and only want to receive transactions related to their addresses with proof of the inclusion in the block from full nodes.Since light clients rely on full nodes for receiving transactions, their privacy against full nodes is important. The current implementation of Bitcoin uses Bloom filters for privacy, but this offers very little privacy to the users.In this thesis, we study the...