Boost your learning journey with 24/7 access to skilled experts offering unmatched machine learning homework help

**Q1:** Assignment 5: Matrix as a Linear Transformation

**Q2:** Q1. Consider the problem where we want to predict the gender of a person from a set of input parameters, namely height, weight, and age.

**Q3:** Q2. Using the data from Problem 2, build a Gaussian Naïve Bayes classifier for this problem. For this you have to learn Gaussian distribution parameters for each input data feature, i.e. for p(height|W), p(height|M), p(weight|W), p(weight|M), p(age|W), and p(age|M).
a) Learn/derive the parameters for the Gaussian Naïve Bayes classifier for the data from Question 2 a) and apply them to the same target as in Problem 1 a).
b) Implement the Gaussian Naïve Bayes classifier for this problem.
c) Repeat the experiments in parts 1 c) and 1 d) with the Gaussian Naïve Bayes classifier. Discuss the results, in particular with respect to the performance difference between using all features and using only height and weight.
d) Same as 1 d) but with Naïve Bayes.
e) Compare the results of the two classifiers (i.e., the results from 1 c) and 1 d) with the ones from 2 c) and 2 d)) and discuss reasons why one might perform better than the other.

**Q4:** For this programming assignment you will implement the Naive Bayes algorithm from scratch, together with the functions to evaluate it with k-fold cross validation (also from scratch). You can use the code in the following tutorial to get started and to get ideas for your implementation of the Naive Bayes algorithm, but please enhance it as much as you can (there are many things you can do to enhance it, such as those mentioned at the end of the tutorial).

**Q5:** Q1. Consider the problem where we want to predict the gender of a person from a set of input parameters, namely height, weight, and age.
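The per-class, per-feature Gaussian parameter learning described in Q3 can be sketched from scratch with NumPy. This is only an illustrative sketch; the height/weight/age values below are invented, not the assignment's dataset:

```python
import numpy as np

def fit_gnb(X, y):
    """Learn per-class priors and per-feature Gaussian (mean, std) parameters."""
    params = {}
    for c in np.unique(y):
        Xc = X[y == c]
        params[c] = (len(Xc) / len(X),        # prior p(c)
                     Xc.mean(axis=0),          # per-feature means
                     Xc.std(axis=0, ddof=1))   # per-feature sample stds
    return params

def predict_gnb(params, x):
    """Pick the class maximizing log prior + sum of per-feature log densities."""
    best, best_score = None, -np.inf
    for c, (prior, mu, sigma) in params.items():
        logp = np.log(prior) - 0.5 * np.sum(
            np.log(2 * np.pi * sigma**2) + ((x - mu) / sigma)**2)
        if logp > best_score:
            best, best_score = c, logp
    return best

# Invented (height m, weight kg, age yr) samples with labels 'M'/'W'
X = np.array([[1.80, 82, 30], [1.75, 78, 35], [1.85, 90, 28],
              [1.60, 55, 32], [1.65, 60, 27], [1.58, 52, 40]])
y = np.array(['M', 'M', 'M', 'W', 'W', 'W'])
model = fit_gnb(X, y)
print(predict_gnb(model, np.array([1.78, 80, 31])))  # 'M' on this toy data
```

The naive-Bayes independence assumption is what lets the joint likelihood factor into one univariate Gaussian per feature.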
a) Using Cartesian distance, Manhattan distance, and Minkowski distance of order 3 as the similarity measures, show the results of the gender prediction for the Evaluation data listed below the generated training data, for values of K of 1, 3, and 7. Include the intermediate steps (i.e., distance calculation, neighbor selection, and prediction).
b) Implement the KNN algorithm for this problem. Your implementation should work with different training data sets as well as different values of K, and should allow you to input a data point for the prediction.
c) To evaluate the performance of the KNN algorithm (using the Euclidean distance metric), implement a leave-one-out evaluation routine for your algorithm. In leave-one-out validation, we repeatedly evaluate the algorithm by removing one data point from the training set, training the algorithm on the remaining data set, and then testing it on the point we removed to see if the label matches or not. Repeating this for each of the data points gives us an estimate of the percentage of erroneous predictions the algorithm makes, and thus a measure of the accuracy of the algorithm for the given data. Apply your leave-one-out validation with your KNN algorithm to the dataset for Question 1 c) for values of K of 1, 3, 5, 7, 9, and 11 and report the results. For which value of K do you get the best performance?
d) Repeat the prediction and validation you performed in Question 1 c) using KNN when the age data is removed (i.e., when only the height and weight features are used as part of the distance calculation in the KNN algorithm). Report the results and compare the performance without the age attribute with the results from Question 1 c). Discuss the results. What do the results tell you about the data?

**Q6:** 6. (Programming) You need to implement the kNN algorithm as in the slides.
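A minimal NumPy sketch of the kNN prediction and the leave-one-out loop asked for above (Euclidean distance; the toy height/weight/age data is invented for illustration):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k):
    """Majority vote among the k nearest neighbors by Euclidean distance."""
    dists = np.linalg.norm(X_train - x, axis=1)
    nearest = np.argsort(dists)[:k]
    return Counter(y_train[nearest]).most_common(1)[0][0]

def leave_one_out_accuracy(X, y, k):
    """Hold out each point in turn, train on the rest, and score the held-out point."""
    hits = 0
    for i in range(len(X)):
        mask = np.arange(len(X)) != i
        hits += knn_predict(X[mask], y[mask], X[i], k) == y[i]
    return hits / len(X)

# Invented (height, weight, age) data with labels
X = np.array([[1.80, 82, 30], [1.75, 78, 35], [1.85, 90, 28], [1.70, 75, 33],
              [1.60, 55, 32], [1.65, 60, 27], [1.58, 52, 40], [1.63, 57, 29]])
y = np.array(['M', 'M', 'M', 'M', 'W', 'W', 'W', 'W'])
for k in (1, 3, 7):
    print(k, leave_one_out_accuracy(X, y, k))
```

Note that on raw features the weight column dominates the distance, which is exactly the kind of effect the feature-removal experiment in part d) probes.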
The data we use for binary classification tasks is the UCI a4a data.

**Q7:** Question 1. Download the SGEMM GPU kernel performance dataset from https://archive.ics.uci.edu/ml/datasets/SGEMM+GPU+kernel+performance. Understand the dataset by performing exploratory analysis. Prepare the target parameter by taking the average of the THREE (3) runs with long performance times. Design a linear regression model to estimate the target using only THREE (3) attributes from the dataset. Discuss your results, the relevant performance metrics, and the impact of normalizing the dataset.

**Q8:** Question 2. Load the wine dataset from the sklearn package. Perform exploratory data analysis and design a simple TWO (2) layer neural network for the classification. Compare the performance with the Naïve Bayes algorithm. Train the neural network such that it has better or equal performance to the Naïve Bayes algorithm.

**Q9:** Question 3. Download the MAGIC gamma telescope 2004 dataset available on Kaggle (https://www.kaggle.com/abhinand05/magic-gamma-telescope-dataset). Prepare the dataset and perform exploratory data analysis. Set up a random forest algorithm for identifying whether a pattern was caused by a gamma signal or not. Propose optimal values for the depth and number of trees in the random forest. Assess and compare the performance of the optimized random forest with the Naïve Bayes algorithm. Discuss the performance metrics and the computational complexity.

**Q10:** Question 4. Use the Fashion MNIST dataset from the keras package. Perform exploratory data analysis. Show a random set of FIVE (5) images from each class in the dataset with their corresponding class names. Prepare the dataset by normalizing the pixel values to be between 0 and 1. Design a CNN with TWO (2) convolutional layers and FOUR (4) dense layers (including the final output layer). Employ 'ReLU' activation and 'MaxPooling'.
Keep 15% of the train dataset for validation. Rate the performance of the algorithm and provide the necessary plots. Pick a random image from the test dataset, pass it to the algorithm, and compare the algorithm's output with the actual class label.

**Q11:** Question 5. Select any stock listed on the Singapore stock exchange. Using Yahoo Finance, download the daily stock data (Open, High, Low, Close, Adj Close, Volume) from 1 Jan 2020 to 3 Jan 2022. Use data until 31 Dec 2020 for training and the remaining data for testing. You must select the stock such that the data is available from 1 Jan 2020 to 3 Jan 2022. Use the previous 30 days of stock information to predict the next day's stock price. Use the data in the 'High' column to predict the price, i.e., the next day's high price of the stock. Design an LSTM network to do the predictions. You are required to use an LSTM with a cell state of at least 60 dimensions and do at least 50 epochs of training. Rate the performance of the LSTM model and provide the necessary plots.

**Q12:** This is a machine learning model in Python using scikit-learn to classify handwritten Arabic letters. There are two files: the train data and the test data. The code is available, and we need to optimize it so that under box number 6, when we do the cross validation of the model, the accuracy is in the high 80s to low 90s. We should tune the hyperparameters and improve the pipeline as needed. Anything from scikit-learn is allowed, but nothing more. As the code stands, the model accuracy is 79%. The goal is to modify the code to get a model accuracy in the high 80s to low 90s. In box 3 of the code are the hyperparameters that need to be tuned and the pipeline that might need to be modified. A voting model can be used to get high accuracy. We need to improve the model accuracy over the existing code.
Info about the dataset: The dataset is composed of 16,800 characters written by 60 participants; the age range is between 19 and 40 years, and 90% of participants are right-handed. Each participant wrote each character (from 'alef' to 'yeh') ten times on two forms. The forms were scanned at a resolution of 300 dpi. The dataset is partitioned into two sets: a training set (13,440 characters, 480 images per class) and a test set (3,360 characters, 120 images per class). The writers of the training set and test set are exclusive, and the ordering of writers assigned to the test set was randomized to make sure that the test-set writers were not from a single institution (to ensure variability of the test set). Goal: build an image classifier to classify handwritten Arabic characters using scikit-learn, with a model accuracy in the high 80s (e.g., 89%) or low 90s (e.g., 92%). This is all about tuning the hyperparameters and the model pipeline.

**Q13:** This is a machine learning model in Python using scikit-learn to classify handwritten Arabic letters. There are two files: the train data and the test data. The code is available, and we need to optimize it so that under box number 6, when we do the cross validation of the model, the accuracy is in the high 80s to low 90s. We should tune the hyperparameters and improve the pipeline as needed. Anything from scikit-learn is allowed, but nothing more.
As the code stands, the model accuracy is 79%. The goal is to modify the code to get a model accuracy in the high 80s to low 90s. In box 3 of the code are the hyperparameters that need to be tuned and the pipeline that might need to be modified. A voting model can be used to get high accuracy. We need to improve the model accuracy over the existing code. The dataset is the same handwritten Arabic character dataset described in Q12 (16,800 characters by 60 writers; 13,440 training and 3,360 test images, with writer-exclusive splits). Goal: build an image classifier to classify handwritten Arabic language characters using scikit-learn.
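One hedged sketch of the pipeline-plus-voting tuning that Q12/Q13 ask for, shown on sklearn's bundled digits data as a stand-in, since the Arabic-letters files are not available here; the estimator choices and grid values are illustrative assumptions, not the assignment's actual box-3 settings:

```python
# Tuning sketch: scaling -> soft-voting ensemble -> small hyperparameter grid.
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_digits(return_X_y=True)
X, y = X[:500], y[:500]  # small subset so the sketch runs quickly

# Soft-voting ensemble of three differently biased classifiers
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("vote", VotingClassifier(
        estimators=[("svc", SVC(probability=True)),
                    ("rf", RandomForestClassifier(random_state=0)),
                    ("knn", KNeighborsClassifier())],
        voting="soft")),
])

# A small grid over hyperparameters that usually matter most
grid = GridSearchCV(pipe, {
    "vote__svc__C": [1, 10],
    "vote__knn__n_neighbors": [3, 5],
}, cv=3, n_jobs=-1)
grid.fit(X, y)
print(grid.best_params_, round(grid.best_score_, 3))
```

The nested `vote__svc__C` naming is how scikit-learn routes grid parameters through a Pipeline step into a named VotingClassifier estimator.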
The model accuracy has to be in the high 80s (e.g., 89%) or low 90s (e.g., 92%). This is all about tuning the hyperparameters and the model pipeline.

**Q14:** There are four folders; each folder contains a set of exercises, and the expected results are written at the top of each ipynb. Some files are just example solutions. Day 1 is all about fitting a linear regression or logistic regression to the data, and determining the decision boundaries. Day 2: use neural networks to solve simple classification examples. Day 3: use a convolutional neural network with PyTorch, with one example solution. Day 4: deep learning; the solution is ready, we just add the testing data, test the built model, and output a submission file with labels.

**Q15:** The main aim of this project is to analyze a movie review's textual content in order to determine its underlying sentiment. In this project, we try to classify whether a person liked the movie or not based on the review they give for the movie. 1) You need to develop Python code to calculate the sentiment using NLP analysis, and you should use a CNN and logistic regression. 2) You need to create a report on what you have done in the code, and you also need to explain how our work differs from the references we have taken (the references are in the document).

**Q16:** Programming Assignment 2. For this programming assignment you will implement the LeNet-5 CNN using either PyTorch or TensorFlow, but not Keras. You can look at other implementations on the internet but please, when coding, use your personal coding style and add references to your sources. The goal of this implementation is that you completely understand what happens in the code, because our TA will ask you questions about it when reviewing your assignment (you need to make an appointment with your TA for this). Here is an implementation in PyTorch:
- implementing-yann-lecuns-lenet-5-in-pytorch-5e05a0911320
- lenet5_pytorch.ipynb

Here is an implementation in TensorFlow (careful: the tutorial and implementation don't match; I couldn't find the pair from the same author):
- lenet-with-tensorflow-a35da0d503df
- 6751b1b92fe8f4ff617f10c7f9f9d315

Test your implementation with the MNIST dataset from Kaggle.
Submission:
- Code of your implementation of LeNet-5.
- Brief report of the results on the MNIST dataset.
- An analysis of your results on the MNIST dataset.
TA Review:
- You will show your implementation to our TA and he will ask you details about how LeNet works in order to grade you.
NOTES: 1. DO NOT JUST COPY THE CODE FROM THE TUTORIAL

**Q17:** Linear Regression:
1. Consider a simplified fitting problem in the frequency domain where we are looking to find the best fit of data with a set of periodic (trigonometric) basis functions of the form 1, sin²(k·x), sin²(2k·x), ..., where k is effectively the frequency increment. The resulting function for a given "frequency increment" k, "function depth" d, and parameter vector θ is then:

y = θ₀ · 1 + Σ_{i=1..d} θᵢ · sin²(i·k·x)

For example, if k = 1 and d = 1, your basis (feature) functions are: 1, sin²(x); if k = 1 and d = 2, they are: 1, sin²(x), sin²(2x); if k = 3 and d = 4, they are: 1, sin²(3·1·x), sin²(3·2·x), sin²(3·3·x), sin²(3·4·x). This means that this problem can be solved using linear regression, as the function is linear in terms of the parameters θ. Try "frequency increment" k from 1-10 as part of the data generation process described above.
a) Implement a linear regression learner to solve this best-fit problem for 1-dimensional data. Make sure your implementation can handle fits for different "function depths" (at least to "depth" 6).
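The fit in part a) reduces to ordinary least squares over a design matrix of sin² features. A minimal NumPy sketch; the data below is synthetic, generated from the same function family rather than from the assignment's dataset:

```python
import numpy as np

def design_matrix(x, k, d):
    """Columns: 1, sin^2(k x), sin^2(2k x), ..., sin^2(d k x)."""
    cols = [np.ones_like(x)] + [np.sin(i * k * x) ** 2 for i in range(1, d + 1)]
    return np.column_stack(cols)

def fit(x, y, k, d):
    """Least-squares estimate of theta for a given frequency increment and depth."""
    theta, *_ = np.linalg.lstsq(design_matrix(x, k, d), y, rcond=None)
    return theta

def predict(x, theta, k):
    return design_matrix(x, k, len(theta) - 1) @ theta

# Synthetic data drawn from the model with k=2, d=2, theta = (0.5, 1.0, -2.0)
rng = np.random.default_rng(0)
x = rng.uniform(0, 3, 100)
y = 0.5 + 1.0 * np.sin(2 * x) ** 2 - 2.0 * np.sin(4 * x) ** 2
theta = fit(x, y, k=2, d=2)
print(np.round(theta, 3))  # recovers [ 0.5  1.  -2. ] on this noiseless data
```

Because the model is linear in θ, the same `lstsq` call handles any depth d; only the design matrix changes.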
b) Apply your regression learner to the data set that was generated for Question 1 b) and plot the resulting function for "function depths" 0, 1, 2, 3, 4, 5, and 6. Plot the resulting function together with the data points.
c) Evaluate your regression functions by computing the error on the test data points that were generated for Question 1 c). Compare the error results and try to determine for which "function depths" overfitting might be a problem. Which "function depth" would you consider the best prediction function, and why? For which values of k and d do you get the minimum error?
d) Repeat the experiment and evaluation of parts b) and c) using only the first 20 elements of the training data set of part b) and the test set of part c). What differences do you see, and why might they occur?

Locally Weighted Linear Regression:
2. Another way to address nonlinear functions with a lower likelihood of overfitting is the use of locally weighted linear regression, where the neighborhood function addresses non-linearity and the feature vector stays simple. In this case we assume that we will use only the raw feature x as well as the bias (i.e., a constant feature 1). Thus the locally applied regression function is:

y = θ₀ + θ₁ · x

As discussed in class, locally weighted linear regression solves a linear regression problem for each query point, deriving a local approximation of the shape of the function at that point (as well as of its value). To achieve this, it uses a modified error function that applies a weight to each data point's error that is related to its distance from the query point. Here we will assume that the weight function for the i-th data point x⁽ⁱ⁾ and query point x is:

w⁽ⁱ⁾(x) = e^(−(x⁽ⁱ⁾ − x)² / γ)

Use γ = 0.204, where γ is a measure of the "locality" of the weight function, indicating how fast the influence of a data point changes with its distance from the query point.
a) Implement a locally weighted linear regression learner to solve the best-fit problem for 1-dimensional data.
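A minimal sketch of the locally weighted learner in part a), assuming the Gaussian-style weight w⁽ⁱ⁾(x) = exp(−(x⁽ⁱ⁾ − x)²/γ) with γ = 0.204 as given; the data below is invented for illustration:

```python
import numpy as np

GAMMA = 0.204  # locality parameter from the assignment

def lwlr_predict(x_train, y_train, xq, gamma=GAMMA):
    """Solve a weighted least-squares fit of y = t0 + t1*x at query point xq."""
    w = np.exp(-((x_train - xq) ** 2) / gamma)        # per-point weights
    Phi = np.column_stack([np.ones_like(x_train), x_train])
    W = np.diag(w)
    theta = np.linalg.solve(Phi.T @ W @ Phi, Phi.T @ W @ y_train)
    return theta[0] + theta[1] * xq

# Sanity check: on data from a line, the local fit reproduces the line exactly
x = np.linspace(0, 2, 50)
y = 3.0 + 2.0 * x
print(lwlr_predict(x, y, 0.7))  # 4.4
```

Note that a fresh weighted regression is solved per query point, which is why there is no separate training step.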
b) Apply your locally weighted linear regression learner to the data set that was generated for Question 1 b) and plot the resulting function together with the data points.
c) Evaluate the locally weighted linear regression on the test data from Question 1 c). How does the performance compare to the results from Question 1 c)?
d) Repeat the experiment and evaluation of parts b) and c) using only the first 20 elements of the training data set. How does the performance compare to the results from Question 1 d)? Why might this be the case?
e) Given the results from parts c) and d), do you believe the data set you used was actually derived from a function that is consistent with the function format in Question 1? Justify your answer.

Logistic Regression:
3. Consider again the problem from Questions 1 and 2 in the first assignment, where we want to predict the gender of a person from a set of input parameters, namely height, weight, and age. Assume the same datasets you generated for the first assignment. Use a learning rate of 0.01 and try different values for the number of iterations.
a) Implement logistic regression to classify this data (use the individual data elements, i.e., height, weight, and age, as features). Your implementation should take different data sets as input for learning.
b) Plot the resulting separating surface together with the data. To do this plotting you need to project the data and function into one or more 2D spaces. The best visual results come from projecting along the separating hyperplane (i.e., into a space described by the normal of the hyperplane and one of the dimensions within the hyperplane).
c) Evaluate the performance of your logistic regression classifier in the same way as for Project 1, using leave-one-out validation, and compare the results with the ones for KNN and Naïve Bayes. Discuss what differences exist and why one method might outperform the others for this problem.
d) Repeat the evaluation and comparison from part c) with the age feature removed. Again, discuss what differences exist and why one method might outperform the others in this case.

**Q18:** CSE 6363 - Machine Learning.
Data Set: Use the dataset given at the bottom of this file.
Do Not Use: You are not allowed to use any ML libraries other than NumPy. You cannot use sklearn or any ML library; if used, you will receive a penalty of 90 points. You cannot use pandas; if used, you will receive a penalty of 20 points.
Libraries: You are allowed to use NumPy and math. You can use matplotlib to plot graphs. If you want to use any other library apart from these, please check with your GTA and get their approval.
Where to code:
1. We will provide you with a directory structure with Python files for each part of every question. You must write your code in these files.
2. It will contain a script to execute the files. You must run this script and verify that your code runs before you submit. To run the script you must make it executable first, or else you will get a permission-denied error.

**Q19:** 1. Design and develop a text classifier which can be used as an Amazon review categorizer. Your classifier must be trainable to classify reviews into one of two classes: positive and negative reviews. A description can be found in the readme file. Please note that we are using only the test set, as the full dataset is huge; this test set contains 400k data points.
a) The data set can be found on Canvas.
b) Use the TfidfVectorizer found in the scikit-learn library in Python to vectorize the dataset.
c) Use GaussianNB for the classifier.
d) Calculate the accuracy of the model. You need to use data partitioning to create a train set and a test set from the data set given.
e) Input a sample text and determine the class of the text provided.

**Q20:** Use the dataset given at the bottom of this file.
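A minimal sketch of the Q19 pipeline (TfidfVectorizer + GaussianNB) on an invented toy corpus, since the real Amazon review data isn't available here. One practical detail: GaussianNB needs dense input, so the sparse TF-IDF matrix is converted with `toarray()`:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# Invented toy reviews standing in for the 400k-point Amazon set
reviews = ["great product, loved it", "terrible quality, broke fast",
           "loved the fast delivery", "awful, would not buy again",
           "great value, works great", "broke on arrival, terrible"]
labels = ["pos", "neg", "pos", "neg", "pos", "neg"]

vec = TfidfVectorizer()
X = vec.fit_transform(reviews).toarray()          # densify for GaussianNB
X_tr, X_te, y_tr, y_te = train_test_split(X, labels, test_size=0.33,
                                          random_state=0, stratify=labels)
clf = GaussianNB().fit(X_tr, y_tr)
print("held-out accuracy:", clf.score(X_te, y_te))

# Part e): classify a sample text
sample = vec.transform(["terrible, broke immediately"]).toarray()
print(clf.predict(sample)[0])
```

On the real 400k-review set, densifying the full TF-IDF matrix may not fit in memory, so limiting `max_features` in TfidfVectorizer (or switching to a sparse-friendly variant such as MultinomialNB) is worth considering.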

- C/C++
- Java
- Python
- Agile Software Development
- Android App Development
- Artificial Intelligence
- Assembly Programming
- Big Data
- C#
- Cloud Computing
- Compiler Design
- Computer Graphics
- Computer Networks
- Computer Organisation And Architecture
- Cryptography
- Cyber Security
- Data Mining
- Data Science
- Data Structures And Algo
- Data Warehousing
- DBMS
- Deep Learning
- Distributed Computing
- Formal Language Automata
- Haskell Programming
- Internet Of Things
- Machine Learning
- Mobile Computing
- Multimedia Technology
- Natural Language Processing
- Object Oriented Analysis And Design
- Operating System
- Programming Language Principle And Paradigm
- Prolog Programming
- Real Time System
- Software Engineering
- Web Designing And Development
- Design And Analysis Of Algorithms
- React
- Coding

TutorBin believes that distance should never be a barrier to learning. With over 500,000 orders and 100,000 happy customers, TutorBin has become a name that keeps learning fun in the UK, USA, Canada, Australia, Singapore, and UAE.