10th Computer Science Unit 4 Data and Analysis
Ans: Data science helps businesses make informed decisions by transforming raw data into meaningful insights. By applying statistical methods, machine learning, and data visualisation, companies can identify trends, forecast outcomes, and improve their strategies. We can briefly learn about its scope in some common fields of life as follows:
Healthcare: To predict the possibilities of some disease, based on X-rays, Ultrasounds or other analytics of the patient.
Sports: If we want to predict who is going to win a cricket match, based on the previous performance of teams, data science will help us.
Ans: Data science plays a crucial role in the development of machine learning and artificial intelligence in several ways.
(i) It helps analyse data to discover useful patterns and insights, which are essential for building intelligent models.
(ii) Data science supports the development and selection of appropriate algorithms by experimenting with various approaches and measuring their effectiveness on real-world data.
(iii) It ensures continuous improvement by monitoring the performance of ML and AI systems over time and updating them with new data to keep them accurate and relevant.
Ans:
| Supervised learning | Unsupervised learning |
| (i) Uses labelled data (input and output are known) | (i) Uses unlabelled data (only input is known) |
| (ii) Predict outcomes or classify data based on known labels. | (ii) Discover hidden patterns or groupings in data. |
| (iii) Email spam detection, price prediction, disease diagnosis. | (iii) Customer segmentation, market basket analysis. |
| (iv) Predicts specific values or categories. | (iv) Identifies clusters, associations, or structures. |
| (v) Learns from the correct answers provided during training. | (v) Learns without explicit guidance or correct answers. |
Ans: An everyday example of reinforcement learning is teaching a pet dog new tricks. When you train your dog, you give it a command, and if the dog performs the trick correctly (like sitting or fetching), you reward it with a treat or praise. This reward encourages the dog to repeat the behaviour in the future. If the dog doesn’t follow the command, it doesn’t get a reward, so it learns to avoid that behaviour. Over time, through trial and error and receiving feedback (rewards or no rewards), the dog learns which actions lead to positive outcomes.
| Sr. No. | Scenario | Suitable Machine Learning Model |
| 1 | You have a basket of mixed fruits (apple and banana),lo0 and you want a robot/machine to sort them. | Supervised learning |
| 2 | You are given a task to find the similarity in various flavours of ice cream. | Reinforcement learning |
| 3 | You have a book with pictures, and you want to teach your sibling to recognise them. | Unsupervised learning |
| 4 | You want to train a toy robot to find its way out of a maze. | Supervised learning |
| 5 | You have a set of shapes (square, triangle, circle and you want to teach a computer to recognise them. | Reinforcement learning |
| 6 | Your parents want you to clean your messy room if you want to attend the birthday party of your friend | Reinforcement learning |
| 7 | You are given a task to learn how to ride a bicycle to participate in a sports event. | Supervised learning |
| 8 | You have a book collection without specific categories, and you want your sibling to arrange them according to size, choice or ease of access. | Unsupervised learning |
| 9 | You have to unlock some rewards in your favourite video game. | Unsupervised learning |
| 10 | You have to unlock some rewards in your favorite video game. | Reinforcement learning |
Artificial Intelligence and Machine Learning skills refer to develop systems that can learn and perform decision-making. Data Science skill refers to extracting insights from data and making informed decisions. There are many skills to achieve excellence in artificial intelligence. These skills include programming language, machine learning, and acquiring domain-specific knowledge. In programming languages, Python and R are the most used languages in Al. In machine learning algorithms, the knowledge of the TensorFlow framework, deep learning, neural networks, and NLP is very important.
Data Science has a wide scope in many disciplines of life, and it is continuously expanding. We can briefly learn about its scope in some common fields of life a follows:
The term artificial intelligence is not new. In 1950, a British mathematician, Alan Turing, proposed a Turing Test, which measured the ability of a machine to exhibit intelligent behaviour. Nowadays the modern robots are supposed to be smarter if they can pass the Turing Test. The term Artificial Intelligence refers to the ability of a machine to exhibit human behaviour like problem solving, understanding natural language & interacting with the environment intelligently. Like data science, the scope of artificial intelligence is wide.
Machine learning is used in many fields of life, like healthcare, the automation industry, to develop recommender systems, finance and banking, pattern recognition, NLP, computer vision, research and innovation. An example of machine learning is Automated fraud detection. It helps to identify fraudulent activity by finding anomalies such as sudden large transactions or unusual network traffic. In short, the entire scope of Data Science and Artificial Intelligence is based on the algorithms developed in the field of Machine Learning
There are different ways to represent data graphically to make it easier to understand. Some common types of data visualization are as follows:
Quantitative visualization is used to represent numerical data. It focuses on quantities or numbers to show measurable data. It is used to display data that can be measured or counted. For example, a bar chart showing sales figures of a company over several months effectively communicates quantitative information.
Categorical visualization is used to represent data that falls into distinct categories. It helps to show proportions or parts of a whole. This type is ideal for displaying nominal or ordinal data. For example, a pie chart showing the market share of different companies in a specific industry.
Temporal visualization is used to display data that changes over time. It is used for time-series data. Line graphs are usually used to represent temporal data. For example, a line graph.
Spatial visualization is used to represent data related to physical locations or spaces, visualizing geographic or spatial data. For example, a heat map showing population density across different regions.
Multivariate visualization is used to represent data involving more than two variables or dimensions. Scatter plots and heat maps are effective tools to represent such data. For example, a scatter plot matrix showing relationships between income, age, and spending habits.
Interactive visualization allows users to interact with the data through digital platforms. These digital platforms can be dashboards and filters. A dashboard helps the user to filter and manipulate data visualizations and explore different trends and insights.
Statistical visualization is used to present data to show statistical properties, such as distribution or correlation. Histograms, box plots, and scatter plots are popular choices.
Information visualization is used to present complex data sets in an easier way, often for abstract conceptual data. Network diagrams, tree maps, and word clouds are effective tools. For example, a network diagram showing relationships between entities, such as a social network.
Ans: Data visualization is not only used to display exact data values but also to communicate the uncertainty or variability in data. Uncertainty means that the data may have errors, or predictions may vary due to changing conditions or incomplete information. Visualizations can show this uncertainty using special elements like error bars, shaded confidence intervals, or multiple possible outcome lines.
Two specific examples are:
When meteorologists predict temperatures, they often show a line graph t the expected temperature each day. Around this line, a shade area or band shows the possible temperature range (for example, the highest and lowest likely temperatures). This shaded area is called a confidence interval. It communicates that while the forecast predicts a certain temperature, actual temperatures may fall within this range due to natural weather variability.
During a pandemic, scientists use models to predict how many people might get infected in the future. These models include different scenarios based on factors such as social distancing or vaccination rates. A chart may show several lines or a Shaded area to represent best-case and worst-case predictions. This visualization communicates uncertainty by showing that the actual number of cases could vary widely depending on future events and behavior.
Ans: Choosing appropriate visualizations involves several key considerations to ensure effective communication and analysis of data. The visualization must match the type of data and the goal of the analysis.
Below are the key considerations:
Like Artificial Intelligence, data science and machine learning, data visualization is useful in almost all fields of life.
Some of them are as follows:
Business Intelligence: Data visualization helps to make data-driven, well-informed decisions. It is used to find market trends and helps to track and improve performance.
Healthcare: It helps to visualize the impact of various diseases affecting the patient. It is helpful to track disease and visualize the spread of disease.
Education: Data visualization is very helpful to teach data literacy skills, concept building, creative thinking, and critical thinking
Science and Research: It is useful to visualize complex findings, very huge and complex data, such as complex scientific data received from satellites in the form of photographs.
Sports and gamming: It is useful to visualize performance of players, whether they are playing football on a ground or chess players playing in online tournaments. It is also helpful in sports broadcasting and other Predictions.
Finance: It is helpful to analyse market trends, to track portfolio performance and to identify investment opportunities.
Entertainment: It helps the entertainment industry to visualize movie performance data to predict future trends. It helps in content optimization by visualizing audience insights and trends.
Poor data quality can significantly affect the performance of a data model, especially in fields like Machine Learning and Data Science. When the input data is incorrect, incomplete, inconsistent, or outdated, it leads to unreliable outputs. Below are some key consequences:
Models trained on poor-quality data may produce wrong or misleading results.
For example, a disease prediction model may give false results if the training data has missing or incorrect patient records.
Overfitting happens when the model learns too much noise from the data, while underfitting occurs when it cannot capture patterns at all.
Poor data quality causes both issues, reducing the model’s ability to generalize to new data.
If the dataset is biased or unbalanced (e.g., too many examples of one category), the model may give unfair or incorrect results.
For example, a face recognition system might fail if it were trained mostly on one ethnicity.
Time, computing power, and storage are wasted when models are trained on low-quality data.
Analysts may also spend extra time cleaning and reprocessing data instead of focusing on analysis.
In real-world applications like business, healthcare, or finance, bad data can lead to wrong decisions, financial loss, or even safety risks.
1. Which of the following is the primary benefit of integrating Mathematics and Statistics with Computer Science in Data Science?
A) Improved data visualization
B) Better forecasting
C) Increased accuracy
D) Better decision making
2. Which of the following best describes the relationship between Data Science and Artificial Intelligence?
A. Data Science is a subset of Artificial Intelligence
B. Artificial Intelligence is a tool used in Data Science
C. Data Science and Artificial Intelligence are unrelated
D. Data Science enables Artificial Intelligence
3. The Turing Test, proposed by Alan Turing in 1950, measures a machine’s ability to exhibit intelligent behaviour. Which of the following is the fundamental assumption that underlies this test?
A. Humans are better
B. Machines are equal
C. Intelligence levels vary
D. Machines copy humans
4. Which of the following should be considered critically while developing AI-powered chatbots and virtual assistants?
A. User experience
B. Data security
C. Contextual awareness
D. Emotional intelligence
5. What ethical consideration arises from the integration of Artificial Intelligence (AI) into daily life devices?
A. Job displacement due to automation.
B. Increased energy consumption.
C. Improved customer service.
D. Enhanced data security.
6. Which of the following fields of Artificial Intelligence (AI) enables smartphones to recognize faces and unlock devices?
A. NLP
B. Computer vision
C. Deep learning
D Neural networks
7. A company wants to develop a system that categorizes customer feedback into positive, negative, or neutral. Which learning model would be most suitable?
A. Supervised learning
B. Unsupervised learning
C. Reinforcement learning
D. Deep learning
8. In a Reinforcement Learning model, what is the primary function of rewards and penalties provided as feedback to the agent?
A. Labelling data
B. Evaluating performance
C. Improving action choices
D. Classifying outcomes
9. Which stage of the data science life cycle ensures the model’s accuracy, reliability, and compliance with privacy rules?
A. Model Deployment
B. Model Evaluation
C. Data Analysis
D. Maintenance and Monitoring
10. Which of the following is the key characteristic of the “Data Cleaning” stage in the data science life cycle?
A. Data collection
B. Error removal and data organization
C. Pattern identification
D. Model deployment
Unit 8: Entrepreneurship in Digital Age Write answers of the following short response questions. Q.1.…
Unit 7: Digital Literacy Write answers of the following short response questions. Q.1. Differentiate between…
Unit 6: Impacts of Computing Write answers to the following short response questions. Q1. List…
Unit 5: Applications of Computer Science Write answers of the following short response questions. Q1.…
10th Computer Unit 3: Programming Fundamentals Unit 3: Programming Fundamentals Write answers of the following…
Unit 2: Computational Thinking & Algorithms Write answers of the following short response questions. 1.…