#### Quiz 1: Data and Analysis in the Real World

Q1. What statement below best describes why we do data analytics in business?

Refer to the following video for a refresher: video 1.

• Analytics improve our understanding of how the business works
• We must show a return on the investment we make in data & analytical resources
• We need specific insights to make business decisions
• We have to calculate & report financial results to owners / shareholders

Q2. What should you consider as you approach an analytical problem, and in which order? Identify the correct order for the following ideas/steps.

For example, if you think they are already in the correct order, the correct answer would be ABCDEF.

• A. Sourcing Data
• B. Analysis Outputs
• C. Execute Analysis
• D. Analysis Methods
• E. Define Decision
• F. Data Needs

Refer to the following video for a refresher: video 1.

ABCDEF

• A. Sourcing Data
• B. Analysis Outputs
• C. Execute Analysis
• D. Analysis Methods
• E. Define Decision
• F. Data Needs

EBDAFC

• E. Define Decision
• B. Analysis Outputs
• D. Analysis Methods
• A. Sourcing Data
• F. Data Needs
• C. Execute Analysis

EBDFAC

• E. Define Decision
• B. Analysis Outputs
• D. Analysis Methods
• F. Data Needs
• A. Sourcing Data
• C. Execute Analysis

BDFACE

• B. Analysis Outputs
• D. Analysis Methods
• F. Data Needs
• A. Sourcing Data
• C. Execute Analysis
• E. Define Decision

Q3. What diagram below best describes the relationship between a mobile wireless carrier account holder and devices at a point in time?

A – shows an oval with the account connected to an oval with the device by a straight line

B – shows an oval with the account connected to an oval with the device by a straight line with a forked end on the Account side

C – shows an oval with the account connected to an oval with the device by a straight line with a forked end on the Device side

D – shows an oval with the account connected to an oval with the device by a straight line with forked ends on both sides

Refer to the following video for a refresher: video 2.

• A
• B
• C
• D

Q4. For the next 5 questions that describe types of metrics, select a source that best describes where the following data might come from:

The average temperature of a turbine bearing over the last 8 hours

Refer to the following video for a refresher: video 1.

• Billing System
• Usage Tracking System
• Customer Relationship Management System
• Machine Data System
• Enterprise Resource Planning System
• Ticketing / Workflow System

Q5. Select a source that best describes where the following data might come from:

The number of developers allocated to a company software project

• Billing System
• Usage Tracking System
• Customer Relationship Management System
• Machine Data System
• Enterprise Resource Planning System
• Ticketing / Workflow System

Q6. Select a source that best describes where the following data might come from:

Household water consumption by month

• Billing System
• Usage Tracking System
• Customer Relationship Management System
• Machine Data System
• Enterprise Resource Planning System
• Ticketing / Workflow System

Q7. Select a source that best describes where the following data might come from:

The dollar amount of unpaid invoices at the end of a month

• Billing System
• Usage Tracking System
• Customer Relationship Management System
• Machine Data System
• Enterprise Resource Planning System
• Ticketing / Workflow System

Q8. Select a source that best describes where the following data might come from:

The average age of customers in Madison, Wisconsin

• Billing System
• Usage Tracking System
• Customer Relationship Management System
• Machine Data System
• Enterprise Resource Planning System
• Ticketing / Workflow System

Q9. Why is it important for data analysts to understand the value-chain (process) associated with information and the analytical process?

Refer to the following videos for a refresher: videos 3 and 4

What do you think?

Ans: The more you understand about the way the business works and how information flows through business systems, the better prepared you will be to both execute and interpret your analysis. Also, the more skill you have in finding and accessing data, the more productive and valuable you will be as an analyst!

Q10. Identify the correct order of steps in the Information-Action Value Chain.

Refer to the following videos for a refresher: videos 3 and 4.

ABCDEFGHI

• A. Develop Strategy & Plan
• B. Deliver the Pitch
• C. Events & Characteristics in the Real World
• D. Take Action
• E. Data Capture by Source Systems
• F. Data Extraction
• G. Data Storage
• H. Analytical Methods
• I. Summarize & Interpret Results

CEFGHIABD

• C. Events & Characteristics in the Real World
• E. Data Capture by Source Systems
• F. Data Extraction
• G. Data Storage
• H. Analytical Methods
• I. Summarize & Interpret Results
• A. Develop Strategy & Plan
• B. Deliver the Pitch
• D. Take Action

CEGFHIABD

• C. Events & Characteristics in the Real World
• E. Data Capture by Source Systems
• G. Data Storage
• F. Data Extraction
• H. Analytical Methods
• I. Summarize & Interpret Results
• A. Develop Strategy & Plan
• B. Deliver the Pitch
• D. Take Action

CEGFHIBAD

• C. Events & Characteristics in the Real World
• E. Data Capture by Source Systems
• G. Data Storage
• F. Data Extraction
• H. Analytical Methods
• I. Summarize & Interpret Results
• B. Deliver the Pitch
• A. Develop Strategy & Plan
• D. Take Action

Q11. Why do we bring data together into a common location (select all that apply)?

Refer to the following video for a refresher: video 3.

• We can establish relationships among data sources
• It’s more convenient for extraction to have data in one place
• Sometimes we can’t access source systems directly
• Source data may be unstructured or not formatted for analysis

Q12. What type of analytics would you use to determine the best way to route delivery trucks to minimize miles driven or gasoline consumed?

Refer to the following video for a refresher: video 4.

• Descriptive
• Predictive
• Transitive
• Cognitive
• Prescriptive

#### Quiz 2: Analytical Tools

Q1. What type of file normally stores two dimensional data with column and row breaks, identified using special characters?

Refer to the following video for a refresher: video 1.

• XML File
• Log File
• Delimited Text File
• Excel File

Q2. What term best describes data storage that is optimized for handling front-end business operations?

Refer to the following video for a refresher: video 1.

• Online Analytical Processing (OLAP)
• Online Transactional Processing (OLTP)
• Document Store
• Hadoop Distributed File System (HDFS)

Q3. Suppose you are a software developer looking for an online environment to help you rapidly build and scale applications. Which of the following services would best accommodate your needs?

Refer to the following video for a refresher: video 2.

• Infrastructure as a Service (IaaS)
• Software as a Service (SaaS)
• Development as a Service (DaaS)
• Platform as a Service (PaaS)

Q4. Which of the following statements about Cloud computing are true? (select all that apply)

Refer to the following video for a refresher: video 2.

• Cloud computing is more secure than a company’s data center
• Cloud computing is needed for handling Big Data
• Cloud computing outsources all of a company’s data operations
• Cloud computing can allow cheaper and more scalable operations
• Cloud computing speaks to where data is stored or manipulated

Q5. Suppose your objective is to build a predictive model that can be used to recommend products to customers in real-time based on their navigation on your web site. Which of these technologies would be most critical in helping you achieve this objective?

Refer to the following video for a refresher: video 3.

• In-Memory Computing
• Data Virtualization
• Hadoop Distributed File System (HDFS)
• In-Database Analytics
• Data Federation

Q6. Suppose you are a data analyst working on a project to show why sales in a particular region are down relative to other regions. Your job is to figure out what’s going on, find a good way to show the data, and produce a report that can be automated to go out weekly to track progress on any actions that are taken. You anticipate that only descriptive analytics will be needed for this project, and you’re working from a data set that has been prepared by your partners in IT.

Which of the following classes of tools are you most likely to use directly in this project? (select all that apply)

Refer to the following video for a refresher: video 5.

• Dashboarding
• Database systems
• Standard reporting
• Data visualization & exploration
• Statistical modeling

Q7. Suppose you’re a data analyst and you’re traveling to a conference. There’s a straightforward but critical ad-hoc analysis you need to accomplish, but you’re not certain how much internet connectivity you’ll have during your trip. You also haven’t decided which of your desktop tools you’ll use in the analysis.

Which of the following process methodologies would work best for your situation?

Refer to the following video for a refresher: video 6.

• Intermediate File Approach
• Direct Connection Approach
• Downstream Integration Approach

Q8. For the following 9 questions, consider the set of relational database tables below, illustrated using the shorthand we introduced in the module.

For simplicity, assume that a phone number is a permanent attribute of the subscriber and that subscribers on both the “to” and “from” sides of calls are included in the database.

What kind of key best describes the primary key of the PLANS table?

Refer to the following video for a refresher: video 4.

• Natural
• Surrogate
• Composite

Q9. Which table, if any, has a Composite Primary Key?

• SUBSCRIBERS
• PLANS
• CALLS
• DEVICES

Q10. Select all the fields which COULD have foreign key relationships in THIS model:

• Device_SN
• Device_Type
• Phone_Number
• From_Number
• End_Time
• Plan_ID
• Model
• To_Number
• Manufacturer

Q11. In the next 6 questions, define the relationship between the tables.

Refer to the following video for a refresher: video 4.

SUBSCRIBERS to PLANS

• One to One
• One to Many
• Many to One
• Many to Many
• No direct relationship

Q12. CALLS to SUBSCRIBERS

• One to One
• One to Many
• Many to One
• Many to Many
• No direct relationship

Q13. DEVICES to SUBSCRIBERS

• One to One
• One to Many
• Many to One
• Many to Many
• No direct relationship

Q14. CALLS to DEVICES

• One to One
• One to Many
• Many to One
• Many to Many
• No direct relationship

Q15. PLANS to CALLS

• One to One
• One to Many
• Many to One
• Many to Many
• No direct relationship

Q16. DEVICES to PLANS

• One to One
• One to Many
• Many to One
• Many to Many
• No direct relationship

Q17. Is this data model in third normal form?

Refer to the following video for a refresher: video 4.

• Yes
• No

#### Quiz 3: SQL Coding Assignment

Q1. In this quiz, you’ll be writing queries based on the following database. Follow this link to access a larger picture of the database.

Note: your SQL code entries will not be saved between quiz attempts! Please copy and paste them somewhere so you don’t have to retype the entire code when you take the quiz again.

How many aircraft are there in the PLANES table?

Refer to the following video if you need a refresher: video 1.

• 25
• 50
• 562
• 10961

Q2. Write a query that provides a list of all planes that have a seat count of 100 or more, ordered from lowest to highest number of seats.

What is the TAIL_NUMBER of the plane with the second lowest number of seats in that list?

Refer to the following video if you need a refresher: video 2.

• N114DD
• N132AA
• N120EE
• N126AA
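
As a rough illustration of the filter-then-sort pattern this question asks for, here is a minimal sketch that runs against a toy in-memory SQLite table. The table name `planes`, the columns `tail_number` and `seat_count`, and every row are assumptions made up for the example; they are not the course database, so the output says nothing about the quiz answer.

```python
import sqlite3

# Toy stand-in for the quiz database. Table name, column names, and rows
# are all hypothetical; substitute the real schema from the course diagram.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE planes (tail_number TEXT PRIMARY KEY, seat_count INTEGER)")
conn.executemany(
    "INSERT INTO planes VALUES (?, ?)",
    [("T001", 50), ("T002", 180), ("T003", 100), ("T004", 120)],
)

# Keep planes with 100 or more seats, ordered from fewest to most seats.
rows = conn.execute(
    """
    SELECT tail_number, seat_count
    FROM planes
    WHERE seat_count >= 100
    ORDER BY seat_count ASC
    """
).fetchall()

# The second row of the ordered result is the second-lowest seat count.
second_lowest = rows[1][0]
print(rows)
print(second_lowest)
```

Note that `>=` rather than `>` is what matches "100 or more".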

Q3. Write a query that provides the number of flights flown by each aircraft.

Which aircraft flew the most flights?

Refer to the following video if you need a refresher: video 2.

• N125EE
• N135DD
• N120EE
• N111AA
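
Counting rows per group is usually done with `GROUP BY` and `COUNT(*)`. The sketch below shows the idea on a toy in-memory SQLite table; the `flights` table, its `tail_number` column, and the rows are hypothetical and unrelated to the quiz data.

```python
import sqlite3

# Hypothetical flights table: one row per flight, keyed to an aircraft.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE flights (flight_id INTEGER PRIMARY KEY, tail_number TEXT)")
conn.executemany(
    "INSERT INTO flights (tail_number) VALUES (?)",
    [("T001",), ("T002",), ("T001",), ("T003",), ("T001",), ("T002",)],
)

# One output row per aircraft, busiest first; the tie-breaker on
# tail_number keeps the ordering deterministic.
counts = conn.execute(
    """
    SELECT tail_number, COUNT(*) AS flight_count
    FROM flights
    GROUP BY tail_number
    ORDER BY flight_count DESC, tail_number ASC
    """
).fetchall()

busiest = counts[0][0]
print(counts)
print(busiest)
```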

Q4. Write a query that provides a list of planes that flew more than 600 passengers across all flights.

How many planes are in that list?

Refer to the following video if you need a refresher: video 2.

• 5
• 22
• 3
• 7
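
Filtering on an aggregate (here, a passenger total per plane) typically calls for `HAVING` rather than `WHERE`, because the condition applies after grouping. A minimal sketch on toy data follows; the `flights` table and the `tail_number` and `passengers` columns are assumptions for illustration only.

```python
import sqlite3

# Hypothetical flights table with a passenger count per flight.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE flights (tail_number TEXT, passengers INTEGER)")
conn.executemany(
    "INSERT INTO flights VALUES (?, ?)",
    [("T001", 300), ("T001", 350), ("T002", 200), ("T002", 400), ("T003", 700)],
)

# Sum passengers per plane, then keep only planes whose total exceeds 600.
heavy = conn.execute(
    """
    SELECT tail_number, SUM(passengers) AS total_passengers
    FROM flights
    GROUP BY tail_number
    HAVING SUM(passengers) > 600
    ORDER BY tail_number
    """
).fetchall()

print(heavy)
print(len(heavy))
```

In this toy data the plane with exactly 600 passengers is excluded, since the question says "more than 600".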

Q5. Write a query that provides the total number of flights by country.

How many flights originated in the United States (Country US)?

Refer to the following video if you need a refresher: video 3.

• 34
• 23
• 12
• 2

Q6. Write a query that provides the total number of flights by regionality.

Which regionality has the second highest number of flights?

Refer to the following video if you need a refresher: video 3.

• US-US
• US-ROW
• ROW-NA
• ROW-ROW

Q7. How many CITY_PAIRS are there which depart from one of the following airports?

KLAX, KDEN, KORD, KDET, KLGA

Hint: use the IN operator!

Refer to the following video if you need a refresher: video 5.

• 722
• 803
• 245
• 104
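
The hint about the `IN` operator can be sketched as below. The airport codes come from the question, but the `city_pairs` table layout, the `departure_airport` and `arrival_airport` column names, and the rows are hypothetical, so the count printed here is not the quiz answer.

```python
import sqlite3

# Hypothetical city_pairs table; column names are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE city_pairs (departure_airport TEXT, arrival_airport TEXT)")
conn.executemany(
    "INSERT INTO city_pairs VALUES (?, ?)",
    [
        ("KLAX", "KJFK"),
        ("KJFK", "KLAX"),
        ("KDEN", "KORD"),
        ("KSEA", "KDEN"),
        ("KLGA", "KBOS"),
    ],
)

# Airports listed in the question.
airports = ["KLAX", "KDEN", "KORD", "KDET", "KLGA"]

# Build one "?" placeholder per airport so values are bound, not interpolated.
placeholders = ", ".join("?" for _ in airports)
query = f"SELECT COUNT(*) FROM city_pairs WHERE departure_airport IN ({placeholders})"
(count,) = conn.execute(query, airports).fetchone()
print(count)
```

`IN (...)` is shorthand for a chain of `OR` equality tests on the same column.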

Q8. How many airports are missing elevation values?

Refer to the following video if you need a refresher: video 5.

• 10
• 8
• 6
• 4
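
Missing values are stored as `NULL` in SQL, and they have to be tested with `IS NULL`; an equality comparison against `NULL` never matches. The sketch below demonstrates both on a toy in-memory SQLite table whose name (`airports`), columns, and rows are invented for the example.

```python
import sqlite3

# Hypothetical airports table; None is stored as SQL NULL.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE airports (airport_code TEXT, elevation INTEGER)")
conn.executemany(
    "INSERT INTO airports VALUES (?, ?)",
    [("KAAA", 125), ("KBBB", None), ("KCCC", 0), ("KDDD", None)],
)

# Correct test for missing values.
(missing,) = conn.execute(
    "SELECT COUNT(*) FROM airports WHERE elevation IS NULL"
).fetchone()

# Common mistake: "= NULL" evaluates to NULL for every row, so nothing matches.
(wrong,) = conn.execute(
    "SELECT COUNT(*) FROM airports WHERE elevation = NULL"
).fetchone()

print(missing, wrong)
```

An elevation of 0 is a real value, not a missing one, so it is not counted.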

Q9. What flight number had the lowest passenger count (try using a subquery if you can!)?

Refer to the following video if you need a refresher: video 6.

• ALN626
• ALN745
• GLB805
• SKY996
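
One way to use a subquery here is to compute the minimum in an inner query and match rows against it in the outer query. This is a sketch on toy data; the `flights` table, the `flight_number` and `passenger_count` columns, and the rows are assumptions rather than the course schema.

```python
import sqlite3

# Hypothetical flights table with made-up flight numbers.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE flights (flight_number TEXT, passenger_count INTEGER)")
conn.executemany(
    "INSERT INTO flights VALUES (?, ?)",
    [("AB100", 120), ("AB200", 45), ("CD300", 210)],
)

# The inner query finds the smallest passenger count; the outer query
# returns every flight that has exactly that count.
lowest = conn.execute(
    """
    SELECT flight_number, passenger_count
    FROM flights
    WHERE passenger_count = (SELECT MIN(passenger_count) FROM flights)
    """
).fetchall()

print(lowest)
```

Unlike `ORDER BY ... LIMIT 1`, this form returns all tied flights if more than one shares the minimum.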

Q10. What is the average distance flown by SKY Airline flights (nearest mile)?

Refer to the following video if you need a refresher: video 6.

• 1474
• 1570
• 1577
• 1768

#### Quiz 4: Introduction to Data Analytics for Business

Q1. In the next 6 questions, indicate which specialist you’d seek out to help with each of the scenarios listed.

Refer to video 1 if you need a refresher!

You’ve just completed an analysis that reveals the importance of a few metrics that business leaders would like to see monthly. You need someone to help you productionalize and automate a monthly report containing those metrics.

Who should you talk to?

• IT Infrastructure Resource
• Application Developer
• Data Architect
• ETL Developer
• BI Developer
• Database Analyst
• Data Analyst
• Modeler
• Data Scientist

Q2. You’ve arranged for an external partner to send you data each day. You need someone to help set up a file transfer process that will allow that partner to securely connect to your company through a firewall.

Who should you talk to?

• IT Infrastructure Resource
• Application Developer
• Data Architect
• ETL Developer
• BI Developer
• Database Analyst
• Data Analyst
• Modeler
• Data Scientist

Q3. To ensure that the results of a data analysis can be placed into context, you need someone who can examine how certain business processes work and help you map them out.

Who should you talk to?

• IT Infrastructure Resource
• Application Developer
• Data Architect
• ETL Developer
• BI Developer
• Database Analyst
• Data Analyst
• Modeler
• Data Scientist

Q4. You’ve done a descriptive analysis that seems to show a correlation between customer defection and several customer characteristics, but you think that a formal statistical procedure would yield more powerful results that can predict churn. You need someone who knows how to do this.

Who should you talk to?

• IT Infrastructure Resource
• Application Developer
• Data Architect
• ETL Developer
• BI Developer
• Database Analyst
• Data Analyst
• Modeler
• Data Scientist

Q5. You know that a new product is coming online and you’d like to understand how measurements around that product will be represented in the database model.

Who should you talk to?

• IT Infrastructure Resource
• Application Developer
• Data Architect
• ETL Developer
• BI Developer
• Database Analyst
• Data Analyst
• Modeler
• Data Scientist

Q6. You’re finding that the SQL queries you are writing against your data warehouse are taking a long time to run. You need someone who can help you determine if your queries are written in the best way. Who should you talk to?

• IT Infrastructure Resource
• Application Developer
• Data Architect
• ETL Developer
• BI Developer
• Database Analyst
• Data Analyst
• Modeler
• Data Scientist

Q7. Consider the following structure for analytic organizations. Pick which structure would fit best in each scenario.

Refer to video 2 if you need a refresher!

Your company is functionally organized. The data sources and analytical techniques tend to be pretty similar across functions, and most resources are located in a headquarters building in downtown Chicago. The executive team gets along, but its members are very protective of their teams and work product.

• Allocated Model
• Centralized Model
• Distributed Model
• Coordinated Model

Q8. Your company is a multinational organization that operates in a number of distinct industries. Each industry uses its own methods and tends to hire somewhat different types of people into analytical organizations.

Which structure would best fit this scenario?

• Allocated Model
• Centralized Model
• Distributed Model
• Coordinated Model

Q9. Your company is organized by customer groups, which are mostly distinct but have some limited overlaps. Analyses vary in similarity – some are very similar, but others are quite different. They do, however, use most of the same data sources. Currently, resources are located within each customer group organization, but it’s pretty typical for there to be only one or two analysts in each area.

Which structure would best fit this scenario (mark all that apply)?

• Allocated Model
• Centralized Model
• Distributed Model
• Coordinated Model

Q10. What term best describes the process of identifying and standardizing an organization’s most critical data?

Refer to the following video if you need a refresher: video 3.

• SOX Compliance
• Master Data Management
• Data Governance
• Data Stewardship

Q11. Who is responsible for making sure that a data domain is correctly represented and used within an organization?

Refer to the following video if you need a refresher: video 3.

• Data Steward
• Data Architect
• Data Governance Council
• SOX Compliance Auditor
• ETL Developer

Q12. A large drugstore chain wants to use prescription data from its pharmacy to make complementary relevant offers to specific customers via custom coupon books, delivered via direct mail.

What is the most limiting standard that might be relevant in this case?

Refer to the following video if you need a refresher: video 4.

• Policy Standards
• Legal Standards
• Good Judgement
• Ethical Standards

Q13. A Mobile Phone Company wants to construct and sell ‘profiles’ of customers based on a combination of internet sites visited and location data. The profiles would provide aggregate information that is not considered CPNI.

What are the most limiting standards that might be relevant in this case? (select all that apply)

Refer to the following video if you need a refresher: video 4.

• Policy Standards
• Legal Standards
• Good Judgement
• Ethical Standards

Q14. Consider the following aspects of Data Quality. Indicate which is the dominant issue in each case below.

Refer to video 5 if you need a refresher!

A few years ago your company acquired another company and merged summary financial data into a key database. The data looks complete but there are some peculiarities we can’t explain.

• Completeness / Uniqueness
• Accuracy / Consistency
• Conformance / Validity
• Timeliness
• Provenance

Q15. Your company wants a mobile application that allows certain purchases to be made via the application. However, those transactions use the date / time of the user’s device as the timestamp of the transaction that is stored in the purchase database.

• Completeness / Uniqueness
• Accuracy / Consistency
• Conformance / Validity
• Timeliness
• Provenance

Q16. We notice that when we join data from two different tables we need to be careful to convert the time zone in one table from Central Standard Time (CST) to Coordinated Universal Time (UTC) to match the second table, even though the company standard is UTC.

• Completeness / Uniqueness
• Accuracy / Consistency
• Conformance / Validity
• Timeliness
• Provenance
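
The conversion described in this scenario can be sketched as follows. It assumes the first table stores naive timestamps that are always in Central Standard Time at a fixed UTC-6 offset (no daylight saving); the helper name `cst_to_utc` and the sample timestamp are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Assumption: timestamps are always CST, a fixed offset of UTC-6.
# Data that switches to daylight time would need a full time zone database.
CST = timezone(timedelta(hours=-6), "CST")


def cst_to_utc(naive_local: datetime) -> datetime:
    """Label a naive timestamp as CST, then convert it to UTC."""
    return naive_local.replace(tzinfo=CST).astimezone(timezone.utc)


converted = cst_to_utc(datetime(2024, 1, 15, 9, 30))
print(converted.isoformat())
```

Applying a step like this before the join puts both tables on the same clock, so rows are matched on the same instant rather than on the same wall-clock reading.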

Q17. On your company’s website, a customer can accidentally click a purchase button twice, which results in two purchase records being generated. Luckily, these purchases are filtered by the credit card payment processing system and removed from the company’s general ledger. However, those records are not removed from the analytical data warehouse.

• Completeness / Uniqueness
• Accuracy / Consistency
• Conformance / Validity
• Timeliness
• Provenance

