Business Intelligence Assignment

 Complete the following assignment in one MS word document: 

Chapter 4 – discussion question #1-5 & exercise 1

When submitting work, be sure to include an APA cover page and include at least two APA formatted references (and APA in-text citations) to support the work.
All work must be original (not copied from any source).

Discussion Questions:

1. Define data mining. Why are there many names and definitions for data mining? 

2. What are the main reasons for the recent popularity of data mining? 

3. Discuss what an organization should consider before making a decision to purchase data mining software.

 4. Distinguish data mining from other analytical tools and techniques.

 5. Discuss the main data mining methods. What are the fundamental differences among them? 

Exercise 1:

 Visit teradatauniversitynetwork.com. Identify case studies and white papers about data mining. Describe  recent developments in the field of data mining and  predictive modeling 

Exp19_Access_Ch05_CapAssessment – Paterson Credit Union 1.1

Exp19_Access_Ch05_CapAssessment – Paterson Credit Union 1.1

Exp19 Access Ch05 CapAssessment Paterson Credit Union 1.1

Access Chapter 5 Capstone Assessment – Paterson Credit Union

  

Project Description:

You work as a database administrator at the Paterson Credit Union. You are modifying a database to add validation, lookup fields, and an input mask. You will also modify queries to take advantage of advanced functions and features.

     

Start   Access. Open the file named Exp19_Access_Ch05_CapAssessment_Paterson_Credit_Union.accdb.   Grader has automatically added your last name to the beginning of the   filename.

 

You   want to make sure that the customer account types are documented and stored   correctly. To do this you will create a table that will list each account   type.
 

  Use Design view to create a new table. Add AccountType   as the first field name, with data type Short Text and field size 10. Ensure AccountType is set as the   primary key. Save the table and name it AccountTypes.   Add three records: Platinum,   Silver, and Gold.   Save and close the table.

 

Now,   you wish to ensure that, when customers are added to your database, the phone   number and account type must be entered. To do this you will set the   PhoneNumber and AccountType fields as required fields.
 

  Open the Customers table in Design view. Set the PhoneNumber and AccountType   fields to Required. Save and close the table.

 

Paterson   Credit Union only offers loans with interest rates between 2.0% and 10.25%.   To ensure that no loans are offered outside of those constraints you will add   a validation rule that will not allow loans outside of that range to the   InterestRate field in the Loans table.
 

  Open the Loans table in Design view. Establish a validation rule for the   InterestRate field that requires the value to be greater than or equal to 2.0 but less than or equal to 10.25. Create validation text for the   InterestRate: Value must be between 2   and 10.25   (no period). Save the table and switch to Datasheet view. Change the   InterestRate in the first record to 1.9.   The validation text appears. Press ESC to restore the original value. Close   the Loans table.

 

You’ve   made the PhoneNumber field required in the Customers table, but now you want   to ensure that phone numbers are entered in a specific format. To do this you   will add an input mask to the PhoneNumber field in the Customers table.
 

  Open the Customers table in Design view. Add a phone number input mask for   the PhoneNumber field, storing the symbols with the data.

 

You   would like to easily add the account type for each customer without typing   anything on your keyboard. To do this you will turn the AccountType field   into a Lookup Wizard using the AccountTypes table, that you recently created,   as the source.
 

  Change the Data Type of the AccountType field to Lookup Wizard. Use the   AccountTypes table for the values in the lookup field, select the AccountType   field from the table, accept the default sort, accept default column widths,   and then accept the default name AccountType.   Save the table. Switch to Datasheet view.

 

Change   the account type to Platinum in the   first record. Close the table.

 

For   ease of use, you would like for users to be able to indicate the minimum loan   amount on which they would like to pull loan information. You will do this by   adding a parameter criterion to the LoanAmount field in the Customer Loans   Parameter query.
 

  Open the Customer Loans Parameter query in Design view. Add criteria for the   Amount field. The user should be prompted to Enter Minimum   Loan Amount (no period). The query should display   all records that have a loan Amount that is greater than or equal to the   value entered as the parameter. Run the query. Enter 250000 when prompted to Enter Minimum Loan   Amount. You should have five results. Ensure that the query results display a   total at the bottom of the Date column, and an average at the bottom of the   Amount column. Save and close the query.

 

You   have noticed that a few of your customers are missing address information. You   would like to address this by creating a query that returns only the   customers that are missing addresses so that you can update that information.   You will complete this by adding a field that indicates whether an address is   missing then adding criteria to that field so that only customers with   missing addresses are returned.
 

  Open the Missing Addresses query in Design view. Add a new column to   determine if a customer does not have an address on file. If the customer’s   Address is null, it should display Missing.   If not, it should display nothing. Name the column AddressPresent. Add criteria of Missing to the column you just created, so   only the customers missing an address display. Move the AddressPresent field   so it appears between PhoneNumber and Address. Run the query. Ensure only   customers with null Address fields display. Save and close the query.

 

For   simplicity, you are now interested in rounding the interest rates for each   loan to the nearest whole number. To do so, you will utilize the Round   function in the Loans by Interest Rate query.
 

  Open the Loans By Interest Rate query in Design view. Create a new column to   round the InterestRate of each Loan to the nearest whole number. Name the   field RoundedRate. Run the query and verify the RoundedRate   column displays whole numbers. Save and close the query.

 

Seeing   what the total and average payments month over month are is important to your   operation. To display this information, you will use the DatePart function to   extract the month from the PaymentDate field then ensure that the query is   grouped by month.
 

  Open the Payment By Month query in Design view. Change the first column so   that instead of grouping by the payment date, you group by the month. Use the   DatePart function to extract the month from the date. Name the column MonthNumber. Group by the MonthNumber field and   display the Sum of the first Total field and the Average of the Average   field. Run the query. The first line should read 2 (as the month,   representing February), with a total of $5,246.51 as the total payments   received and $1,311.63 as the average payment amount. Ensure that the query   results display a total at the bottom of the Total column, and an average at   the bottom of the Average column. Save and close the query.

 

Finally,   you would like to classify the various loans as either high or low priority   for the Credit Union. To do this you will add a column that determines   whether the interest rate for a loan is greater than or equal to 7.9%, as   that is what is considered high priority.
 

  Open the Refinance Candidates query in Design view. This query displays all   adjustable loans in the database. Create a new column to display High Priority for all loans that have an   InterestRate of 7.9% or more, and Low Priority   otherwise. Name the field Priority.   Run the query. Notice customers with the highest interest rate values show a   higher priority. Save and close the query.

 

Save   the database. Close the database, and then exit Access. Submit the database   as directed.

Week 7: Data Analysis

 (THIS WILL BE RAN THROUGH TURNITIN)

Instructions

Using MS Excel, create a new workbook with the following (Save as w7_firstname_lastname.xlsx or w7_firstname_lastname.xls).

As an administrator, you recorded the requests from different locations (i.e. a few states are listed) in the sheet below, you may create your own sheet. You need to explain the collected data. In the table below, you have the requests against the locations.

Software Installation requestsSoftware Update requestsHardware Installation requestsWV231215MD1385VA1746DC1462NY741FL292111

You can use any data gathering for your choice

Create at least 5 statistical functions and graphs relating to the data.

From the data discuss the trend (your conclusion, what does this data mean for your planning), You can use the question of “What if… then”

Introduction to What-if Analysis

Submit your week 7 work in w7_firstname_lastname.xlsx or w7_firstname_lastname.xls

Requirements

Points

Data Layout

20

Statistical functions and graph

50

Data Analysis Conclusion

30

TOTAL POINTS

100

Essay Paper

  

Select a recently published (5 years old MAX)  article from an academic journal or academic conference related to software engineering.

1) Summarize the paper, do not just quote the opening summary.

2) Describe the salient point that the author(s) researched or presented as “new”

3) Check at least one source from the bibliography and describe specifically what the authors of your selected paper referenced.

4) Describe any shortfalls or items the authors of your selected paper did not address

5) List and describe at least one possible item left unanswered that could still be researched (this may be what the authors “left out”)

6) You must cite the journal or conference

7) This is NOT to be from a marketing web page, company whitepaper, or Wikipedia.

This must be in DOC or PDF format but should resemble a published article regarding font selection and size. I do not expect it to be in 2-column format.

This is similar to 1/4 of a doctoral comprehensive exam, but you are not answering any specific question. You should think of this as preparation for a one-hour presentation. You are NOT presenting, nor are you creating a presentation. This is just a paper that should be no longer than two pages, single space 12pt text. Images ARE allowed but should be similar in size to the original article. No more than one half page of your entire paper should be images.

A bibliography should be included at the end of your paper, that is similar to the original article but should only have the few articles you examined & discussed.

Distributed Hadoop Mapreduce – Course recommendation

Title: Design and develop a Distributed Recommendation System on Hadoop

Problem statement: 

 Given 2 CSV data sets: 

(a) A course dataset containing details of courses offered

(b) A job description dataset containing a list of job descriptions 

(Note: Each field of a job description record is demarcated by ” “)

You have to design and implement a distributed recommendation system using the data sets, which will recommend the best courses for up-skilling based on a given job description. You can use the data set to train the system and pick some job descriptions not in the training set to test. 

It is left up to you how you pick necessary features and build the training that creates matching courses for job profiles.

Use Map Reduce and Python

NOTE: Combine all data_job_posts.csv into a single CSV file. Due to size limitation, I had to split the file

Capstone research paper and Power Point Presentation for the Project: Handwriting Recognition. Python Language basics needed.

Project: Handwriting Recognition.

Total 20 Pages

Cover page 

Reference list page (Total 10 References)

Literature review page (10 Pages)

Eight pages about the project. (This can include the screen print, figures/tables, code, etc.)

Total cover page + Literature review + project details + Reference list (1 + 10 + 8 + 1): Total 20 Pages 

APA format: 12-point font: Time New Roman: Double spaced: in-text citations required. 

Project Presentation PPT:

15 Slides are needed.

DEADLINES:

Research paper due- 10th April 2023 by 10 PM CST

PPT due on 13th April 2023 by 10 PM CST

Note:

I am attaching the two sample research papers in APA format and also the proposal of the project for your reference.

Presentation on Enterprise System and Enterprise Architecture Considerations

PowerPoint 

Thesis statement 

• 3-6 slides that provide an evaluation of what each organization did right and what it did wrong; and compare and contrast the organizations

• 1-4 slides on positioning of these case studies for their ability to enable the architecture (i.e. are the “hooks” in place to enable EA?) Very important!

And make sure to not make the slides too detailed as the professor instructed to have the majority of the info added as a note in the slide