Data Analysis

 

For this portion of the project, you will examine your dataset for incorrect data. Any incorrect data should be removed, corrected, or imputed. Follow these steps:

  • Remove irrelevant data. If you are unsure whether data is irrelevant, keep it.
  • Remove duplicate records.
  • Make sure numbers are interpreted as numerical data types.
  • Fix typos.
  • Standardize values (for example, consistent units, date formats, and capitalization).
  • Investigate outliers.
  • Check and manage missing values.
  • Format and normalize data if needed.
  • Change categorical values into numbers if needed.
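
Several of the steps above can be sketched in a few lines of code. The example below is only an illustration using the Python standard library (a library such as pandas would be the more usual choice); the records, the column names (id, city, age, income), and the choice of median imputation are all assumptions made up for the sketch, not part of the assignment.

```python
import statistics

# Hypothetical raw records; column names and values are invented for illustration.
raw = [
    {"id": "1", "city": "new york ", "age": "34", "income": "52000"},
    {"id": "1", "city": "new york ", "age": "34", "income": "52000"},  # exact duplicate
    {"id": "2", "city": "Boston", "age": "", "income": "61000"},       # missing age
    {"id": "3", "city": "boston", "age": "29", "income": "58000"},
]


def clean(records):
    # 1. Remove exact duplicate records while preserving order.
    seen, rows = set(), []
    for rec in records:
        key = tuple(sorted(rec.items()))
        if key not in seen:
            seen.add(key)
            rows.append(dict(rec))

    for row in rows:
        # 2. Standardize text: trim whitespace and use one capitalization.
        row["city"] = row["city"].strip().title()
        # 3. Interpret numeric columns as numbers; empty strings become None.
        for col in ("age", "income"):
            row[col] = float(row[col]) if row[col].strip() else None

    # 4. Impute missing ages with the median of the observed ages (one possible choice).
    median_age = statistics.median(r["age"] for r in rows if r["age"] is not None)
    for row in rows:
        if row["age"] is None:
            row["age"] = median_age

    # 5. Change the categorical column into integer codes.
    codes = {c: i for i, c in enumerate(sorted({r["city"] for r in rows}))}
    for row in rows:
        row["city_code"] = codes[row["city"]]
    return rows


cleaned = clean(raw)
```

Whether to remove, correct, or impute a given value depends on the dataset, so each decision should still be justified in the written summary.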

Once you have completed this, you will need to provide a Word document summarizing the pre-processing steps performed on your dataset. 

Are Your Articles Research Articles? Feb 4

Assignment Content

  1. In class today, you were asked to find research articles that explore your approved research question. (DO NOT ATTEMPT THIS ASSIGNMENT UNLESS YOU HAVE COMPLETED THE PREVIOUS ASSIGNMENT and gained approval of a research question: Research Question Proposals.)

    For this assignment, follow directions CAREFULLY: 

    Step 1: Download the document below, which is your Finals Packet:

    Final ‘Mission’ Packet.docx

    Step 2: Complete p. 13a, PHASE 1 ONLY for only three articles.
    Step 3: Then, copy and paste the entire activity into a Word document and upload it here to be graded and approved. 

    *For each database you use, submit a screenshot or photo of your parameters (full text, research journals, timeline not over 10 years). This will help your professor troubleshoot any problems you may have had, and it will allow your professor to provide suggestions. 
    Grading is as follows:

    Article 1 (Basic Research Article): 34 pts

    • HOW = Name the methodology AND cut and paste AND highlight indicator words 
    • WHAT = Findings 

    Article 2 (Basic Research Article): 33 pts

    • HOW = Name the methodology AND cut and paste AND highlight indicator words 
    • WHAT = Findings 

    Article 3 (Basic OR Applied Research Article): 33 pts

    • HOW = Name the methodology AND cut and paste AND highlight indicator words 
    • WHAT = Findings 

Big Data Final Year Projects

 Takeoff Projects helps students complete their academic projects. Register at Takeoff Projects today to find and learn about interesting big data projects and grab the best jobs. Get started right now. 

Data Representation

Compute xmin, the column vector with the smallest norm, and xmax, the column vector with the largest norm. (Ties are broken arbitrarily.) If applied to the matrix X above, the result should be: xmin = [1 0]^T, xmax = [2 3]^T. You may assume that a function “norm(x)” exists such that norm(xi) is the norm of column xi.
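
One possible way to compute the two vectors is sketched below. The matrix X used here is hypothetical (the original matrix is not reproduced in this text); it was chosen only so that its columns include (1, 0) and (2, 3). The norm function is assumed to be the Euclidean norm, and the helper name min_max_columns is made up for the example.

```python
import math


def norm(x):
    """Euclidean norm of a vector given as a sequence of numbers (assumed definition)."""
    return math.sqrt(sum(v * v for v in x))


def min_max_columns(X):
    """Return (xmin, xmax): the columns of X with the smallest and largest norm."""
    columns = [list(col) for col in zip(*X)]  # X is stored as a list of rows
    # min/max return the first match on ties, which satisfies "broken arbitrarily".
    return min(columns, key=norm), max(columns, key=norm)


# Hypothetical 2x3 matrix whose columns are (1, 0), (2, 3), and (1, 1).
X = [[1, 2, 1],
     [0, 3, 1]]
xmin, xmax = min_max_columns(X)
```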

Dictionary Attacks

 

Windows stores passwords using the NT LAN Manager (NTLM) hash. This hash is created by hashing the plaintext password with the MD4 algorithm. No salt is used. The lack of salt enables the use of a rainbow table to look up passwords from their hashes. For this assignment you will create a rainbow table for a set of passwords. 

More about NTLM: http://techgenix.com/how-cracked-windows-password-part1/

Use the following steps as a guide to creating the rainbow table for NTLM passwords.

  1. Use the sys library to read a command line argument that is the name of a password dictionary file.
    For your submission, use the password dictionary file found at this link: http://www.openwall.com/passwords/wordlists/password-2011.lst
  2. Open the password dictionary. 
  3. Loop through the contents of the password dictionary one password at a time. For each password, compute its NTLM hash and store the pair in a Python dictionary, with the hash as the key and the password as the value.
  4. Sort the list of keys from your dictionary.
  5. Use a for loop to print all keys and passwords. Print one hash and password pair per line. Structure each line as follows to allow a password to be looked up easily from its hash:
    [hash]:[password]
  6. Store the output of your program as rainbow_table.txt. 
    *** Run your program as ‘python3 rainbowtable.py dictionary.txt > rainbow_table.txt’
  7. Name your script rainbowtable.py
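
The steps above could be sketched roughly as follows. This is an illustration under stated assumptions, not a reference solution. NTLM hashes the password encoded as UTF-16LE with MD4; because hashlib.new("md4") is unavailable on some Python builds (MD4 is a legacy algorithm in OpenSSL 3), the sketch carries its own small MD4 routine following RFC 1320. The helper names (md4, ntlm_hash, build_table) are made up for the example, and skipping lines that begin with "#!comment:" assumes the wordlist uses that prefix for its header.

```python
import struct
import sys


def _rotl(value, shift):
    """Rotate a 32-bit value left by shift bits."""
    return ((value << shift) | (value >> (32 - shift))) & 0xFFFFFFFF


def md4(data: bytes) -> bytes:
    """Pure-Python MD4 digest following RFC 1320."""
    state = [0x67452301, 0xEFCDAB89, 0x98BADCFE, 0x10325476]
    bit_len = (8 * len(data)) & 0xFFFFFFFFFFFFFFFF
    # Pad with 0x80, then zeros up to 56 mod 64, then the 64-bit little-endian length.
    data += b"\x80" + b"\x00" * ((55 - len(data)) % 64) + struct.pack("<Q", bit_len)
    round3_order = (0, 8, 4, 12, 2, 10, 6, 14, 1, 9, 5, 13, 3, 11, 7, 15)
    for offset in range(0, len(data), 64):
        x = struct.unpack("<16I", data[offset:offset + 64])
        a, b, c, d = state
        for i in range(48):
            if i < 16:    # round 1
                f, k, s, const = (b & c) | (~b & d), i, (3, 7, 11, 19)[i % 4], 0
            elif i < 32:  # round 2
                j = i - 16
                f, k = (b & c) | (b & d) | (c & d), (j % 4) * 4 + j // 4
                s, const = (3, 5, 9, 13)[i % 4], 0x5A827999
            else:         # round 3
                f, k = b ^ c ^ d, round3_order[i - 32]
                s, const = (3, 9, 11, 15)[i % 4], 0x6ED9EBA1
            updated = _rotl((a + f + x[k] + const) & 0xFFFFFFFF, s)
            a, b, c, d = d, updated, b, c  # rotate the four registers
        state = [(s0 + v) & 0xFFFFFFFF for s0, v in zip(state, (a, b, c, d))]
    return struct.pack("<4I", *state)


def ntlm_hash(password: str) -> str:
    """NTLM hash: MD4 over the UTF-16LE encoding of the password, as hex."""
    return md4(password.encode("utf-16-le")).hex()


def build_table(lines):
    """Map each hash (key) to its password (value)."""
    table = {}
    for line in lines:
        password = line.rstrip("\r\n")
        # Assumption: header lines in the wordlist start with "#!comment:".
        if not password or password.startswith("#!comment:"):
            continue
        table[ntlm_hash(password)] = password
    return table


def main(path):
    with open(path, encoding="utf-8", errors="replace") as handle:
        table = build_table(handle)
    for key in sorted(table):  # sorted hashes make lookups easier
        print(f"{key}:{table[key]}")


# Runs only when exactly one command line argument (the dictionary file) is given.
if __name__ == "__main__" and len(sys.argv) == 2:
    main(sys.argv[1])
```

Keying the dictionary on the hash rather than the password is what makes the reverse lookup direct: given a hash, the password is a single dictionary access or a search in the sorted output file.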

You will be graded by

  

You will be graded on the quality of your response. Please do not copy someone else’s post; write your own.
You may look at the posts of others for ideas, but do not copy and paste them and present them as yours.
You can write one post that answers both parts, so think carefully about your post.
 

Here is the topic:
****************************
 

Write pseudocode that prompts the user to enter his/her age and then outputs
(i) the user’s age divided by 4
and
(ii) the remainder when the age is divided by 4
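
For comparison with the pseudocode, the same logic might look like this in Python. This is only a sketch: the function names are made up, and it assumes "divided by 4" means ordinary (non-integer) division; if integer division is intended, age // 4 would be used instead.

```python
def divide_age(age):
    """Return (age divided by 4, remainder when age is divided by 4)."""
    return age / 4, age % 4


def main():
    # Prompt the user, then report both results.
    age = int(input("Enter your age: "))
    quotient, remainder = divide_age(age)
    print("Age divided by 4:", quotient)
    print("Remainder when age is divided by 4:", remainder)
```

Calling main() runs the interactive version; divide_age can also be called directly with a number.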

40/s1

 company or use one that you are familiar with that has experienced information security change in the workplace. What information assurance change occurred, and were there any barriers to the change faced by the organization? Finally, what are the best practices when managing change?  

DATABASES QUESTIONS

QUESTION 1

 Given the following business scenario, create a Crow’s Foot ERD using a specialization hierarchy if appropriate. Two-Bit Drilling Company keeps information on employees and their insurance dependents. Each employee has an employee number, name, date of hire, and title. If an employee is an inspector, then the date of certification and certification renewal date should also be recorded in the system. For all employees, the Social Security number and dependent names should be kept. All dependents must be associated with one and only one employee. Some employees will not have dependents, while others will have many dependents. 

 Make sure that the file is saved as ch5_problem1_deleon.doc
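
As a rough aid for checking an ERD like this one, a specialization hierarchy can be mapped to tables in which the subtype shares the supertype's primary key. The sketch below uses Python's sqlite3 module; all table names, column names, and sample rows are assumptions invented for illustration, and it is one possible mapping rather than the expected answer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
CREATE TABLE employee (              -- supertype: attributes shared by all employees
    emp_num   INTEGER PRIMARY KEY,
    emp_name  TEXT NOT NULL,
    hire_date TEXT NOT NULL,
    title     TEXT NOT NULL,
    ssn       TEXT NOT NULL UNIQUE
);
CREATE TABLE inspector (             -- subtype: shares the supertype's primary key
    emp_num         INTEGER PRIMARY KEY REFERENCES employee(emp_num),
    cert_date       TEXT NOT NULL,
    cert_renew_date TEXT NOT NULL
);
CREATE TABLE dependent (             -- each dependent belongs to exactly one employee
    emp_num  INTEGER NOT NULL REFERENCES employee(emp_num),
    dep_num  INTEGER NOT NULL,
    dep_name TEXT NOT NULL,
    PRIMARY KEY (emp_num, dep_num)
);
""")

# Made-up sample rows: one inspector with two dependents.
conn.execute(
    "INSERT INTO employee VALUES (1, 'A. Example', '2015-03-01', 'Inspector', '000-00-0001')"
)
conn.execute("INSERT INTO inspector VALUES (1, '2016-01-15', '2026-01-15')")
conn.executemany(
    "INSERT INTO dependent VALUES (?, ?, ?)",
    [(1, 1, "Dependent One"), (1, 2, "Dependent Two")],
)
```

With foreign keys enabled, inserting a dependent for an employee number that does not exist raises sqlite3.IntegrityError, which mirrors the rule that every dependent must be associated with one and only one employee.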

————————————————————-

QUESTION 2

 Given the following business scenario, create a Crow’s Foot ERD using a specialization hierarchy if appropriate. Tiny Hospital keeps information on patients and hospital rooms. The system assigns each patient a patient ID number. In addition, the patient’s name and date of birth are recorded. Some patients are resident patients who spend at least one night in the hospital, and others are outpatients who are treated and released. Resident patients are assigned to a room. Each room is identified by a room number. The system also stores the room type (private or semiprivate) and room fee. Over time, each room will have many patients. Each resident patient will stay in only one room. Every room must have had a patient, and every resident patient must have a room. 

Make sure that the file is saved as ch5_problem2_deleon.doc

——————————————————————————————————

QUESTION 3

 Given the following business scenario, create a Crow’s Foot ERD using a specialization hierarchy if appropriate. Granite Sales Company keeps information on employees and the departments in which they work. For each department, the department name, internal mail box number, and office phone extension are kept. A department can have many assigned employees, and each employee is assigned to only one department. Employees can be salaried, hourly, or work on contract. All employees are assigned an employee number, which is kept along with the employee’s name and address. For hourly employees, hourly wages and target weekly work hours are stored; for example, the company may target 40 hours/week for some employees, 32 for others, and 20 for others. Some salaried employees are salespeople who can earn a commission in addition to their base salary. For all salaried employees, the yearly salary amount is recorded in the system. For salespeople, their commission percentage on sales and commission percentage on profit are stored in the system. For example, John is a salesperson with a base salary of $50,000 per year plus a 2 percent commission on the sales price for all sales he makes, plus another 5 percent of the profit on each of those sales. For contract employees, the beginning date and end date of their contracts are stored along with the billing rate for their hours. 

Make sure that the file is saved as ch5_problem3_deleon.doc

—————————————————————————————————

QUESTION 4

Modify the design shown in Figure 4.35 to include these business rules:

  • An employee could be staff, a professor, or an administrator.
  • A professor may also be an administrator.
  • Staff employees have a work-level classification, such as Level I or Level II.
  • Only professors can chair a department. A department is chaired by only one professor.
  • Only professors can serve as the dean of a college. Each of the university’s colleges is served by one dean.
  • A professor can teach many classes.
  • Administrators have a position title.

Given that information, create the complete ERD that contains all primary keys, foreign keys, and main attributes. 

SEE ATTACHED FILE : https://cnow.apps.ng.cengage.com/ilrn/books/co3ds13h/Images/image044.gif

Make sure that the file is saved as ch5_problem4_deleon.doc

——————————————————————————————————-

QUESTION 5

 Tiny College wants to keep track of the history of all its administrative appointments, including dates of appointment and dates of termination. (Hint: Time-variant data is at work.) The Tiny College chancellor may want to know how many deans worked in the College of Business between January 1, 1960, and January 1, 2018, or who the dean of the College of Education was in 1990. Given that information, create the complete ERD that contains all primary keys, foreign keys, and main attributes.
 

Make sure that the file is saved as ch5_problem5_deleon.doc
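
To see why time-variant data calls for a separate history entity, the sketch below stores one row per appointment with a start and end date and answers the chancellor's two example questions. It uses Python's sqlite3 module; the table layout, names, and sample rows are invented for illustration. A complete design would reference professor and college entities by key instead of storing names directly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE appointment (
    college    TEXT NOT NULL,
    position   TEXT NOT NULL,
    prof_name  TEXT NOT NULL,
    start_date TEXT NOT NULL,   -- ISO dates compare correctly as text
    end_date   TEXT,            -- NULL while the appointment is current
    PRIMARY KEY (college, position, start_date)
);
""")

# Made-up appointment history.
conn.executemany(
    "INSERT INTO appointment VALUES (?, ?, ?, ?, ?)",
    [
        ("Business", "Dean", "Prof. A", "1958-07-01", "1975-06-30"),
        ("Business", "Dean", "Prof. B", "1975-07-01", "2001-06-30"),
        ("Business", "Dean", "Prof. C", "2001-07-01", None),
        ("Education", "Dean", "Prof. D", "1985-07-01", "1995-06-30"),
    ],
)

# An appointment overlaps a period if it starts before the period ends
# and has not ended before the period starts.
OVERLAP = """
    FROM appointment
    WHERE college = ? AND position = 'Dean'
      AND start_date <= ? AND (end_date IS NULL OR end_date >= ?)
"""


def deans_between(college, start, end):
    """Count distinct deans whose term overlaps the period [start, end]."""
    sql = "SELECT COUNT(DISTINCT prof_name)" + OVERLAP
    return conn.execute(sql, (college, end, start)).fetchone()[0]


def dean_on(college, day):
    """Return the name of the dean serving on the given day, or None."""
    row = conn.execute("SELECT prof_name" + OVERLAP, (college, day, day)).fetchone()
    return row[0] if row else None
```

For example, deans_between("Business", "1960-01-01", "2018-01-01") counts the deans who served in that window, and dean_on("Education", "1990-06-01") looks up who held the post on that day.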

———————————————————————————————————–

QUESTION 6

 Some Tiny College staff employees are information technology (IT) personnel. Some IT personnel provide technology support for academic programs, some provide technology infrastructure support, and some provide support for both. IT personnel are not professors; they are required to take periodic training to retain their technical expertise. Tiny College tracks all IT personnel training by date, type, and results (completed versus not completed). Given that information, create the complete ERD that contains all primary keys, foreign keys, and main attributes. 

Make sure that the file is saved as ch5_problem6_deleon.doc

———————————————————————————————————

QUESTION 7

The FlyRight Aircraft Maintenance (FRAM) division of the FlyRight Company (FRC) performs all maintenance for FRC’s aircraft. Produce a data model segment that reflects the following business rules:

  • All mechanics are FRC employees. Not all employees are mechanics.
  • Some mechanics are specialized in engine (EN) maintenance. Others are specialized in airframe (AF) maintenance or avionics (AV) maintenance. (Avionics are the electronic components of an aircraft that are used in communication and navigation.) All mechanics take periodic refresher courses to stay current in their areas of expertise. FRC tracks all courses taken by each mechanic—date, course type, certification (Y/N), and performance.
  • FRC keeps an employment history of all mechanics. The history includes the date hired, date promoted, and date terminated.

Given those requirements, create the Crow’s Foot ERD segment.

Make sure that the file is saved as ch5_problem7_deleon.doc