Cyber Analytics and Intelligence: Decision Trees

  

DECISION TREES for Risk Assessment

One of the great advantages of decision trees is their interpretability. The rules learnt for classification are easy for a person to follow, unlike the opaque “black box” of many other methods, such as neural networks. We demonstrate the utility of this using a German credit data set. You can read a description of this dataset at the UCI site. The task is to predict whether a loan approval is good or bad credit risk based on 20 attributes. We’ve simplified the data set somewhat, particularly making attribute names and values more meaningful.

1. Download the credit_Dataset.arff dataset and load it to Weka.

2. (5 Points) When presented with a dataset, it is usually a good idea to visualise it first. Go to the Visualise tab. Click on any of the scatter plots to open a new window which shows the scatter plot for two selected attributes. Try visualising a scatter plot of age and duration. Do you notice anything unusual? You can click on any data point to display all it’s values.

3. (5 Points) In the previous point you should have found a data point, which seems to be corrupted, as some of its values are nonsensical. Even a single point like this can significantly affect the performance of a classifier. How do you think it would affect Decision trees? A good way to check this is to test the performance of each classifier before and after removing this datapoint.

4. (10 Points) To remove this instance from the dataset we will use a filter. We want to remove all instances, where the age of an applicant is lower than 0 years, as this suggests that the instance is corrupted. In the Preprocess tab click on Choose in the Filter pane. Select filters > unsupervised > instance > RemoveWithValues. Click on the text of this filter to change the parameters. Set the attribute index to 13 (Age) and set the split point at 0. Click Ok to set the parameters and Apply to apply the filter to the data. Visualise the data again to verify that the invalid data point was removed.

5. (20 Points) On the Classify tab, select the Percentage split test option and change its value to 90%. This way, we will train the classifiers using 90% of the training data and evaluate their performance on the remaining 10%. First, train a decision tree classifier with default options. Select classifiers > trees > J48 and click Start. J48 is the Weka implementation of the C4.5 algorithm, which uses the normalized information gain criterion to build a decision tree for classification.

6. (20 Points) After training the classifier, the full decision tree is output for your perusal; you may need to scroll up for this. The tree may also be viewed in graphical form by right-clicking in the Result list and selecting Visualize tree; unfortunately this format is very cluttered for large trees. Such a tree accentuates one of the strengths of decision tree algorithms: they produce classifiers which are understandable to humans. This can be an important asset in real life applications (people are seldom prepared to do what a computer program tells them if there is no clear explanation). Observe the output of the classifier and try to answer the following questions:

o How would you assess the performance of the classifier? Is the Percentage of Correctly Classified Instances a sufficient measure in this case? Why? Hint: check the number of good and bad cases in the test sample, using the confusion matrix. Each column of the matrix represents the instances in a predicted class, while each row represents the instances in an actual class. For example let us define an experiment from P positive instances and N negative instances. The four outcomes can be formulated in a 2 by 2 contingency table or confusion matrix. One benefit of a confusion matrix is that it is easy to see if the system is confusing two classes (i.e. commonly mislabeling one as another).

o Looking at the decision tree itself, are the rules it applies sensible? Are there any branches which appear absurd? At what depth of the tree? What does this suggest?
Hint: Check the rules applied after following the paths: (a) CheckingAccount = <0, Foreign = yes, Duration >11, Job = skilled, OtherDebtors = none, Duration <= 30 and (b) CheckingAccount = <0, Foreign = yes, Duration >11, Job = unskilled.

o How does the decision tree deal with classification in the case where there are zero instances in the training set corresponding to that particular path in the tree (e.g. those leaf nodes that have (0:0))?

7. (20 Points) Now, explore the effect of the confidenceFactor option. You can find this by clicking on the Classifer name (to the right of the Choose button on the Classify tab). On the Classifier options window, click on the More button to find out what the confidence factor controls. Try the values 0.1, 0.2, 0.3 and 0.5. What is the performance of the classifier at each case? Did you expect this given your observations in the previous questions? Why do you think this happens?

8. (20 Points) Suppose that it is worse to classify a customer as good when they are bad, than it is to classify a customer as bad when they are good. Which value would you pick for the confidence factor? Which performance measure would you base your decision on?

9.  (20 Points)Finally we will create a random decision forest and compare the performance of this classifier to that of the decision tree and the decision stump. The random decision forest is an ensemble classifier that consists of many decision trees and outputs the class that is the mode of the class’s output by individual trees. Again set the test option Percentage split to 90%. Select classifiers > trees > RandomForest and hit Start. Again, observe the output. How high can you get the performance of the classifier by changing the number of trees (numTrees) parameter? How does the random decision forest compare performance wise to the decision tree and decision stump?

Unit 2 Assignment

 

In this assignment, you will describe the Standish Group and the CHAOS Report.

Refer to the attached document for full assignment details and grading rubric.

final analysis

 

Module 05 Content

  1. Create a risk assessment plan for your company and do a thorough risk assessment analysis. Assess the ramifications of the risks you identify and make recommendations to mitigate those risks.

    Your assignment should meet the following requirements:

    • 6-8 pages long, not including the cover page and reference page.
    • Conform to APA Style.
    • For your final project, you will compile all of the weekly deliverables from Modules 02-04 and submit as a final project. Be sure to include your risk assessment plan in this final deliverable.

      Compile your deliverables into a paper with the following sections (Hint: make these your level 1 heading per APA format, remembering the title of the paper is your heading for your introduction):

    • Introduction
    • IT Governance and Risk Control Plan
    • Business Continuity and Service Level Agreements
    • Risk Status Report
    • IT Audit Process
    • Risk Assessment Plan and Analysis
    • Conclusion
    • After compiling the weekly deliverables, condense the information into a paper 8-10 pages long, and keep only the most substantial information.

      Your assignment should meet the following requirements:

    • Be 8-10 pages long, not including the cover page and reference page.
    • Conform to APA Style.
    • Support your answers with at least six current scholarly journal articles (not more than five years old). The Rasmussen Library is a great place to find resources.
    • Be clearly and well-written, concise, and logical, using excellent grammar and style techniques. You are being graded in part on the quality of your writing. If you need assistance with your writing style and APA format, start with the Writing and APA guides at the Rasmussen Library.

One acre of land is equivalent to

One acre of land is equivalent to 43,560 square feet. Write a program that calculates the number of acres in a tract of land with 389,767square feet. Hint: Divide the size of the tract of land by the size of an acre to get the number of acres.

Mastering Blockchain

Find a peer-reviewed scholarly journal article discussing blockchain technology. Complete a review of the article by writing a 2-3 page overview of the article. This will be a detailed summary of the journal article, including concepts discussed and findings. Additionally, find one other source (it does not have to be a peer-reviewed journal article) that substantiates the findings in the article you are reviewing.Once you find the article, you will read it and write a review of it.  This is considered a research article review.Your paper should meet these requirements: 

  • Be approximately three to four pages in length, not including the required cover page and reference page.
  • Follow APA 7 guidelines. Your paper should include an introduction, a body with fully developed content, and a conclusion.

Creating a database and website

. Create a database named “Bookstore” and tables, using the following E-R diagram.

1

You could create the database on AWS using RDS, or you could create your database on your local machine. 

2. Create a website for access your database, and include some functionalities, like: 

insert records to each table, you may need to update other tables when you need to insert a record to a table. 

query records from each table using different columns 

delete records from each table, you might need to update other tables when you delete a record from a table. 

3. You could use any framework to develop your website, for example: 

if you are familiar with Python, you could use Flask or Django 

if you are familiar with javascript, you could use Node.js 

Solved Exp19_Excel_Ch07_ML2_Finances

  Your family is considering purchasing a house and investing in a business venture. You started the structure for a loan amortization table and the investment table. You will complete the first five years of the 20-year loan amortization table. To complete the table, you will enter formulas to calculate the beginning balance, monthly payment, and ending balance. You will use financial functions to calculate the interest and principal paid for each monthly payment. In addition, you want to calculate cumulative interest after the first year, total interest over the life of the loan, and the amount of principal paid after the first year. You also want to see how many months half or more of the payment is for interest. You will then focus your attention on completing an investment table using date functions, formulas, and a financial function to calculate the future value of the investment.

     

1

Start Excel. Download and open   the file named Exp19_Excel_Ch07_ML2_Finances.xlsx.   Grader has automatically added your last name to the beginning of the   filename.

 

2

The first step is to enter a   formula to reference the loan amount for the beginning balance for the first   payment.
 

  In cell B9, enter a formula that references cell D2.

 

3

A loan amortization table   usually contains a column that displays the monthly payment for each row.
 

  In cell C9, enter a formula to reference the monthly payment in cell D3. Use   a mixed reference and copy the formula to the range C10:C68.

 

4

The monthly payment indicates   the total amount of the payment, which includes principal and interest.   Interest is calculated based on the loan amount, the rate, the payment   number, and the number of payments.
 

  In cell D9, enter the IPMT function to calculate the interest paid for the   first month using mixed cell references to the input area for the Rate, Nper,   and PV arguments and using cell A9 for the Per argument. Make sure the result   is a positive value and copy the function to the range D10:D68.

 

5

After calculating the interest   paid, the rest of the monthly payment repays the principal.
 

  In cell E9, enter the PPMT function to calculate the principal paid for the   first month using mixed cell references to the input rea for the Rage, Nper,   and PV argument and using cell A9 for the Per argument. Make sure the result   is a positive value and copy the function to the range E10:E68.

 

6

The last column of the loan   amortization table calculates the ending balance.
 

  In cell F9, calculate the ending balance by subtracting the Principal   Repayment from the Beginning Balance in row 9. Copy the formula to the range   F10:F68.

 

7

The beginning balance for each   payment is calculated and entered in column B.
 

  In cell B10, enter a formula that references the first month’s ending balance   in cell F9. Copy the formula to the range B11:B68.

 

8

Ensure that Accounting Number   Format is applied to the range B9:F68.

 

9

You want to calculate the total   interest paid for the first year.
 

  In cell F2, insert the CUMIPMT function to calculate the cumulative interest   paid for the first year. Use A9 for the Start_period argument and B6 for the   End_period argument. Use 0 as the Type argument. Make sure the result is a positive value.

 

10

Now you want to calculate the   total interest paid for the entire loan.
 

  In cell F3, insert the CUMIPMT function to calculate the total cumulative   interest paid for the entire loan. Use A9 for the Start_period and D6 for the   End_period arguments. Make sure the result is a positive value.

 

11

You want to calculate the   cumulative principal paid for the first year.
 

  In cell F4, insert the CUMPRINC function to calculate the cumulative   principal paid for the first year. Use A9 for the Start_period and B6 for the   End_period arguments. Make sure the result is a positive value.

 

12

In cell F5, insert the COUNTIF   function to count the number of payment periods in which the interest in the   loan amortization table is higher than one-half of the monthly payment (cell   D4).

 

13

Apply General number format to   cell F5.

 

14

You want to extract the year and   use it to determine the payoff year.
 

  Display the Investment sheet. In cell D4, insert the YEAR function to extract   the year from cell D3 and add the number of years (cell B3). You will change   the format of the result in the next step.

 

15

You need to format the result in   cell D4 as a number.
 

  Ensure that General number format is applied to cell D4.

 

16

At the end of each period, you   will add $125 to the investment.
 

  In cell D7, enter a formula that references cell D2. Use a mixed reference to   ensure the row number does not change. Copy the formula to the range D8:D54. 

 

17

You want to calculate the   interest earned per period.
 

  In cell C7, multiply the beginning balance in cell B7 to the result of   dividing the APR by the No. of Pmts per Year. Use mixed and relative cell   references. Copy the formula to the range C8:C54.

 

18

You are ready to calculate the   ending balance for each payment period.
 

  In cell E7, add the Beginning Balance, Interest Earned, and End-of-Period   Investment for row 7. Copy the formula to the range E8:E54.

 

19

You will use a nested function   to calculate the dates in column A.
 

  In cell A8, create a DATE function with a nested YEAR function, a nested   MONTH function and then add 1 to increment the month, and a nested DAY   function. The function arguments should reference the date on the previous   row.

 

20

Copy the function from cell A8   to the range A9:A54 but preserve the fill formatting.

 

21

In cell E56, insert the FV   function to calculate the future value of the investment use references to   the respective cells in the input area for the arguments. Make sure the   result is positive. Leave the Type argument empty.

 

22

Create a footer with your name   on the left side, the sheet name code in the center, and the file name code   on the right side of both sheets.

 

23

Save and close Exp19_Excel_Ch07_ML2_Finances.xlsx.   Exit Excel. Submit the file as directed.

ISSC362 IT Security: Attack & Defense MyClassroomPayment Type: TA, Tuition Grant Undergraduate Sep 06, 2021 – Oct 31, 2021 Elliott Lynn : Instructor

Course Details

ISSC362 IT Security: Attack & Defense 3 Semester Hours

This course examines the techniques and technologies for penetration of networks, detection of attacks, and prevention of attacks. This course addresses the techniques, the technologies, and the methodologies used by cyber intruders (hackers) to select a target and launch an attack. Students will gain insight into the motives and desired goals of hackers as well as effective tools and techniques used as countermeasures ensuring data assets remain secure. This course focuses on techniques and technologies to detect such attacks even while the attack is in progress; early detection enables the administrator to track the movements of the hacker and to discover the intent and goals of the hacker. This course assesses the various countermeasures to keep the system out of the “sights” of the hacker and to keep the hacker out of the perimeter of the target network. This course also explores the laws and the legal considerations in prosecuting computer crime.

For future semesters, email [email protected] to confirm course materials.

Security Awareness Requirements

 

You are required to write a three to five (3-5) page proposal in which you recommend the need for security awareness training. In your proposal, be sure to:

  1. Identify compliance or audit standards that your organization must adhere to.
  2. Identify security awareness requirements for those standards.
  3. Identify training methods to meet those requirements (In house, contract or CBT).

Assumptions

  • You should assume that your company will have to accept credit cards as payments.
  • You should assume that no current awareness/training plans exist for your company.
  • You should assume that all offices and groups need training.

Notes on submission:

  • Use at least three (3) quality resources as references in this assignment. Wikipedia and similar Websites do not qualify as quality resources.
  • Your assignment must follow these formatting requirements: Be typed, double spaced, using Times New Roman font (size 12), with one-inch margins on all sides; citations and references must follow APA or school-specific format. Check with your professor for any additional instructions.
  • Include a cover page containing the title of the assignment, the student’s name, the professor’s name, the course title, and the date. The cover page and the reference page are not included in the required assignment page length.

Apply-AWS Reference Table for Users

 Write a 1- to 2-page reference table that lists at least 4 resources available on the AWS website, such as FAQs, videos, whitepapers, tutorials, case studies, etc. The audience for your table will be business users engaged once the services are fully implemented. Your table should include the resource type, a description, who would benefit from the resource, and a link to the resource(s). The table will be a resource for your organization, so keep in mind the format and ease of use.