You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
33 lines
1.0 KiB
33 lines
1.0 KiB
1. Objective:
|
|
We want to predict the VPN usage of employees based on the frequency of use of other apps.
|
|
|
|
2. License: Free to use, collected by Splunk.
|
|
|
|
3. Data Source:
|
|
Corporate app usage from an anonymous company.
|
|
|
|
4. Data Set Information:
|
|
The dataset covers just under three months of aggregate app usage.
|
|
|
|
5. Field Meanings:
|
|
A. _time: UNIX epoch time
|
|
B. The remaining fields (app names anonymized) contain the number of logins during that day.
|
|
|
|
6. Parameter Selection:
|
|
A. Dashboard: Predict Numeric Fields
|
|
Settings:
|
|
1) Search: | inputlookup app_usage.csv
|
|
2) Field to Predict: RemoteAccess
|
|
3) Fields to Use: CRM, CloudDrive, HR1, Webmail
|
|
|
|
B. Dashboard: Cluster Numeric Fields
|
|
Settings:
|
|
0) Search: | inputlookup app_usage.csv
|
|
1) Model name: N/A
|
|
2) Fields to preprocess: CRM, CloudDrive, ERP, Expenses, HR1, HR2, ITOps, OTHER, Recruiting, RemoteAccess, Webmail
|
|
3) Apply StandardScaler
|
|
4) Apply PCA to reduce to 3 fields
|
|
5) Algorithm: Spectral Clustering
|
|
6) Fields to use: PC_1, PC_2, PC_3
|
|
7) K: 3
|