Mar 20, 2022
Python - Probability - Dice rolls sum
# You roll 5 fair, six-sided dice together. # 1. What is the probability that the sum of the five dice is equal to 19? 9.45% (735 of 7,776 outcomes) # 2. What...
270
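The first question in the dice post can be checked by exact enumeration rather than simulation. The sketch below is not taken from the post itself; the function names `sum_distribution` and `probability_of_sum` are hypothetical and chosen only for illustration. It lists all 6^5 = 7,776 equally likely outcomes and counts those that sum to 19, which gives 735 outcomes, or roughly 9.45%.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def sum_distribution(n_dice=5, sides=6):
    """Count how many equally likely outcomes produce each possible sum."""
    faces = range(1, sides + 1)
    return Counter(sum(roll) for roll in product(faces, repeat=n_dice))


def probability_of_sum(target, n_dice=5, sides=6):
    """Exact probability that n_dice fair dice add up to target."""
    counts = sum_distribution(n_dice, sides)
    return Fraction(counts[target], sides ** n_dice)


if __name__ == "__main__":
    p19 = probability_of_sum(19)
    # Print the exact fraction and its decimal approximation.
    print(p19, round(float(p19), 4))
```

Enumeration is cheap here because there are only 7,776 outcomes; for many more dice a convolution or dynamic-programming approach would likely be a better fit.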
Mar 12, 2022
Python and PySpark - Credit score - Loan propensity
The sample data file provides numerical and categorical variables for each case ID. We can calculate a final score for each case ID and see...
270
Mar 3, 2022
PySpark - Linear regression for categorical features
# PySpark # LINEAR REGRESSION FOR CATEGORICAL FEATURES !pip install pyspark pip install findspark import findspark # Use pandas to import...
220
Feb 28, 2022
PySpark - Data Frame Basic Operations in Google Colaboratory
# PySpark - Data Frame !pip install pyspark pip install findspark import findspark # Use pandas to import csv import pandas as pd...
140
Feb 28, 2022
PySpark - Mac Terminal Commands
Below are the basic Mac Terminal commands to set up Anaconda, PySpark, and Java on a Mac for Jupyter or Colaboratory. 1. Revise in Terminal...
120
Feb 17, 2022
Python - Time Series Forecast - 5. Regression
Time Series Forecast Steps: 1. Identification of time series type 2. Seasonality tests 3. Stationarity tests 4. Decomposition 5....
170
Feb 17, 2022
Python - Time Series Forecast - 3. Stationarity
Time Series Forecast Steps: 1. Identification of Time Series Type 2. Seasonality Tests 3. Stationarity Tests 4. Decomposition 5....
290
Feb 16, 2022
Python - Time Series Forecast - 4. Decomposition
Time series come in several types depending on the data: additive, multiplicative, or other temporal structures. Before forecasting a time series we...
290
Jan 28, 2022
Python - Classification - Random Forest
Many years ago I used the Random Forest methodology in R Studio to calculate feature importance in the Coffee Data Set. Random forests...
100
Jan 24, 2022
Python - fizzbuzz function
Fizz Buzz is an old group word game for children. It is a very simple programming task, used in software developer job interviews, to...
80
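As a minimal sketch of the task described in the fizzbuzz post, the function below assumes the conventional rules, since the excerpt is cut off before stating them: multiples of 3 become "Fizz", multiples of 5 become "Buzz", and multiples of both become "FizzBuzz". The names `fizzbuzz` and `fizzbuzz_sequence` are illustrative and not necessarily those used in the post.

```python
def fizzbuzz(n):
    """Return the Fizz Buzz word for n under the conventional rules (assumed)."""
    if n % 15 == 0:          # divisible by both 3 and 5
        return "FizzBuzz"
    if n % 3 == 0:
        return "Fizz"
    if n % 5 == 0:
        return "Buzz"
    return str(n)


def fizzbuzz_sequence(limit):
    """Return the Fizz Buzz words for 1..limit as a list."""
    return [fizzbuzz(i) for i in range(1, limit + 1)]


if __name__ == "__main__":
    print(" ".join(fizzbuzz_sequence(15)))
```

Checking divisibility by 15 first keeps the branches simple; building the word by string concatenation is an equally common alternative.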
Jan 23, 2022
Python - RFM segmentation - Retail example
RFM segmentation is a basic customer segmentation based on the recency, frequency, and monetary value of each customer. Recency (R): When was the...
430
Nov 15, 2021
Python - chatbot - nltk
A chatbot is characterized by a small data set, very simple analytics, and a primitive communication function. The first chatbots were ...
190
Nov 15, 2021
Python - www operations - advertools
We can analyze marketing productivity, keywords, searched data, tables, information from Google, e.g. on Wikipedia, audience, marketing...
150
Nov 9, 2021
Python - Text CAPTCHA - captcha
The CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) package is used for authentication and provides entry to...
240
Sep 20, 2021
Python - Next word prediction - nltk
We can steer a customer's attention within a text document. If an article uses more words that are focused on a particular group, the reader will focus...
580
Sep 13, 2021
Python - Triplets, Coprime and Prime numbers
Let's revisit some simple calculations: prime numbers, coprime numbers, and triplets are an important part of the foundation for more...
630
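The three ideas named in the post can be illustrated with a short sketch. It assumes that "triplets" refers to Pythagorean triplets (a² + b² = c²), which the truncated excerpt does not confirm, and the helper names `is_prime`, `are_coprime`, and `pythagorean_triplets` are hypothetical.

```python
from math import gcd, isqrt


def is_prime(n):
    """Trial division up to the integer square root of n."""
    if n < 2:
        return False
    return all(n % d != 0 for d in range(2, isqrt(n) + 1))


def are_coprime(a, b):
    """Two integers are coprime when their greatest common divisor is 1."""
    return gcd(a, b) == 1


def pythagorean_triplets(limit):
    """All (a, b, c) with a <= b <= c <= limit and a*a + b*b == c*c (assumed meaning of 'triplets')."""
    triplets = []
    for a in range(1, limit + 1):
        for b in range(a, limit + 1):
            c = isqrt(a * a + b * b)
            if c <= limit and c * c == a * a + b * b:
                triplets.append((a, b, c))
    return triplets


if __name__ == "__main__":
    print([n for n in range(20) if is_prime(n)])
    print(are_coprime(8, 15), are_coprime(6, 9))
    print(pythagorean_triplets(20))
```

`math.isqrt` requires Python 3.8 or later; it avoids the floating-point rounding issues that `int(math.sqrt(...))` can show for large values.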
Aug 13, 2021
Python - Sentiment Analytics - Tweepy
There are many areas where sentiment analytics can improve business performance. Python offers various approaches to sentiment and...
550
Jul 26, 2021
Python - Economics - Capital Intensity - Line Chart - Matplotlib
There are three basic types of market efficiency: allocative, operational, and informational. Assets per employee is one of many operational...
630
Jul 21, 2021
Python - Bar Chart - Matplotlib
A basic bar chart example comparing total assets by bank for the years 2011 to 2020. from pandas import read_csv from matplotlib import pyplot...
320
Jul 19, 2021
Finance - Annual Report - Dictionary
Assets under management - client assets under management. Basic earnings per share - basic EPS = (Net income – Preferred dividend) /...
200
Jul 3, 2021
R Studio - Text Analytics - Corpus Inspection
Several areas of text analytics can be examined in R Studio. One of the main tasks is to explore the text corpus. I use the...
2060
Jun 13, 2021
R Studio - Proportion Data - Survfit, Survreg
The R Book (Michael Crawley, ISBN: 978-0-470-97392-9), and its chapter 16 in particular, inspired me to run the code and pick out the most suitable...
540
May 21, 2021
Python - Binary Bootstrap Sample
The code provides a Yes/No prediction by fitting a binary bootstrap sample for mistakes in data entries. Leo Breiman (1928 – 2005) was...
250