Skip to content

The Big Data Blog

A science blog about my spare time data analysis projects.

Menu
  • Basics
  • Projects
  • Coding
  • Curriculum vitae
  • Impressum

Category: Python

Cluster / Kubernetes / Python

Building a minimal, cost efficient Dask cluster

In this article we will show a way to do high performance parallel computing at a Kubernetes cluster using task. A primary focus is that we want to archive the …

Coding / Python

Kernel Regression using the Fast Fourier Transform

1. Setup In a previous post it was shown how to speed up the computation of a kernel density using the Fast Fourier Transform. Conceptually a kernel density is not …

Coding / Python

Fast Kernel Density Estimation using the Fast Fourier Transform

1. Setup This Post is about how to speed up the computation kernel density estimators using the FFT (Fast Fourier Transform). Let be be a random sample drawn from an …

Basics / Coding / Python / Spark

Non-Linear Classification Methods in Spark

In a previous post I covered how to apply classical linear estimators like support vector machines or logistic regression to a non-linear dataset using the kernel method. This article can …

Basics / Coding / Python / Spark

Non-Linear Support Vector Machines (SVM)

1. Introduction This blog post is about Support Vector Machines (SVM), but not only about SVMs. SVMs belong to the class of classification algorithms and are used to separate one …

Coding / Functional Data Analysis with Spark / Projects / Python / Spark

Kernel Regression using Pyspark

1. Kernel Regression using Pyspark In a previous article I presented an implementation of a kernel denisty estimation using pyspark. It is thus not difficult to modify the algorithm to …

Coding / Functional Data Analysis with Spark / Python / Spark

Nonparametric Density estimation using Spark

1. A Nonparametric Density implementation in Spark One of my previous blog post concerns about nonparametric density estimation. In this post i presented some Matlab code. An advantage of this …

Coding / Functional Data Analysis with Spark / Python / Spark

Functional Regression with Spark

1. Functional Regression Let the covariate be an at least twice continuously differentiable random function defined wlog. on an interval and the corresponding the response. For simplicity we assume centered …

Coding / Functional Data Analysis with Spark / Python / Spark

Functional Principal Component Analysis with Spark

1.) Functional Principal Component Analysis Let be a centered smooth random function in , with finite second moment . Without loss of generality we assume instead of some arbitrary compact …

Coding / Passwords / Projects / Python

2. Coding the “Educated Guess Procedure”

1. Perform the Analyze To start with, we load the “rockyou.txt.tar.gz” password list using wget. I’m not sure if it is legal to provide a link for the list, therefore …

Posts navigation

Older posts

Recent Posts

  • Estimating the extrema of noisy curves and optimization using spline surface approximation
  • A WordPress Plugin to embed raw.githubusercontent
  • Building a minimal, cost efficient Dask cluster
  • Frequency-Severity Modeling in consideration of COVID-19 induced effects
  • Kernel Regression using the Fast Fourier Transform

Recent Comments

  • Building a minimal, cost efficient Dask cluster – The Big Data Blog on Cluster Monitoring using a ST7789 Display
  • Building a minimal, cost efficient Dask cluster – The Big Data Blog on Kubernetes at an OrangePi
  • Heiko Wagner on Fast Kernel Density Estimation using the Fast Fourier Transform
  • Petkov on Fast Kernel Density Estimation using the Fast Fourier Transform
  • Kernel Regression using the Fast Fourier Transform – The Big Data Blog on Nonparametric Density estimation using Spark

Archives

  • April 2022
  • January 2022
  • April 2021
  • February 2021
  • October 2020
  • June 2020
  • March 2020
  • January 2020
  • September 2019
  • July 2019
  • June 2019
  • April 2019
  • February 2019
  • September 2018
  • August 2018
  • December 2017
  • October 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017

Categories

  • Basics
  • Cluster
  • Coding
  • Functional Data Analysis with Spark
  • Hardware
  • Install Spark on a OrangePi PC
  • Introduction
  • JavaScript
  • Kubernetes
  • Matlab
  • Passwords
  • Projects
  • Python
  • Spark

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
Visit me on social media
Copyright © 2022 The Big Data Blog – OnePress theme by FameThemes