
a research paper about FDSLRM modeling with supplementary materials - software, notebooks

Project: fdslrm
Kernel: Python 2 (Ubuntu Linux)

Authors: Jozef Hanč, Martina Hančová, Andrej Gajdoš
Faculty of Science, P. J. Šafárik University in Košice, Slovakia
emails: [email protected]


EBLUP-NE for cyber attacks

Python-based computational tools - SciPy, CVXPY

Table of Contents

To get back to the contents, use the Home key.

Python libraries

CVXPY: A Python-Embedded Modeling Language for Convex Optimization

  • Purpose: scientific Python library for solving convex optimization tasks

  • Version: 1.0.1, 2018

  • URL: https://www.cvxpy.org/

  • Computational parameters of CVXPY:

  • solver - the convex optimization solver (ECOS, OSQP, or SCS), chosen according to the given problem; the solver-specific parameters are listed below (a usage sketch follows this list)
    • OSQP for convex quadratic problems
      • max_iter - maximum number of iterations (default: 10000)
      • eps_abs - absolute accuracy (default: 1e-4)
      • eps_rel - relative accuracy (default: 1e-4)
    • ECOS for convex second-order cone problems
      • max_iters - maximum number of iterations (default: 100)
      • abstol - absolute accuracy (default: 1e-7)
      • reltol - relative accuracy (default: 1e-6)
      • feastol - tolerance for feasibility conditions (default: 1e-7)
      • abstol_inacc - absolute accuracy for inaccurate solution (default: 5e-5)
      • reltol_inacc - relative accuracy for inaccurate solution (default: 5e-5)
      • feastol_inacc - tolerance for feasibility condition for inaccurate solution (default: 1e-4)
    • SCS for large-scale convex cone problems
      • max_iters - maximum number of iterations (default: 2500)
      • eps - convergence tolerance (default: 1e-4)
      • alpha - relaxation parameter (default: 1.8)
      • scale - balance between minimizing primal and dual residual (default: 5.0)
      • normalize - whether to precondition data matrices (default: True)
      • use_indirect - whether to use an indirect solver for the KKT system (instead of a direct one) (default: True)
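As an illustration only (a hedged sketch on a toy nonnegative least-squares problem, not part of the FDSLRM analysis below), these parameters are passed to CVXPY as keyword arguments of prob.solve():

# toy problem used only to show how solver parameters are passed to solve()
import cvxpy as cp
import numpy as np

A, b = np.random.randn(20, 3), np.random.randn(20)
x = cp.Variable(3)
prob = cp.Problem(cp.Minimize(cp.sum_squares(A*x - b)), [x >= 0])

# ECOS with tightened tolerances
prob.solve(solver=cp.ECOS, max_iters=200, abstol=1e-8, reltol=1e-8, feastol=1e-8)
# OSQP for the same convex quadratic problem
prob.solve(solver=cp.OSQP, max_iter=20000, eps_abs=1e-6, eps_rel=1e-6)
# SCS as the large-scale option
prob.solve(solver=cp.SCS, max_iters=5000, eps=1e-6)
print(x.value)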

Scipy - NumPy, Pandas

  • Numpy is the fundamental Python library of the SciPy ecosystem for fast scientific computing with large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions operating on these arrays.

  • default precision: double floating-point precision, $\text{eps}<10^{-15}$ (see the quick check after this list)

  • Pandas is the Python library providing high-performance, easy-to-use data structures.
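As a quick verification of the stated double precision (a small added check, not from the original notebook):

# machine epsilon of NumPy's default float64 dtype - approximately 2.22e-16, i.e. below 1e-15
import numpy as np
print(np.finfo(np.float64).eps)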

from __future__ import print_function
from __future__ import division

import cvxpy
import numpy as np
import pandas as pd
import platform as pt
from cvxpy import *
from math import cos, sin
from numpy.linalg import inv, norm
from itertools import product

np.set_printoptions(precision=10)
# software versions
print('cvxpy:'+cvxpy.__version__)
print('numpy:'+np.__version__)
print('pandas:'+pd.__version__)
print('python:'+pt.python_version())
cvxpy:1.0.22
numpy:1.16.3
pandas:0.23.4
python:2.7.15rc1

Data and Model

This FDSLRM application describes a real time series data set representing the total weekly number of cyber attacks against a honeynet -- an unconventional tool which mimics real systems connected to the Internet, such as business or school intranets, in order to study the methods, tools and goals of cyber attackers.

Data, taken from Sokol, 2017, were collected from November 2014 to May 2016 in the CZ.NIC honeynet consisting of Kippo honeypots in medium-interaction mode. The number of time series observations is $n=72$.

The suitable FDSLRM, after a preliminary logarithmic transformation of data $Z(t) = \log X(t)$, is Gaussian orthogonal:

$$
Z(t) = \beta_1+\beta_2\cos\left(\tfrac{2\pi t\cdot 3}{72}\right)+\beta_3\sin\left(\tfrac{2\pi t\cdot 3}{72}\right)+\beta_4\sin\left(\tfrac{2\pi t\cdot 4}{72}\right)+Y_1\sin\left(\tfrac{2\pi t\cdot 6}{72}\right)+Y_2\sin\left(\tfrac{2\pi t\cdot 7}{72}\right)+w(t), \quad t\in \mathbb{N},
$$

where $\boldsymbol{\beta}=(\beta_1,\,\beta_2,\,\beta_3,\,\beta_4)' \in \mathbb{R}^4$, $\mathbf{Y} = (Y_1,Y_2)' \sim \mathcal{N}_2(\boldsymbol{0}, \mathrm{D})$, $w(t) \sim \mathrm{iid}\, \mathcal{N}(0, \sigma_0^2)$, $\boldsymbol{\nu}= (\sigma_0^2, \sigma_1^2, \sigma_2^2) \in \mathbb{R}_{+}^3$.

We identified this, the most parsimonious, structure of the FDSLRM using an iterative model-building and selection process based on exploratory tools of spectral analysis and residual diagnostics (for details see our Jupyter notebook cyberattacks.ipynb).
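The spectral-analysis step itself is carried out in cyberattacks.ipynb; purely as an added illustration (assuming the data file cyberattacks.csv loaded below), a raw periodogram of the log-transformed series can be inspected along these lines:

# illustrative sketch only; the full spectral and residual diagnostics are in cyberattacks.ipynb
import numpy as np
import pandas as pd

z = np.log(pd.read_csv('cyberattacks.csv').values).flatten()
n = len(z)                                    # n = 72 weekly observations
I = np.abs(np.fft.rfft(z - z.mean()))**2 / n  # periodogram ordinates at Fourier frequencies 2*pi*j/n
top = np.argsort(I)[::-1][:6]                 # indices j with the largest ordinates
print(sorted(top))                            # compare with j = 3, 4 (fixed) and 6, 7 (random) used above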

SciPy(Numpy)

# data - time series observation
path = 'cyberattacks.csv'
data = pd.read_csv(path)
# log of observations x as matrix
x = np.asmatrix(np.log(data.values))
x.shape
(72, 1)
# model parameters
n, k, l = 72, 4, 2
# significant frequencies
om1, om2, om3, om4 = 2*np.pi*3/72, 2*np.pi*4/72, 2*np.pi*6/72, 2*np.pi*7/72
# model - design matrices F', F, V', V
Fc = np.mat([[1 for t in range(1,n+1)],
             [cos(om1*t) for t in range(1,n+1)],
             [sin(om1*t) for t in range(1,n+1)],
             [sin(om2*t) for t in range(1,n+1)]])
Vc = np.mat([[sin(om3*t) for t in range(1,n+1)],
             [sin(om4*t) for t in range(1,n+1)]])
F, V = Fc.T, Vc.T
# columns vj of V and their squared norm ||vj||^2
vv = lambda j: V[:,j-1]
nv2 = lambda j: np.trace(vv(j).T*vv(j))
# auxiliary matrices and vectors
# Gram matrices GF, GV
GF, GV = Fc*F, Vc*V
InvGF, InvGV = inv(GF), inv(GV)
# projectors PF, MF, PV, MV
In = np.identity(n)
PF = F*InvGF*Fc
PV = V*InvGV*Vc
MF, MV = In-PF, In-PV
# residuals e, e'
e = MF*x
ec = e.T
# orthogonality condition
Fc*V, GV
(matrix([[-8.0344234420e-16,  2.2822973698e-15],
         [ 2.1007369868e-15,  6.6042080180e-15],
         [ 4.9460266475e-15,  1.0170518226e-14],
         [-4.6486498125e-15,  4.0212026360e-15]]),
 matrix([[ 3.6000000000e+01, -3.8300919285e-15],
         [-3.8300919285e-15,  3.6000000000e+01]]))
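The output above shows that F'V vanishes to machine precision, so the model is numerically orthogonal; a compact tolerance check of the same condition (an added sketch using the matrices defined above):

# F'V ~ 0 up to round-off: the fixed and random design matrices are numerically orthogonal
print(np.allclose(Fc*V, np.zeros((k, l)), atol=1e-12))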

Natural estimators

ANALYTICALLY

using formula (4.1) from Hancova et al 2019

$$
\breve{\nu}_0 = \dfrac{1}{n-k-l}\,\mathbf{e}'\mathrm{M_V}\mathbf{e}, \qquad
\breve{\nu}_j = \left(\dfrac{\mathbf{e}'\mathbf{v}_j}{||\mathbf{v}_j||^2}\right)^2, \; j = 1, \ldots, l
$$

$\boldsymbol{1^{st}}$ stage of EBLUP-NE

SciPy(Numpy)

# NE according to formula (4.1)
NE0 = [1/(n-k-l)*np.trace(ec*MV*e)]
NEj = [(np.trace(ec*vv(j))/nv2(j))**2 for j in range(1,l+1)]
NE = NE0+NEj
print(NE, norm(NE))
[0.05934201263868991, 0.02547467738230785, 0.015495329728874107] 0.06641188820647724

CVXPY

NE as a convex optimization problem

$$
\begin{array}{ll}
\textit{minimize} & \quad f_0(\boldsymbol{\nu})=||\mathbf{e}\mathbf{e}' - \mathrm{VDV'}||^2+||\mathrm{M_V}\mathbf{e}\mathbf{e}'\mathrm{M_V}-\nu_0\mathrm{M_F}\mathrm{M_V}||^2 \\[6pt]
\textit{subject to} & \quad \boldsymbol{\nu} = \left(\nu_0, \ldots, \nu_l\right)'\in [0, \infty)^{l+1}
\end{array}
$$

# the optimization variable, objective function
v = Variable(l+1)
fv = sum_squares(e*ec-V*diag(v[1:])*Vc)+sum_squares(MV*e*ec*MV-v[0]*MF*MV)
# the optimization problem for NE
objective = Minimize(fv)
constraints = [v >= 0]
prob = Problem(objective, constraints)
# solve the NE problem
prob.solve()
NEcvx = v.value
print('NEcvx =', NEcvx, ' norm = ', norm(NEcvx))
NEcvx = [0.0593420126 0.0254746774 0.0154953297] norm = 0.06641188820647723
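As a quick consistency check (an added sketch, not in the original notebook), the analytic NE and the CVXPY solution can be compared directly; they agree within the solver's default accuracy:

# difference between analytic NE and convex-optimization NE, expected to be of the order of the solver tolerance
print(norm(np.array(NE) - np.array(NEcvx).flatten()))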

$\boldsymbol{2^{nd}}$ stage of EBLUP-NE

using formula (3.10) from Hancova et al 2019.

$$
\mathring{\nu}_j = \rho_j^2\, \breve{\nu}_j, \; j = 0, 1, \ldots, l \\
\rho_0 = 1, \quad \rho_j = \dfrac{\hat{\nu}_j||\mathbf{v}_j||^2}{\hat{\nu}_0+\hat{\nu}_j||\mathbf{v}_j||^2}
$$

where $\boldsymbol{\breve{\nu}}$ are NE and $\boldsymbol{\hat{\nu}}$ are initial estimates for EBLUP-NE.

SciPy(Numpy)

# EBLUP-NE based on formula (3.9)
rho2 = lambda est: [1] + [(est[j]*nv2(j)/(est[0]+est[j]*nv2(j)))**2 for j in range(1,l+1)]
EBLUPNE = lambda est: [rho2(est)[j]*NE[j] for j in range(l+1)]
# numerical results
print(rho2(NE))
[1, 0.8821446487542702, 0.8169426439141595]
print(EBLUPNE(NE),norm(EBLUPNE(NE)))
[0.05934201263868991, 0.02247235033154431, 0.012658795637028089] 0.06470491558153936

cross-checking

using formula (3.6) for general FDSLRM from Hancova et al 2019.

$$
\mathring{\nu}_0 = \breve{\nu}_0, \quad \mathring{\nu}_j = (\mathbf{Y}^*)_j^2, \; j = 1, 2, \ldots, l \\
\mathbf{Y}^* = \mathbb{T}\mathbf{X} \mbox{ with } \mathbb{T} = \mathrm{D}\mathbb{U}^{-1}\mathrm{V}'\mathrm{M_F}, \quad \mathbb{U} = \mathrm{V}'\mathrm{M_F}\mathrm{V}\mathrm{D} + \nu_0 \mathrm{I}_l
$$

def EBLUPNEgen(est):
    D = np.diag(est[1:])
    U = Vc*MF*V*D + est[0]*np.identity(l)
    T = D*inv(U)*Vc*MF
    eest = np.vstack((np.matrix(NE[0]), np.multiply(T*x, T*x)))
    return np.array(eest).flatten().tolist()
print(EBLUPNEgen(NE), norm(EBLUPNEgen(NE)))
[0.05934201263868991, 0.022472350331544263, 0.012658795637028123] 0.06470491558153935

NN-DOOLSE or MLE

$\boldsymbol{1^{st}}$ stage of EBLUP-NE

KKT algorithm

using the KKT algorithm (tab. 3, Hancova et al 2019)

$$
\mathbf{q} = \left(\begin{array}{c} \mathbf{e}' \mathbf{e}\\ (\mathbf{e}' \mathbf{v}_{1})^2 \\ \vdots \\ (\mathbf{e}' \mathbf{v}_{l})^2 \end{array}\right)
\qquad
\mathrm{G} = \left(\begin{array}{ccccc} n^* & ||\mathbf{v}_{1}||^2 & ||\mathbf{v}_{2}||^2 & \ldots & ||\mathbf{v}_{l}||^2 \\ ||\mathbf{v}_{1}||^2 & ||\mathbf{v}_{1}||^4 & 0 & \ldots & 0 \\ ||\mathbf{v}_{2}||^2 & 0 & ||\mathbf{v}_{2}||^4 & \ldots & 0 \\ \vdots & \vdots & \vdots & \ldots & \vdots \\ ||\mathbf{v}_{l}||^2 & 0 & 0 & \ldots & ||\mathbf{v}_{l}||^4 \end{array}\right)
$$

SciPy(Numpy)

# Input: form G
ns, nvj = n, norm(V, axis=0)
u, v, Q = np.mat(ns), np.mat(nvj**2), np.diag(nvj**4)
G = np.bmat([[u, v], [v.T, Q]])
# form q
e2, Ve2 = ec*e, np.multiply(Vc*e, Vc*e)
q = np.vstack((e2, Ve2))
# body of the algorithm
for b in product([0,1], repeat=l):
    # set the KKT-conditions matrix K
    K = G*1
    for j in range(1,l+1):
        if b[j-1] == 0: K[0,j], K[j,j] = 0, -1
    # calculate the auxiliary vector g
    g = inv(K)*q
    # test non-negativity of g
    if (g >= 0).all(): break
# Output: form estimates nu
nu = g*1
for j in range(1,l+1):
    if b[j-1] == 0: nu[j] = 0
NN_DOOLSE = np.array(nu).flatten()
print(NN_DOOLSE, norm(NN_DOOLSE), b)
[0.0559510405 0.0239204818 0.0139411342] 0.06242646556962567 (1, 1)

CVXPY

nonnegative DOOLSE as a convex optimization problem

$$
\begin{array}{ll}
\textit{minimize} & \quad f_0(\boldsymbol{\nu})=||\mathbf{e}\mathbf{e}'-\Sigma_{\boldsymbol{\nu}}||^2 \\[6pt]
\textit{subject to} & \quad \boldsymbol{\nu} = \left(\nu_0, \ldots, \nu_l\right)'\in [0, \infty)^{l+1}
\end{array}
$$

where $\Sigma_{\boldsymbol{\nu}} = \nu_0\mathrm{I}_n + \mathrm{VDV'}$ with $\mathrm{D} = \mathrm{diag}\{\nu_1, \ldots, \nu_l\}$, as implemented in the objective below.

# set the optimization variable, objective function
v = Variable(l+1)
fv = sum_squares(e*e.T - v[0]*In - V*diag(v[1:])*V.T)
# construct the problem for DOOLSE
objective = Minimize(fv)
constraints = [v >= 0]
prob = Problem(objective, constraints)
# solve the DOOLSE problem
prob.solve()
NN_DOOLSEcvx = v.value
print('NN-DOOLSEcvx =', NN_DOOLSEcvx, 'norm =', norm(NN_DOOLSEcvx))
NN-DOOLSEcvx = [0.0559510405 0.0239204818 0.0139411342] norm = 0.062426465569625667

CVXPY

using equivalent (RE)MLE convex problem (proposition 5, Hancova et al 2019)

$$
\begin{array}{ll}
\textit{minimize} & \quad f_0(\mathbf{d})=-(n^*\!-l)\ln d_0 - \displaystyle\sum\limits_{j=1}^{l} \ln\left(d_0-d_j||\mathbf{v}_j||^2\right)+d_0\mathbf{e}'\mathbf{e}-\mathbf{e}'\mathrm{V}\,\mathrm{diag}\{d_j\}\mathrm{V}'\mathbf{e} \\[6pt]
\textit{subject to} & \quad d_0 > \max\{d_j||\mathbf{v}_j||^2,\; j = 1, \ldots, l\} \\
& \quad d_j \geq 0,\; j=1,\ldots, l \\[6pt]
& \quad\text{for MLE: } n^* = n, \text{ for REMLE: } n^* = n-k \\[6pt]
\textit{back transformation:} & \quad \nu_0 = \dfrac{1}{d_0}, \quad \nu_j = \dfrac{d_j}{d_0\left(d_0 -d_j||\mathbf{v}_j||^2\right)}
\end{array}
$$

# set variables for the objective
ns = n
d = Variable(l+1)
logdetS = (ns-l)*log(d[0]) + sum(log(d[0]-GV*d[1:]))
# construct the problem
objective = Maximize(logdetS - (d[0]*ec*e - ec*V*diag(d[1:])*Vc*e))
constraints = [0 <= d[1:], max(GV*d[1:]) <= d[0]]
prob = Problem(objective, constraints)
# solve the problem
solution = prob.solve()
dv = d.value.tolist()
# back transformation
s0 = [1/dv[0]]
sj = [dv[i]/(dv[0]*(dv[0]-dv[i]*GV[i-1,i-1])) for i in range(1,l+1)]
sv = s0+sj
print('MLEcvx = ', sv, ' norm = ', norm(sv))
MLEcvx = [0.0559510407058649, 0.023920500360523053, 0.013941147758493946] norm = 0.062426475908794465

$\boldsymbol{2^{nd}}$ stage of EBLUP-NE

SciPy(Numpy)

# numerical results
print(rho2(NN_DOOLSE))
[1, 0.8817032888843341, 0.8094584645596983]
print(EBLUPNE(NN_DOOLSE),norm(EBLUPNE(NN_DOOLSE)))
[0.05934201263868991, 0.02246110683124819, 0.01254282581018068] 0.06467842193034484
# cross-checking
print(EBLUPNEgen(NN_DOOLSE), norm(EBLUPNEgen(NN_DOOLSE)))
[0.05934201263868991, 0.022461106831248263, 0.012542825810180711] 0.06467842193034487

NN-MDOOLSE or REMLE

using the KKT algorithm (tab.3, Hancova et al 2019)

$\boldsymbol{1^{st}}$ stage of EBLUP-NE

KKT algorithm

SciPy(Numpy)

# Input: form G
ns, nvj = n-k, norm(V, axis=0)
u, v, Q = np.mat(ns), np.mat(nvj**2), np.diag(nvj**4)
G = np.bmat([[u, v], [v.T, Q]])
# form q
e2, Ve2 = ec*e, np.multiply(Vc*e, Vc*e)
q = np.vstack((e2, Ve2))
# body of the algorithm
for b in product([0,1], repeat=l):
    # set the KKT-conditions matrix K
    K = G*1
    for j in range(1,l+1):
        if b[j-1] == 0: K[0,j], K[j,j] = 0, -1
    # calculate the auxiliary vector g
    g = inv(K)*q
    # test non-negativity of g
    if (g >= 0).all(): break
# Output: form estimates nu
nu = g*1
for j in range(1,l+1):
    if b[j-1] == 0: nu[j] = 0
NN_MDOOLSE = np.array(nu).flatten()
print(NN_MDOOLSE, norm(NN_MDOOLSE), b)
[0.0593420126 0.0238262881 0.0138469405] 0.06542861936152923 (1, 1)
nu.dtype
dtype('float64')

CVXPY

nonnegative MDOOLSE as a convex optimization problem

$$
\begin{array}{ll}
\textit{minimize} & \quad f_0(\boldsymbol{\nu})=||\mathbf{e}\mathbf{e}'-\mathrm{M_F}\Sigma_{\boldsymbol{\nu}}\mathrm{M_F}||^2 \\[6pt]
\textit{subject to} & \quad \boldsymbol{\nu} = \left(\nu_0, \ldots, \nu_l\right)'\in [0, \infty)^{l+1}
\end{array}
$$

where, for the orthogonal FDSLRM, $\mathrm{M_F}\Sigma_{\boldsymbol{\nu}}\mathrm{M_F} = \nu_0\mathrm{M_F} + \mathrm{VDV'}$, as implemented in the objective below.

# the optimization variable, objective function
v = Variable(l+1)
fv = sum_squares(e*ec - v[0]*MF - V*diag(v[1:])*Vc)
# the optimization problem for MDOOLSE
objective = Minimize(fv)
constraints = [v >= 0]
prob = Problem(objective, constraints)
# solve the MDOOLSE problem
prob.solve()
NN_MDOOLSEcvx = v.value
print('NN-MDOOLSEcvx =', NN_MDOOLSEcvx, 'norm =', norm(NN_MDOOLSEcvx))
NN-MDOOLSEcvx = [0.0593420126 0.0238262881 0.0138469405] norm = 0.06542861936152923

CVXPY

using equivalent (RE)MLE convex problem (proposition 5, Hancova et al 2019)

$$
\begin{array}{ll}
\textit{minimize} & \quad f_0(\mathbf{d})=-(n^*\!-l)\ln d_0 - \displaystyle\sum\limits_{j=1}^{l} \ln\left(d_0-d_j||\mathbf{v}_j||^2\right)+d_0\mathbf{e}'\mathbf{e}-\mathbf{e}'\mathrm{V}\,\mathrm{diag}\{d_j\}\mathrm{V}'\mathbf{e} \\[6pt]
\textit{subject to} & \quad d_0 > \max\{d_j||\mathbf{v}_j||^2,\; j = 1, \ldots, l\} \\
& \quad d_j \geq 0,\; j=1,\ldots, l \\[6pt]
& \quad\text{for MLE: } n^* = n, \text{ for REMLE: } n^* = n-k \\[6pt]
\textit{back transformation:} & \quad \nu_0 = \dfrac{1}{d_0}, \quad \nu_j = \dfrac{d_j}{d_0\left(d_0 -d_j||\mathbf{v}_j||^2\right)}
\end{array}
$$

# set variables for the objective
ns = n - k
d = Variable(l+1)
logdetS = (ns-l)*log(d[0]) + sum(log(d[0]-GV*d[1:]))
# construct the problem
objective = Maximize(logdetS - (d[0]*ec*e - ec*V*diag(d[1:])*Vc*e))
constraints = [0 <= d[1:], max(GV*d[1:]) <= d[0]]
prob = Problem(objective, constraints)
# solve the problem
solution = prob.solve()
dv = d.value.tolist()
# back transformation
s0 = [1/dv[0]]
sj = [dv[i]/(dv[0]*(dv[0]-dv[i]*GV[i-1,i-1])) for i in range(1,l+1)]
sv = s0+sj
print('REMLEcvx = ', sv, ' norm = ', norm(sv))
REMLEcvx = [0.05934201333186461, 0.023826291358895236, 0.013846946267044214] norm = 0.06542862238440125

$\boldsymbol{2^{nd}}$ stage of EBLUP-NE

SciPy(Numpy)

# numerical results
print(rho2(NN_MDOOLSE))
[1, 0.8747730479406809, 0.7985571584490208]
print(EBLUPNE(NN_MDOOLSE),norm(EBLUPNE(NN_MDOOLSE)))
[0.05934201263868991, 0.022284561179026965, 0.012373906477520341] 0.06458474814123416
# cross-checking
print(EBLUPNEgen(NN_MDOOLSE), norm(EBLUPNEgen(NN_MDOOLSE)))
[0.05934201263868991, 0.022284561179026784, 0.012373906477520409] 0.0645847481412341

References

This notebook belongs to the supplementary materials of the paper submitted to Statistical Papers and available at https://arxiv.org/abs/1905.07771.

Abstract of the paper

We propose a two-stage estimation method of variance components in time series models known as FDSLRMs, whose observations can be described by a linear mixed model (LMM). We based estimating variances, fundamental quantities in a time series forecasting approach called kriging, on the empirical (plug-in) best linear unbiased predictions of unobservable random components in FDSLRM.

The method, providing invariant non-negative quadratic estimators, can be used for any absolutely continuous probability distribution of time series data. As a result of applying the convex optimization and the LMM methodology, we resolved two problems - theoretical existence and equivalence between least squares estimators, non-negative (M)DOOLSE, and maximum likelihood estimators, (RE)MLE, as possible starting points of our method and a practical lack of computational implementation for FDSLRM. As for computing (RE)MLE in the case of $n$ observed time series values, we also discovered a new algorithm of order $\mathcal{O}(n)$, which at the default precision is $10^7$ times more accurate and $n^2$ times faster than the best current Python (or R)-based computational packages, namely CVXPY, CVXR, nlme, sommer and mixed.

We illustrate our results on three real data sets - electricity consumption, tourism and cyber security - which are easily available, reproducible, sharable and modifiable in the form of interactive Jupyter notebooks.