User:Iaosui/sandbox

Functional linear regression models

Functional linear models can be viewed as an extension of the traditional multivariate linear models that associate vector responses with vector covariates. The traditional linear model with scalar response $Y\in \mathbb {R}$ and vector covariate $X\in \mathbb {R} ^{p}$ can be expressed as

Y=\beta _{0}+\langle X,\beta \rangle +\varepsilon =\beta _{0}+X_{1}\beta _{1}+\dots +X_{p}\beta _{p}+\varepsilon ,

(2)

where $\langle \cdot ,\cdot \rangle$ denotes the inner product in Euclidean space, $\beta _{0}\in \mathbb {R}$ and $\beta \in \mathbb {R} ^{p}$ denote the regression coefficients, and $\varepsilon$ is a zero mean finite variance random error (noise). Functional linear models can be divided into two types based on the responses.

Functional regression models with scalar response

Replacing the vector covariate $X$ and the coefficient vector $\beta$ in model (2) by a centered functional covariate $X^{c}(t)=X(t)-\mu (t)$ and coefficient function $\beta =\beta (t)$ for $t\in {\mathcal {I}}=[0,T]$ and replacing the inner product in Euclidean space by that in Hilbert space $L^{2}$ , one arrives at the functional linear model

Y=\beta _{0}+\langle X^{c},\beta \rangle +\varepsilon =\beta _{0}+\int _{\mathcal {I}}X^{c}(t)\beta (t)\,dt+\varepsilon .

(3)

One ad hoc approach to estimating $\beta _{0}$ and $\beta (t)$ is to expand the covariate $X^{c}$ and the coefficient function $\beta$ in the same functional basis, such as the B-spline basis or the eigenbasis used in the FPCA expansion (1). Specifically, consider an orthonormal basis $\{\phi _{k}\}_{k=1}^{\infty }$ of the function space. Expanding $X^{c}$ and $\beta$ in this basis leads to $X^{c}(t)=\sum _{k=1}^{\infty }A_{k}\phi _{k}(t)$ and $\beta (t)=\sum _{k=1}^{\infty }\beta _{k}\phi _{k}(t)$ , and model (3) is seen to be equivalent to a traditional linear model (2) of the form

Y=\beta _{0}+\sum _{k=1}^{\infty }\beta _{k}A_{k}+\varepsilon ,

(4)

where in implementations the sum on the right-hand side is replaced by a finite sum truncated at the first $K$ terms. In addition to the basis-expansion approach, a penalized approach using either P-splines or smoothing splines has also been studied.^[1] In the special case where the basis functions $\phi _{k}$ are selected as the eigenfunctions $\varphi _{k}$ of $X$ (see (1)), the basis representation approach in (4) is equivalent to conducting a principal component regression, albeit with an increasing number of principal components.
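The truncated regression (4) can be sketched numerically. The following is a minimal illustration rather than a reference implementation. It assumes curves observed without noise on a common, equally spaced grid, takes the sample eigenfunctions as the basis $\phi _{k}$ (the principal component regression case), and approximates integrals by Riemann sums. The function name `fit_scalar_on_function` and all simulation settings are hypothetical.

```python
import numpy as np

def fit_scalar_on_function(X, y, t, K):
    """Least-squares fit of model (4) truncated at K terms.

    X : (n, m) array, curves X_i evaluated on the grid t (assumed equally spaced)
    y : (n,) array of scalar responses
    Returns the intercept estimate and beta(t) evaluated on the grid.
    """
    dt = t[1] - t[0]
    Xc = X - X.mean(axis=0)                    # centered covariate X^c
    cov = Xc.T @ Xc / len(X)                   # discretized covariance surface
    evals, evecs = np.linalg.eigh(cov * dt)    # discretized covariance operator
    top = np.argsort(evals)[::-1][:K]
    phi = evecs[:, top].T / np.sqrt(dt)        # rows: phi_k, with sum(phi_k**2) * dt == 1
    A = Xc @ phi.T * dt                        # A_ik = <X_i^c, phi_k>
    design = np.column_stack([np.ones(len(X)), A])
    coef = np.linalg.lstsq(design, y, rcond=None)[0]
    return coef[0], coef[1:] @ phi             # beta_0 and sum_k beta_k phi_k(t)

# Simulated example (all settings below are illustrative assumptions).
rng = np.random.default_rng(0)
n, m = 200, 101
t = np.linspace(0.0, 1.0, m)
dt = t[1] - t[0]
basis = np.array([np.sqrt(2) * np.sin(k * np.pi * t) for k in (1, 2, 3)])
scores = rng.normal(size=(n, 3)) * np.array([2.0, 1.4, 1.0])
X = scores @ basis
beta_true = basis[0] - 0.5 * basis[1]
y = 2.0 + X @ beta_true * dt + rng.normal(scale=0.1, size=n)

beta0_hat, beta_hat = fit_scalar_on_function(X, y, t, K=3)
print(beta0_hat, np.max(np.abs(beta_hat - beta_true)))
```

Because the scores are centered, the fitted intercept coincides with the sample mean of the responses. In practice $K$ would be chosen by a data-driven criterion such as cross-validation or the fraction of variance explained, which this sketch does not attempt.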

The simple functional linear model (3) can be extended to multiple functional covariates, $\{X_{j}\}_{j=1}^{p}$ , also including additional vector covariates $Z=(Z_{1},\cdots ,Z_{q})$ , where $Z_{1}=1$ , by

Y=\langle Z,\theta \rangle +\sum _{j=1}^{p}\int _{{\mathcal {I}}_{j}}X_{j}^{c}(t)\beta _{j}(t)\,dt+\varepsilon ,

(5)

where $\theta \in \mathbb {R} ^{q}$ is the regression coefficient for $Z$ , ${\mathcal {I}}_{j}$ is the interval on which $X_{j}$ is defined, $X_{j}^{c}$ is the centered functional covariate given by $X_{j}^{c}(t)=X_{j}(t)-\mu _{j}(t)$ , and $\beta _{j}$ is the regression coefficient function for $X_{j}^{c}$ , for $j=1,\ldots ,p$ . Models (3) and (5) have been studied extensively.^[2]^[3]^[4]^[5]

Functional regression models with functional response

For a functional response $Y(s)$ on ${\mathcal {I}}_{Y}$ and multiple functional covariates $X_{j}(t)$ , $t\in {\mathcal {I}}_{X_{j}}$ , two major models have been considered.^[6]^[7] One of these two models is generally referred to as the functional linear model (FLM), which can be written as

Y(s)=\alpha _{0}(s)+\sum _{j=1}^{p}\int _{{\mathcal {I}}_{X_{j}}}\alpha _{j}(s,t)X_{j}^{c}(t)\,dt+\varepsilon (s),\ {\text{for}}\ s\in {\mathcal {I}}_{Y}

(6)

where $\alpha _{0}(s)$ is the functional intercept and, for $j=1,\ldots ,p$ , $X_{j}^{c}(t)=X_{j}(t)-\mu _{j}(t)$ is a centered functional covariate on ${\mathcal {I}}_{X_{j}}$ , $\alpha _{j}(s,t)$ is the corresponding functional slope defined on ${\mathcal {I}}_{Y}\times {\mathcal {I}}_{X_{j}}$ , and $\varepsilon (s)$ is usually a random process with mean zero and finite variance.^[6] In this case, at any given time $s\in {\mathcal {I}}_{Y}$ , the value of $Y$ , i.e., $Y(s)$ , depends on the entire trajectories of $\{X_{j}(t)\}_{j=1}^{p}$ . In particular, taking $X_{j}(\cdot )$ as a constant function yields a special case of model (6), $Y(s)=\alpha _{0}(s)+\sum _{j=1}^{p}X_{j}\alpha _{j}(s)+\varepsilon (s),\ {\text{for}}\ s\in {\mathcal {I}}_{Y},$ which is a functional linear model with functional responses and scalar covariates. Model (6) has also been studied extensively.^[8]^[9]^[10]^[11]^[12]

Concurrent regression model

The second model assumes that ${\mathcal {I}}_{Y}={\mathcal {I}}_{X_{1}}=\ldots ={\mathcal {I}}_{X_{p}}={\mathcal {I}}$ and is most often referred to as a "varying-coefficient" model. It can be written as

Y(s)=\beta _{0}(s)+\sum _{j=1}^{p}\beta _{j}(s)X_{j}(s)+\varepsilon (s),\ {\text{for}}\ s\in {\mathcal {I}},

(7)

where $X_{1},\ldots ,X_{p}$ are multiple functional covariates on ${\mathcal {I}}$ , $\beta _{0},\beta _{1},\ldots ,\beta _{p}$ are the coefficient functions defined on the same interval and $\varepsilon (s)$ is usually assumed to be a random process with mean zero and finite variance.^[6] This model assumes that the value of $Y(s)$ depends on the current value of $\{X_{j}(s)\}_{j=1}^{p}$ only and not the history $\{X_{j}(t):t\leq s\}_{j=1}^{p}$ or future value, hence it is a "concurrent regression model". Various estimation methods can be applied to model (7).^[13]^[14]^[15]^[16]^[17]^[18]
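Since model (7) only links $Y(s)$ to the covariates at the same $s$ , one simple way to illustrate it is an ordinary least-squares fit carried out separately at each grid point. The sketch below is only one of many possible estimators and is not taken from the cited references. It assumes all curves are observed on a common grid, and the function name `fit_concurrent` and the simulated coefficient functions are hypothetical.

```python
import numpy as np

def fit_concurrent(Y, X):
    """Pointwise least squares for the concurrent model.

    Y : (n, m) responses on a common grid of m points
    X : (n, p, m) covariates on the same grid
    Returns B with shape (p + 1, m); B[0] estimates beta_0, B[j] estimates beta_j.
    """
    n, p, m = X.shape
    B = np.empty((p + 1, m))
    for k in range(m):
        design = np.column_stack([np.ones(n), X[:, :, k]])
        B[:, k] = np.linalg.lstsq(design, Y[:, k], rcond=None)[0]
    return B

# Simulated example (illustrative settings only).
rng = np.random.default_rng(1)
n, p, m = 300, 2, 50
s = np.linspace(0.0, 1.0, m)
beta = np.vstack([np.sin(2 * np.pi * s), s, 1.0 - s**2])   # beta_0, beta_1, beta_2
X = rng.normal(size=(n, p, m))
Y = beta[0] + np.einsum("jm,njm->nm", beta[1:], X) + rng.normal(scale=0.1, size=(n, m))

B_hat = fit_concurrent(Y, X)
print(np.max(np.abs(B_hat - beta)))
```

The pointwise estimates are not smooth in $s$ ; a smoothing step across neighbouring grid points would typically follow and is omitted here.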

Clustering of functional data

For vector-valued multivariate data, hierarchical clustering and the k-means partitioning methods are two classical and popular approaches. Classical clustering concepts for vector-valued multivariate data can typically be extended to functional data, where various considerations arise, such as discrete approximations of distance measures and dimension reduction of the infinite-dimensional functional data objects. Generally, k-means type clustering algorithms have been widely applied to functional data and are more popular than hierarchical clustering algorithms.

Mean functions as cluster centers

It is natural to view cluster mean functions as the cluster centers in functional clustering. Specifically, for a sample of functional data $\{X_{i}(t);i=1,\ldots ,n\}$ , k-means functional clustering aims to find a set of cluster centers $\{\mu ^{c};c=1,\ldots ,L\}$ , assuming there are $L$ clusters, by minimizing the sum of the squared distances between $\{X_{i}\}$ and the cluster centers that are associated with their cluster labels $\{C_{i};i=1,\ldots ,n\}$ , for a suitable functional distance $d$ . The distance $d$ is often chosen as the $L^{2}$ norm. The traditional k-means clustering for vector-valued multivariate data has been extended to functional data using mean functions as cluster centers, and one can distinguish two typical approaches, as follows.

Functional clustering via functional basis expansion

Given a set of pre-specified basis functions $\{\phi _{1},\phi _{2},\ldots \}$ of the function space, the first $K$ projections $\{B_{ik}\}$ of the observed trajectories onto the space spanned by the set of basis functions can be used to represent the functional data, where $B_{ik}=\langle X_{i}^{c},\phi _{k}\rangle$ , $k=1,\ldots ,K$ . The distributional patterns of $\{B_{ik}\}$ then reflect the clusters in function space. One then applies an available clustering algorithm for multivariate data, such as the k-means algorithm, to partition the estimated sets of coefficients. Finally, one obtains cluster centers $\{{\bar {B}}_{k}^{c}\}_{k=1}^{K}$ on the projected space, and thus the set of cluster centers in the function space $\{{\hat {\mu }}^{c};c=1,\ldots ,L\}$ , where ${\hat {\mu }}^{c}(t)=\sum _{k=1}^{K}{\bar {B}}_{k}^{c}\phi _{k}(t)$ . For good performance, this method requires a judicious choice of basis functions, and many choices have been studied, such as B-spline basis^[19], Fourier basis^[20], P-spline basis^[21], Gaussian orthonormalized basis^[22] and wavelet basis^[23] functions.
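The steps above (project, cluster the coefficients, map the centers back) can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of any cited reference. It uses a three-term orthonormal Fourier basis, a plain Lloyd-type k-means with a deterministic farthest-point start (a choice made here only for reproducibility), Riemann-sum inner products on an equally spaced grid, and simulated curves whose settings are hypothetical.

```python
import numpy as np

def kmeans(B, L, n_iter=100):
    """Plain Lloyd iterations with a deterministic farthest-point start."""
    centers = [B[0]]
    for _ in range(1, L):
        d2 = np.min([((B - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(B[np.argmax(d2)])
    centers = np.array(centers)
    for _ in range(n_iter):
        d2 = ((B[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        new = np.array([B[labels == c].mean(axis=0) if np.any(labels == c)
                        else centers[c] for c in range(L)])
        if np.allclose(new, centers):
            break
        centers = new
    return labels, centers

# Simulated curves from two groups with different mean functions.
rng = np.random.default_rng(2)
n, m, K, L = 60, 100, 3, 2
t = np.linspace(0.0, 1.0, m, endpoint=False)
dt = 1.0 / m
phi = np.array([np.ones(m),
                np.sqrt(2) * np.sin(2 * np.pi * t),
                np.sqrt(2) * np.cos(2 * np.pi * t)])     # orthonormal Fourier basis
truth = np.repeat([0, 1], n // 2)
means = np.array([np.sin(2 * np.pi * t), np.cos(2 * np.pi * t)])
X = (means[truth]
     + rng.normal(scale=0.1, size=(n, 1))                # random vertical shift
     + rng.normal(scale=0.1, size=(n, m)))               # measurement noise

Xc = X - X.mean(axis=0)                  # centered curves X_i^c
B = Xc @ phi.T * dt                      # B_ik = <X_i^c, phi_k>
labels, B_bar = kmeans(B, L)
mu_hat = B_bar @ phi                     # cluster centers of the centered curves
print(labels)
```

Adding the overall sample mean function back to each row of `mu_hat` gives centers on the original, uncentered scale.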

Functional clustering via FPCA

In contrast to the functional basis expansion approach that requires a pre-specified set of basis functions, the finite approximation FPCA approach (1) using the FPCs employs data-adaptive basis functions that are determined by the covariance function of the functional data. Then the FPC scores $\{A_{ik}\}$ play a similar role as the basis coefficients $\{B_{ik}\}$ for clustering.^[24]

When cluster mean functions suffice to define the clusters, this approach is adequate. However, when covariance structures also help to distinguish clusters, taking mean functions as cluster centers is not adequate, as discussed below.

Subspaces as cluster centers

Since functional data are realizations of random functions, it is natural to use differences in the stochastic structure of random functions for clustering. This idea is particularly sensible in functional data clustering, utilizing the truncated Karhunen–Loève representation in (1). Then the subspace spanned by the components of the expansion, the mean function and the set of the eigenfunctions, can be used to characterize clusters. Therefore, clusters of the data set are identified via subspace projection such that cluster centers hinge on the stochastic structure of the random functions, rather than the mean functions only.

The FPC subspace-projected k-centers functional clustering approach uses subspaces as cluster centers.^[25] Let $C$ be the cluster membership variable, and let the FPC subspace of cluster $c$ be ${\mathcal {S}}^{c}=\{\mu ^{c},\varphi _{1}^{c},\ldots ,\varphi _{K_{c}}^{c}\}$ , $c=1,\ldots ,L$ , assuming that there are $L$ clusters. The projected function of $X_{i}$ onto the FPC subspace ${\mathcal {S}}^{c}$ can be written as

{\tilde {X}}_{i}^{c}(t)=\mu ^{c}(t)+\sum _{k=1}^{K_{c}}A_{ik}^{c}\varphi _{k}^{c}(t).

(8)

One aims to find the set of cluster centers $\{{\mathcal {S}}^{c};c=1,\ldots ,L\}$ , such that the best cluster membership of $X_{i}$ , $c^{*}(X_{i})$ , is determined by minimizing the discrepancy between the projected function ${\tilde {X}}_{i}^{c}$ and the observation $X_{i}$ ,

c^{*}(X_{i})={\underset {c\in \{1,\ldots ,L\}}{\operatorname {arg\,min} }}d^{2}(X_{i},{\tilde {X}}_{i}^{c}).

(9)

In contrast, k-means clustering aims to find the set of cluster sample means as the cluster centers, instead of the subspaces spanned by ${\mathcal {S}}^{c}=\{\mu ^{c},\varphi _{1}^{c},\ldots ,\varphi _{K_{c}}^{c}\}$ . The initial step of the subspace-projected clustering procedure uses only $\mu ^{c}$ , which reduces to k-means functional clustering. In the subsequent iteration steps, the mean function and the set of eigenfunctions for each cluster are updated and used to identify the set of cluster subspaces $\{{\mathcal {S}}^{c}\}$ , until the iterations converge. This functional clustering approach simultaneously identifies the structural components of the stochastic representation for each cluster.
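The alternation between (8) and (9) can be sketched in simplified form. The code below is an illustration under strong assumptions and is not the algorithm of the cited reference. It assumes noise-light curves on a common grid, a common subspace dimension $K_{c}=K$ for all clusters, eigenfunctions computed from the singular value decomposition of each cluster's centered data matrix, an arbitrary deterministic starting partition, and a fixed number of mean-only (k-means) steps before the eigenfunctions are used. All function names are hypothetical.

```python
import numpy as np

def fit_subspaces(X, labels, L, K, prev=None):
    """Mean and leading K eigenfunctions (on the grid) for each cluster."""
    subspaces = []
    for c in range(L):
        Xc = X[labels == c]
        if len(Xc) == 0:                       # empty cluster: keep the old center
            subspaces.append(prev[c])
            continue
        mu = Xc.mean(axis=0)
        _, _, Vt = np.linalg.svd(Xc - mu, full_matrices=False)
        subspaces.append((mu, Vt[:K]))         # rows of Vt are orthonormal grid vectors
    return subspaces

def assign(X, subspaces):
    """Rule (9): pick the cluster whose projection (8) is closest to X_i."""
    d2 = np.empty((len(X), len(subspaces)))
    for c, (mu, phi) in enumerate(subspaces):
        R = X - mu
        R = R - (R @ phi.T) @ phi              # residual X_i minus its projection
        d2[:, c] = (R ** 2).sum(axis=1)        # squared L2 distance up to the grid spacing
    return d2.argmin(axis=1), d2.min(axis=1)

def k_centers(X, L, K, n_iter=20, n_mean_only=5):
    labels = np.arange(len(X)) % L             # arbitrary deterministic start
    subspaces, history = None, []
    for it in range(n_iter):
        k_use = 0 if it < n_mean_only else K   # first steps use the means only
        subspaces = fit_subspaces(X, labels, L, k_use, prev=subspaces)
        labels, d2 = assign(X, subspaces)
        history.append(d2.sum())               # total squared discrepancy
    return labels, subspaces, history

# Two simulated clusters with equal (zero) means but different dominant modes.
rng = np.random.default_rng(6)
n, m, L, K = 80, 50, 2, 1
t = np.linspace(0.0, 1.0, m, endpoint=False)
truth = np.repeat([0, 1], n // 2)
modes = np.array([np.sin(2 * np.pi * t), np.cos(2 * np.pi * t)])
X = rng.normal(size=(n, 1)) * modes[truth] + rng.normal(scale=0.05, size=(n, m))

labels, subspaces, history = k_centers(X, L, K)
print(history[0], history[-1])
```

Each update of the subspaces and each reassignment can only lower the total squared discrepancy, so `history` is non-increasing. The iteration may still stop at a local optimum, and the recovered partition depends on the starting values.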

Functional clustering with mixture models

Model-based clustering^[26] based on mixture models is widely used in clustering vector-valued multivariate data and has been extended to functional data clustering. Similarly to k-means type functional data clustering, typical mixture model-based approaches first project the infinite-dimensional functional data onto low-dimensional subspaces. For example, functional clustering models based on Gaussian mixture^[27]^[28]^[29] distributions can be applied to the natural cubic spline basis coefficients, with emphasis on clustering sparsely sampled functional data. Furthermore, random effects modeling also provides a model-based clustering approach, using mixed effects models with B-splines or P-splines.^[21] For clustering longitudinal data, a linear mixed model for clustering using a penalized normal mixture as random effects distribution has been studied.^[30] Bayesian hierarchical clustering also plays an important role in the development of model-based functional clustering.^[31]^[32]^[33]^[34]

Clustering of functional data

For vector-valued multivariate data, k-means partitioning methods and hierarchical clustering are two main approaches. Classical clustering concepts for vector-valued multivariate data can typically be extended to functional data, where various considerations arise, such as discrete approximations of distance measures and dimension reduction of the infinite-dimensional functional data objects. Generally, k-means type clustering algorithms have been widely applied to functional data and are more popular than hierarchical clustering algorithms. For k-means clustering, mean functions are usually viewed as the cluster centers. Specifically, for a sample of functional data $\{X_{i}(t);i=1,\ldots ,n\}$ , k-means functional clustering aims to find a set of cluster centers $\{\mu ^{c};c=1,\ldots ,L\}$ , assuming there are $L$ clusters, by minimizing the sum of the squared distances between $\{X_{i}(t)\}$ and the cluster centers that are associated with their cluster labels $\{C_{i};i=1,\ldots ,n\}$ , for a suitable functional distance $d$ . Typically, functional basis expansion^[35]^[36]^[37]^[38]^[39] or FPCA^[40] is used for dimension reduction. However, when covariance structures also help to distinguish clusters, taking mean functions as cluster centers is not adequate, and the FPC subspace-projected k-centers functional clustering approach, which uses subspaces as cluster centers, can be considered instead.^[41] Specifically, let $C$ be the cluster membership variable, and let the FPC subspace of cluster $c$ be ${\mathcal {S}}^{c}=\{\mu ^{c},\varphi _{1}^{c},\ldots ,\varphi _{K_{c}}^{c}\}$ , $c=1,\ldots ,L$ , assuming that there are $L$ clusters.
The projected function of $X_{i}$ onto the FPC subspace ${\mathcal {S}}^{c}$ can be written as ${\tilde {X}}_{i}^{c}(t)=\mu ^{c}(t)+\sum _{k=1}^{K_{c}}A_{ik}^{c}\varphi _{k}^{c}(t).$ One aims to find the set of cluster centers $\{{\mathcal {S}}^{c};c=1,\ldots ,L\}$ , such that the best cluster membership of $X_{i}$ , $c^{*}(X_{i})$ , is determined by minimizing the discrepancy between the projected function ${\tilde {X}}_{i}^{c}$ and the observation $X_{i}$ . The initial step of the subspace-projected clustering procedure uses only $\mu ^{c}$ , which reduces to k-means functional clustering. In the subsequent iteration steps, the mean function and the set of eigenfunctions for each cluster are updated and used to identify the set of cluster subspaces $\{{\mathcal {S}}^{c}\}$ , until the iterations converge. This functional clustering approach simultaneously identifies the structural components of the stochastic representation for each cluster. Furthermore, model-based clustering^[42] based on mixture models is widely used in clustering vector-valued multivariate data and has been extended to functional data clustering. Similarly to k-means type functional data clustering, typical mixture model-based approaches first project the infinite-dimensional functional data onto low-dimensional subspaces. For example, functional clustering models based on Gaussian mixture^[43]^[44]^[45] distributions can be applied to the natural cubic spline basis coefficients, with emphasis on clustering sparsely sampled functional data.
Furthermore, random effects modeling also provides a model-based clustering approach, using mixed effects models with B-splines or P-splines.^[37] For clustering longitudinal data, a linear mixed model for clustering using a penalized normal mixture as random effects distribution has been studied.^[46] Bayesian hierarchical clustering also plays an important role in the development of model-based functional clustering.^[47]^[48]^[49]^[50]

Most recent developments in FDA

Functional designs
Domain selection problems
Dependent functional data, such as functional time series
Multivariate functional data
Spatially indexed functional data
“Second Generation” functional data

Applications

Continuous tracking and monitoring of movement and health data
Traffic flow data continuously recorded over time by arrays of sensors
Continuously recorded climate and weather data
Transcription factor count modeling along the genome
Analysis of auction data
Volatility data


Functional data analysis (FDA) is a branch of statistics that analyzes data providing information about curves, surfaces or anything else varying over a continuum. In its most general form, under an FDA framework, each sample element of functional data is considered to be a function. The physical continuum over which these functions are defined is often time, but may also be spatial location, wavelength, probability, etc. Intrinsically, functional data are infinite dimensional. The high intrinsic dimensionality of these data brings challenges for theory as well as computation, where these challenges vary with how the functional data were sampled. However, the high or infinite dimensional structure of the data is a rich source of information and there are many interesting challenges for research and data analysis.

History

Functional data analysis has roots going back to work by Grenander and Karhunen in the 1940s and 1950s.^[51]^[52]^[53]^[54] They considered the decomposition of square-integrable continuous-time stochastic processes into eigencomponents, now known as the Karhunen-Loève decomposition. A rigorous analysis of functional principal components analysis was done in the 1970s by Kleffe, Dauxois and Pousse, including results about the asymptotic distribution of the eigenvalues.^[55]^[56] More recently, in the 1990s and 2000s, the field has focused more on applications and on understanding the effects of dense and sparse observation schemes. The term "Functional Data Analysis" was coined by James O. Ramsay.^[57]

Mathematical formalism

Random functions can be viewed as random elements taking values in a Hilbert space, or as a stochastic process. The former is mathematically convenient, whereas the latter is somewhat more suitable from an applied perspective. These two approaches coincide if the random functions are continuous and a condition called mean-squared continuity is satisfied.^[58]

Hilbertian random variables

In the Hilbert space viewpoint, one considers an $H$ -valued random element $X$ , where $H$ is a separable Hilbert space such as the space of square-integrable functions $L^{2}[0,1]$ . Under the integrability condition that $\mathbb {E} \|X\|_{L^{2}}^{2}=\mathbb {E} (\int _{0}^{1}|X(t)|^{2}dt)<\infty$ , one can define the mean of $X$ as the unique element $\mu \in H$ satisfying

\mathbb {E} \langle X,h\rangle =\langle \mu ,h\rangle ,\qquad h\in H.

This formulation is the Pettis integral, but the mean can also be defined as $\mu =\mathbb {E} X$ in the Bochner sense. Under the integrability condition that $\mathbb {E} \|X\|_{L^{2}}^{2}$ is finite, the covariance operator of $X$ is a linear operator ${\mathcal {C}}:H\to H$ that is uniquely defined by the relation

{\mathcal {C}}h=\mathbb {E} [\langle h,X-\mu \rangle (X-\mu )],\qquad h\in H,

or, in tensor form, ${\mathcal {C}}=\mathbb {E} [(X-\mu )\otimes (X-\mu )]$ . The spectral theorem allows one to decompose $X$ via the Karhunen-Loève decomposition

X=\mu +\sum _{i=1}^{\infty }\langle X-\mu ,\varphi _{i}\rangle \varphi _{i},

where $\varphi _{i}$ are eigenvectors of ${\mathcal {C}}$ , corresponding to the nonnegative eigenvalues of ${\mathcal {C}}$ , in non-increasing order. Truncating this infinite series to a finite order underpins functional principal component analysis.

Stochastic processes

The Hilbertian point of view is mathematically convenient, but abstract; the above considerations do not necessarily even view $X$ as a function at all, since common choices of $H$ like $L^{2}[0,1]$ and Sobolev spaces consist of equivalence classes, not functions. The stochastic process perspective views $X$ as a collection of random variables

\{X(t)\}_{t\in [0,1]}

indexed by the unit interval (or more generally some compact metric space $K$ ). The mean and covariance functions are defined in a pointwise manner as

\mu (t)=\mathbb {E} X(t),\qquad \Sigma (s,t)={\textrm {Cov}}(X(s),X(t)),\qquad s,t\in [0,1]

(if $\mathbb {E} [X(t)^{2}]<\infty$ for all $t\in [0,1]$ ). We can hope to view $X$ as a random element on the Hilbert function space $H=L^{2}[0,1]$ . However, additional conditions are required for such a pursuit to be fruitful, since if we let $X(t)$ be Gaussian white noise, i.e. $X(t)$ is standard Gaussian and independent from $X(s)$ for any $s\neq t$ in $[0,1]$ , it is clear that we have no hope of viewing this as a square-integrable function.

A convenient sufficient condition is mean square continuity, stipulating that $\mu$ and $\Sigma$ are continuous functions. In this case $\Sigma$ defines a covariance operator ${\mathcal {C}}:H\to H$ by

({\mathcal {C}}f)(t)=\int _{K}\Sigma (s,t)f(s)\,\mathrm {d} s.

The spectral theorem applies to ${\mathcal {C}}$ , yielding eigenpairs $(\lambda _{j},\varphi _{j})$ , so that in tensor product notation ${\mathcal {C}}$ can be written as

{\mathcal {C}}=\sum _{j=1}^{\infty }\lambda _{j}\varphi _{j}\otimes \varphi _{j}.

Moreover, since ${\mathcal {C}}f$ is continuous for all $f\in H$ , all the $\varphi _{j}$ are continuous. Mercer's theorem then states that the covariance function $\Sigma$ admits an analogous decomposition

\sup _{s,t\in [0,1]}\left|\Sigma (s,t)-\sum _{j=1}^{K}\lambda _{j}\varphi _{j}(s)\varphi _{j}(t)\right|\to 0,\qquad K\to \infty .

Finally, under the extra assumption that $X$ has continuous sample paths, namely that with probability one, the random function $X:[0,1]\to \mathbb {R}$ is continuous, the Karhunen-Loève expansion above holds for $X$ and the Hilbert space machinery can be subsequently applied. Continuity of sample paths can be shown using the Kolmogorov continuity theorem.
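The eigendecomposition of the covariance operator and the uniform convergence in Mercer's theorem can be checked numerically in a case where the answer is known in closed form. The sketch below uses standard Brownian motion on $[0,1]$ as an illustrative example (a choice made here, not one from the text above): its covariance function is $\min(s,t)$ , with eigenvalues $1/((j-1/2)^{2}\pi ^{2})$ and eigenfunctions ${\sqrt {2}}\sin((j-1/2)\pi t)$ . The integral operator is discretized by a midpoint rule, and the grid size is an arbitrary assumption.

```python
import numpy as np

m = 200
t = (np.arange(m) + 0.5) / m              # midpoint grid on [0, 1]
h = 1.0 / m
Sigma = np.minimum.outer(t, t)            # Brownian-motion covariance min(s, t)

# Discretized operator (C f)(t) = int Sigma(s, t) f(s) ds, then its eigenpairs.
evals, evecs = np.linalg.eigh(Sigma * h)
order = np.argsort(evals)[::-1]
lam = evals[order]
phi = evecs[:, order].T / np.sqrt(h)      # normalized so that sum(phi_j**2) * h == 1

j = np.arange(1, 6)
lam_exact = 1.0 / ((j - 0.5) ** 2 * np.pi ** 2)
print(np.round(lam[:5], 5), np.round(lam_exact, 5))

def sup_error(K):
    """Largest grid value of |Sigma(s,t) - sum_{j<=K} lam_j phi_j(s) phi_j(t)|."""
    approx = (phi[:K].T * lam[:K]) @ phi[:K]
    return np.max(np.abs(Sigma - approx))

errors = {K: sup_error(K) for K in (1, 5, 20)}
print(errors)
```

The leading numerical eigenvalues should agree closely with the closed-form values, and the sup-norm truncation error shrinks as $K$ grows, in line with the displayed limit.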

Functional data designs

Functional data are considered as realizations of a stochastic process $X(t),t\in [0,1]$ that is an $L^{2}$ process on a bounded and closed interval $[0,1]$ with mean function $\mu (t)=\mathbb {E} (X(t))$ and covariance function $\Sigma (s,t)={\textrm {Cov}}(X(s),X(t))$ . The realization of the process for the i-th subject is $X_{i}(\cdot )$ , and the sample is assumed to consist of $n$ independent subjects. The sampling schedule may vary across subjects, denoted as $T_{i1},...,T_{iN_{i}}$ for the i-th subject. The corresponding i-th observation is denoted as ${\textbf {X}}_{i}=(X_{i1},...,X_{iN_{i}})$ , where $X_{ij}=X_{i}(T_{ij})$ . In addition, the measurement of $X_{ij}$ is assumed to have random noise $\epsilon _{ij}$ with $\mathbb {E} (\epsilon _{ij})=0$ and ${\textrm {Var}}(\epsilon _{ij})=\sigma _{ij}^{2}$ , which are independent across $i$ and $j$ .

1. Fully observed functions without noise at arbitrarily dense grid

Measurements $Y_{it}=X_{i}(t)$ available for all $t\in {\mathcal {I}},\,i=1,\ldots ,n$

Often unrealistic but mathematically convenient.

Real life example: Tecator spectral data.^[57]

2. Dense design with noisy measurements

Measurements $Y_{ij}=X_{i}(T_{ij})+\varepsilon _{ij}$ , where $T_{ij}$ are recorded on a regular grid, $T_{i1},\ldots ,T_{iN_{i}}$ , and $N_{i}\rightarrow \infty$ applies to typical functional data.

Real life example: Berkeley Growth Study Data and Stock data

3. Sparse design with noisy measurements (longitudinal data)

Measurements $Y_{ij}=X_{i}(T_{ij})+\varepsilon _{ij}$ , where $T_{ij}$ are random times and their number $N_{i}$ per subject is random and finite.

Real life example: CD4 count data for AIDS patients.^[59]
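The difference between the dense and sparse designs can be made concrete with simulated data. The sketch below is illustrative only. The two-term Fourier model for the trajectories, the noise level, the grid size and the range of $N_{i}$ are all assumptions chosen for the example.

```python
import numpy as np

rng = np.random.default_rng(3)
n, sigma = 5, 0.1

def draw_curve(rng):
    """One smooth random trajectory X_i(t) (a hypothetical two-term Fourier model)."""
    a, b = rng.normal(size=2)
    return lambda t: a * np.sin(2 * np.pi * t) + b * np.cos(2 * np.pi * t)

curves = [draw_curve(rng) for _ in range(n)]

# Design 2: dense, regular grid shared by all subjects, noisy measurements.
T_dense = np.linspace(0.0, 1.0, 51)
Y_dense = np.array([X(T_dense) for X in curves]) + rng.normal(scale=sigma, size=(n, 51))

# Design 3: sparse, random measurement times, random and small N_i per subject.
T_sparse, Y_sparse = [], []
for X in curves:
    N_i = rng.integers(2, 7)                       # N_i drawn from {2, ..., 6}
    T_i = np.sort(rng.uniform(0.0, 1.0, size=N_i))
    T_sparse.append(T_i)
    Y_sparse.append(X(T_i) + rng.normal(scale=sigma, size=N_i))

print(Y_dense.shape, [len(T_i) for T_i in T_sparse])
```

Dense data fit in one rectangular array, whereas sparse data are naturally stored as ragged lists of times and values, which is one reason the two designs call for different estimation strategies.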

Functional principal component analysis

Functional principal component analysis (FPCA) is the most prevalent tool in FDA, partly because FPCA facilitates dimension reduction of the inherently infinite-dimensional functional data to a finite-dimensional random vector of scores. More specifically, dimension reduction is achieved by expanding the underlying observed random trajectories $X_{i}(t)$ in a functional basis that consists of the eigenfunctions of the (auto)covariance operator of $X$ . Consider the (auto)covariance operator ${\mathcal {C}}:L^{2}[0,1]\rightarrow L^{2}[0,1]$ , $({\mathcal {C}}f)(t)=\int _{0}^{1}\Sigma (s,t)f(s)ds$ , which is a compact operator on a Hilbert space. By Mercer's theorem, the kernel of ${\mathcal {C}}$ , i.e., the covariance function $\Sigma (\cdot ,\cdot )$ , has spectral decomposition $\Sigma (s,t)=\sum _{k=1}^{\infty }\lambda _{k}\varphi _{k}(s)\varphi _{k}(t)$ , where the series convergence is absolute and uniform, and $\lambda _{k}$ are real-valued nonnegative eigenvalues in descending order with the corresponding orthonormal eigenfunctions $\varphi _{k}(t)$ . By the Karhunen–Loève theorem, the FPCA expansion of an underlying random trajectory is $X_{i}(t)=\mu (t)+\sum _{k=1}^{\infty }A_{ik}\varphi _{k}(t)$ , where $A_{ik}=\int _{0}^{1}(X_{i}(t)-\mu (t))\varphi _{k}(t)dt$ are the functional principal components (FPCs), sometimes referred to as scores. The Karhunen–Loève expansion facilitates dimension reduction in the sense that the partial sum converges uniformly, i.e., $\sup _{t\in [0,1]}\mathbb {E} [X_{i}(t)-\mu (t)-\sum _{k=1}^{K}A_{ik}\varphi _{k}(t)]^{2}\rightarrow 0$ as $K\rightarrow \infty$ , and thus the partial sum with a large enough $K$ yields a good approximation to the infinite sum. Thereby, the information in $X_{i}$ is reduced from infinite dimensional to a $K$ -dimensional vector $A_{i}=(A_{i1},...,A_{iK})$ with the approximated process:

X_{i}^{(K)}(t)=\mu (t)+\sum _{k=1}^{K}A_{ik}\varphi _{k}(t)

(1)

Other popular bases include spline, Fourier series and wavelet bases. Important applications of FPCA include the modes of variation and functional principal component regression.
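For densely and regularly observed curves, the FPCA quantities above can be approximated by an eigendecomposition of the sample covariance matrix on the grid. The sketch below is a minimal illustration under those assumptions (common equally spaced grid, no measurement noise, Riemann-sum integrals). The function name `fpca` and the simulated data are hypothetical, and sparse or noisy designs require different estimators that are not shown.

```python
import numpy as np

def fpca(X, t, K):
    """Grid-based FPCA for curves X (n, m) observed on an equally spaced grid t."""
    dt = t[1] - t[0]
    mu = X.mean(axis=0)                          # estimate of mu(t)
    Xc = X - mu
    Sigma = Xc.T @ Xc / (len(X) - 1)             # estimate of Sigma(s, t) on the grid
    evals, evecs = np.linalg.eigh(Sigma * dt)    # discretized covariance operator
    top = np.argsort(evals)[::-1][:K]
    lam = evals[top]                             # leading eigenvalues lambda_k
    phi = evecs[:, top].T / np.sqrt(dt)          # eigenfunctions, sum(phi_k**2) * dt == 1
    A = Xc @ phi.T * dt                          # scores A_ik = int (X_i - mu) phi_k dt
    X_K = mu + A @ phi                           # truncated expansion (1)
    return mu, lam, phi, A, X_K

# Simulated curves with two modes of variation around the mean function t.
rng = np.random.default_rng(7)
n, m = 300, 101
t = np.linspace(0.0, 1.0, m)
dt = t[1] - t[0]
basis = np.array([np.sqrt(2) * np.sin(k * np.pi * t) for k in (1, 2)])
X = t + (rng.normal(size=(n, 2)) * np.array([2.0, 1.0])) @ basis

mu, lam, phi, A, X_K = fpca(X, t, K=2)
print(lam, A.var(axis=0, ddof=1))
```

In this construction the sample variance of each score column equals the corresponding eigenvalue, and because the simulated curves have only two modes of variation, the truncated expansion with $K=2$ reproduces them up to rounding error.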

Functional linear regression models

Functional linear models can be viewed as an extension of the traditional multivariate linear models that associate vector responses with vector covariates. The traditional linear model with scalar response $Y\in \mathbb {R}$ and vector covariate $X\in \mathbb {R} ^{p}$ can be expressed as

Y=\beta _{0}+\langle X,\beta \rangle +\varepsilon =\beta _{0}+X_{1}\beta _{1}+\dots +X_{p}\beta _{p}+\varepsilon ,

(2)

where $\langle \cdot ,\cdot \rangle$ denotes the inner product in Euclidean space, $\beta _{0}\in \mathbb {R}$ and $\beta \in \mathbb {R} ^{p}$ denote the regression coefficients, and $\varepsilon$ is a zero mean finite variance random error (noise). Functional linear models can be divided into two types based on the responses.

Functional regression models with scalar response

Replacing the vector covariate $X$ and the coefficient vector $\beta$ in model (2) by a centered functional covariate $X^{c}(t)=X(t)-\mu (t)$ and coefficient function $\beta =\beta (t)$ for $t\in [0,1]$ and replacing the inner product in Euclidean space by that in Hilbert space $L^{2}$ , one arrives at the functional linear model

Y=\beta _{0}+\langle X^{c},\beta \rangle +\varepsilon =\beta _{0}+\int _{0}^{1}X^{c}(t)\beta (t)\,dt+\varepsilon .

(3)

The simple functional linear model (3) can be extended to multiple functional covariates, $\{X_{j}\}_{j=1}^{p}$ , also including additional vector covariates $Z=(Z_{1},\cdots ,Z_{q})$ , where $Z_{1}=1$ , by

Y=\langle Z,\theta \rangle +\sum _{j=1}^{p}\int _{0}^{1}X_{j}^{c}(t)\beta _{j}(t)\,dt+\varepsilon ,

(4)

where $\theta \in \mathbb {R} ^{q}$ is the regression coefficient for $Z$ , the domain of $X_{j}$ is $[0,1]$ , $X_{j}^{c}$ is the centered functional covariate given by $X_{j}^{c}(t)=X_{j}(t)-\mu _{j}(t)$ , and $\beta _{j}$ is the regression coefficient function for $X_{j}^{c}$ , for $j=1,\ldots ,p$ . Models (3) and (4) have been studied extensively.^[60]^[61]^[62]

Functional regression models with functional response

For a functional response $Y(s)$ on $[0,1]$ and multiple functional covariates $X_{j}(t)$ , $t\in [0,1]$ , two major models have been considered.^[63]^[64] One of these two models is generally referred to as the functional linear model (FLM), which can be written as

Y(s)=\alpha _{0}(s)+\sum _{j=1}^{p}\int _{0}^{1}\alpha _{j}(s,t)X_{j}^{c}(t)\,dt+\varepsilon (s),\ {\text{for}}\ s\in [0,1]

(5)

where $\alpha _{0}(s)$ is the functional intercept and, for $j=1,\ldots ,p$ , $X_{j}^{c}(t)=X_{j}(t)-\mu _{j}(t)$ is a centered functional covariate on $[0,1]$ , $\alpha _{j}(s,t)$ is the corresponding functional slope defined on $[0,1]\times [0,1]$ , and $\varepsilon (s)$ is usually a random process with mean zero and finite variance.^[63] In this case, at any given time $s\in [0,1]$ , the value of $Y$ , i.e., $Y(s)$ , depends on the entire trajectories of $\{X_{j}(t)\}_{j=1}^{p}$ . Model (5) has also been studied extensively.^[65]^[66]^[67]^[68]^[69]

Function-on-scalar regression

In particular, taking $X_{j}(\cdot )$ as a constant function yields a special case of model (5) $Y(s)=\alpha _{0}(s)+\sum _{j=1}^{p}X_{j}\alpha _{j}(s)+\varepsilon (s),\ {\text{for}}\ s\in [0,1],$ which is a functional linear model with functional responses and scalar covariates.
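When the responses are observed on a common grid, the function-on-scalar model above can be fitted pointwise by ordinary least squares, since the same design matrix applies at every $s$ . The sketch below is a minimal illustration under that assumption and is not a method attributed to the cited references. The coefficient functions, sample size and noise level are hypothetical simulation choices.

```python
import numpy as np

rng = np.random.default_rng(4)
n, p, m = 200, 2, 40
s = np.linspace(0.0, 1.0, m)
alpha = np.vstack([np.cos(np.pi * s), s**2, np.exp(-s)])   # alpha_0, alpha_1, alpha_2

X = rng.normal(size=(n, p))                                 # scalar covariates X_1, X_2
design = np.column_stack([np.ones(n), X])
Y = design @ alpha + rng.normal(scale=0.2, size=(n, m))     # responses Y_i(s) on the grid

# One multi-output least-squares solve gives the pointwise OLS estimate for every s.
alpha_hat = np.linalg.lstsq(design, Y, rcond=None)[0]       # shape (p + 1, m)
print(np.max(np.abs(alpha_hat - alpha)))
```

Row `alpha_hat[0]` estimates the functional intercept and the remaining rows estimate the $\alpha _{j}(s)$ . Smoothing the rows across $s$ is a common refinement that is left out here.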

Concurrent regression models

The concurrent regression model is given by

Y(s)=\beta _{0}(s)+\sum _{j=1}^{p}\beta _{j}(s)X_{j}(s)+\varepsilon (s),\ {\text{for}}\ s\in [0,1],

(6)

where $X_{1},\ldots ,X_{p}$ are multiple functional covariates on $[0,1]$ , $\beta _{0},\beta _{1},\ldots ,\beta _{p}$ are the coefficient functions defined on the same interval and $\varepsilon (s)$ is usually assumed to be a random process with mean zero and finite variance.^[63] This model assumes that the value of $Y(s)$ depends only on the current value of $\{X_{j}(s)\}_{j=1}^{p}$ and not on the history $\{X_{j}(t):t\leq s\}_{j=1}^{p}$ or future values; hence it is a "concurrent regression model", which has also been referred to as a "varying-coefficient" model. Various estimation methods have been proposed.^[70]^[71]^[72]^[73]^[74]^[75]

Functional nonlinear regression models

Direct nonlinear extensions of the classical functional linear regression models (FLMs) still involve a linear predictor, but combine it with a nonlinear link function, analogous to the extension of the conventional linear model to the generalized linear model. Developments towards fully nonparametric regression models for functional data encounter problems such as the curse of dimensionality. In order to bypass the “curse” and the metric selection problem, one is motivated to consider nonlinear functional regression models that are subject to some structural constraints but do not overly restrict flexibility. One desires models that retain polynomial rates of convergence, while being more flexible than, say, functional linear models. Such models are particularly useful when diagnostics for the functional linear model indicate lack of fit, which is often encountered in real-life situations. In particular, functional polynomial models, functional single and multiple index models and functional additive models are three special cases of functional nonlinear regression models.

Functional polynomial regression models

Functional polynomial regression models may be viewed as a natural extension of the Functional Linear Models (FLMs) with scalar responses, analogous to extending linear regression model to polynomial regression model. For a scalar response $Y$ and a functional covariate $X(\cdot )$ with domain $[0,1]$ and the corresponding centered predictor processes $X^{c}$ , the simplest and the most prominent member in the family of functional polynomial regression models is the quadratic functional regression^[76] given as follows, $\mathbb {E} (Y|X)=\alpha +\int _{0}^{1}\beta (t)X^{c}(t)\,dt+\int _{0}^{1}\int _{0}^{1}\gamma (s,t)X^{c}(s)X^{c}(t)\,ds\,dt$ where $X^{c}(\cdot )=X(\cdot )-\mathbb {E} (X(\cdot ))$ is the centered functional covariate, $\alpha$ is a scalar coefficient, $\beta (\cdot )$ and $\gamma (\cdot ,\cdot )$ are coefficient functions with domains $[0,1]$ and $[0,1]\times [0,1]$ , respectively. In addition to the parameter function β that the above functional quadratic regression model shares with the FLM, it also features a parameter surface γ. By analogy to FLMs with scalar responses, estimation of functional polynomial models can be obtained through expanding both the centered covariate $X^{c}$ and the coefficient functions $\beta$ and $\gamma$ in an orthonormal basis.^[76]^[77]
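Expanding $X^{c}$ , $\beta$ and $\gamma$ in an orthonormal basis turns the quadratic functional regression into an ordinary regression on the basis coefficients and their pairwise products. The sketch below illustrates this under simplifying assumptions. It uses a fixed two-term sine basis, noise-free curves on an equally spaced grid, Riemann-sum integrals, and simulated curves with population mean zero so that $X^{c}=X$ . The helper `quadratic_design` and all simulation settings are hypothetical.

```python
import numpy as np
from itertools import combinations_with_replacement

def quadratic_design(A):
    """Columns: 1, A_k (linear terms), A_j * A_k for j <= k (quadratic terms)."""
    n, K = A.shape
    pairs = list(combinations_with_replacement(range(K), 2))
    quad = np.column_stack([A[:, j] * A[:, k] for j, k in pairs])
    return np.column_stack([np.ones(n), A, quad]), pairs

rng = np.random.default_rng(5)
n, m, K = 400, 101, 2
t = np.linspace(0.0, 1.0, m)
dt = t[1] - t[0]
phi = np.array([np.sqrt(2) * np.sin(k * np.pi * t) for k in (1, 2)])
X = rng.normal(size=(n, K)) @ phi            # mean-zero curves, so X^c = X here
beta_true = phi[0]
gamma_true = 0.5 * (np.outer(phi[0], phi[1]) + np.outer(phi[1], phi[0]))
y = (1.0 + X @ beta_true * dt
     + np.einsum("is,st,it->i", X, gamma_true, X) * dt**2
     + rng.normal(scale=0.1, size=n))

A = X @ phi.T * dt                           # basis coefficients of X^c
design, pairs = quadratic_design(A)
coef = np.linalg.lstsq(design, y, rcond=None)[0]
alpha_hat, b, g = coef[0], coef[1:K + 1], coef[K + 1:]

beta_hat = b @ phi                           # beta(t) on the grid
gamma_hat = np.zeros((m, m))                 # symmetric surface gamma(s, t) on the grid
for (j, k), g_jk in zip(pairs, g):
    gamma_hat += 0.5 * g_jk * (np.outer(phi[j], phi[k]) + np.outer(phi[k], phi[j]))
print(np.round(coef, 2))
```

Only the symmetric part of $\gamma$ is identifiable from the double integral, which is why the surface is rebuilt in symmetrized form.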

Functional single and multiple index models

A functional multiple index model is given below, with symbols having their usual meanings as described above, $\mathbb {E} (Y|X)=g\left(\int _{0}^{1}X^{c}(t)\beta _{1}(t)\,dt,\ldots ,\int _{0}^{1}X^{c}(t)\beta _{p}(t)\,dt\right)$. Here $g$ represents an (unknown) general smooth function defined on a p-dimensional domain. The case $p=1$ yields a functional single index model, while multiple index models correspond to the case $p>1$. However, for $p>1$ this model is problematic due to the curse of dimensionality: with $p>1$ and relatively small sample sizes, the estimator given by this model often has large variance.^[78]^[79]

Functional additive models (FAMs)

Let $X(t)=\sum _{k=1}^{\infty }x_{k}\phi _{k}(t)$ denote an expansion of a functional covariate $X$ with domain $[0,1]$ in an orthonormal basis $\{\phi _{k}\}_{k=1}^{\infty }$ .

A functional linear model with scalar responses [as shown in model (3)] can thus be written as $\mathbb {E} (Y|X)=\mathbb {E} (Y)+\sum _{k=1}^{\infty }\beta _{k}x_{k}.$ One form of FAMs is obtained by replacing the linear function of $x_{k}$ in the above expression (i.e., $\beta _{k}x_{k}$) by a general smooth function $f_{k}$, analogous to the extension of multiple linear regression models to additive models, and is expressed as $\mathbb {E} (Y|X)=\mathbb {E} (Y)+\sum _{k=1}^{\infty }f_{k}(x_{k}),$ where $f_{k}$ satisfies $\mathbb {E} (f_{k}(x_{k}))=0$ for $k\in \mathbb {N}$.^[80]^[64] This constraint on the general smooth functions $f_{k}$ ensures identifiability in the sense that the estimates of these additive component functions do not interfere with that of the intercept term $\mathbb {E} (Y)$.

Another form of FAM is the continuously additive model^[81], expressed as $\mathbb {E} (Y|X)=\mathbb {E} (Y)+\int _{0}^{1}g(t,X(t))\,dt$ for a bivariate smooth (i.e. twice differentiable) additive surface $g:[0,1]\times \mathbb {R} \longrightarrow \mathbb {R}$, which is required to satisfy $\mathbb {E} [g(t,X(t))]=0$ for all $t\in [0,1]$ in order to ensure identifiability.

Generalized functional linear model

An obvious and direct extension of FLMs with scalar responses [shown in model (3)] is to add a link function so as to create a generalized functional linear model (GFLM)^[82], by analogy to extending the linear model to the generalized linear model (GLM). Its three components are:

Linear predictor $\eta =\beta _{0}+\int _{0}^{1}X^{c}(t)\beta (t)\,dt$ ; [systematic component]
Variance function ${\text{Var}}(Y|X)=V(\mu )$ , where $\mu =\mathbb {E} (Y|X)$ is the conditional mean; [random component]
Link function $g$ connecting the conditional mean $\mu$ and the linear predictor $\eta$ through $\mu =g(\eta )$ . [systematic component]
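As an illustration of how the three components fit together, the following Python sketch fits a functional logistic regression, one particular GFLM, by Fisher scoring after truncating a basis expansion. It is a minimal sketch under assumptions that are not taken from the source: curves on a common equally spaced grid, a cosine basis truncated at K functions, the logistic choice $g(\eta )=1/(1+e^{-\eta })$ with Bernoulli variance function $V(\mu )=\mu (1-\mu )$, and simulated data. The function names are hypothetical.

```python
import numpy as np

def cosine_basis(t, K):
    """First K functions of the orthonormal cosine basis on [0, 1] (assumed basis)."""
    rows = [np.ones_like(t)]
    rows += [np.sqrt(2.0) * np.cos(k * np.pi * t) for k in range(1, K)]
    return np.vstack(rows)

def fit_functional_logistic(X, y, t, K, n_iter=25):
    """Fisher scoring for a logistic GFLM truncated at K basis functions."""
    w = np.full(t.size, t[1] - t[0])
    w[[0, -1]] *= 0.5                            # trapezoid quadrature weights
    Phi = cosine_basis(t, K)
    A = ((X - X.mean(axis=0)) * w) @ Phi.T       # scores of the centered curves
    D = np.column_stack([np.ones(len(y)), A])    # design for the linear predictor
    b = np.zeros(K + 1)
    for _ in range(n_iter):
        mu = 1.0 / (1.0 + np.exp(-(D @ b)))      # mu = g(eta)
        V = mu * (1.0 - mu)                      # variance function V(mu)
        # Scoring step: solve (D' V D) delta = D' (y - mu).
        b = b + np.linalg.solve(D.T @ (D * V[:, None]), D.T @ (y - mu))
    return b[0], b[1:] @ Phi                     # intercept and beta(t) on the grid

# Simulated example; all numbers are arbitrary illustrative choices.
rng = np.random.default_rng(1)
t = np.linspace(0.0, 1.0, 101)
K = 3
Phi = cosine_basis(t, K)
x = rng.normal(size=(500, K))
X = x @ Phi
beta_true = 2.0 * Phi[1]                         # true coefficient function
eta = 0.5 + 2.0 * x[:, 1]                        # beta_0 + integral of X^c * beta
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-eta))).astype(float)

beta0_hat, beta_hat = fit_functional_logistic(X, y, t, K)
corr = np.corrcoef(beta_hat, beta_true)[0, 1]
```

The sketch uses no step-size control or penalty, so it can fail when the classes are perfectly separable. The truncation level K plays the role of a tuning parameter.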

Clustering and classification of functional data

For vector-valued multivariate data, k-means partitioning methods and hierarchical clustering are the two main approaches. These classical clustering concepts have been extended to functional data. For clustering of functional data, k-means clustering methods are more popular than hierarchical clustering methods. For k-means clustering on functional data, mean functions are usually regarded as the cluster centers. Covariance structures have also been taken into consideration.^[83] Besides k-means type clustering, clustering^[84] based on mixture models is also widely used for vector-valued multivariate data and has been extended to functional data clustering.^[85]^[86]^[87]^[37]^[88] Furthermore, Bayesian hierarchical clustering also plays an important role in the development of model-based functional clustering.^[89]^[90]^[91]^[92]

Functional classification assigns a group membership to a new data object based on either functional regression or functional discriminant analysis. Functional data classification methods based on functional regression models use class labels as responses and the observed functional data and other covariates as predictors. For regression-based functional classification models, functional generalized linear models or, more specifically, functional binary regression, such as functional logistic regression for binary responses, are commonly used classification approaches. More generally, the generalized functional linear regression model based on the FPCA approach is used.^[93] Functional Linear Discriminant Analysis (FLDA) has also been considered as a classification method for functional data.^[94]^[95]^[96]^[97]^[98] Functional data classification involving density ratios has also been proposed.^[21] A study of the asymptotic behavior of the proposed classifiers in the large sample limit shows that under certain conditions the misclassification rate converges to zero, a phenomenon that has been referred to as "perfect classification".
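The k-means idea for functional data, with mean functions as cluster centers, can be sketched in Python as follows. This is a minimal illustration, not one of the cited methods: it assumes curves observed without missing values on a common equally spaced grid, approximates the squared $L^{2}$ distance by a Riemann sum, and uses a simple farthest-point initialisation chosen here only to keep the example deterministic. The function name `functional_kmeans` is hypothetical.

```python
import numpy as np

def functional_kmeans(X, t, n_clusters, n_iter=100):
    """Lloyd-type k-means for curves on a common grid; centers are mean functions."""
    dt = t[1] - t[0]

    def sq_dist(curves, centers):
        # Approximate squared L2 distances between every curve and every center.
        diff = curves[:, None, :] - centers[None, :, :]
        return (diff ** 2).sum(axis=2) * dt

    # Farthest-point initialisation (an illustrative choice, not from the source).
    centers = X[[0]].copy()
    while len(centers) < n_clusters:
        far = sq_dist(X, centers).min(axis=1).argmax()
        centers = np.vstack([centers, X[far]])

    for _ in range(n_iter):
        labels = sq_dist(X, centers).argmin(axis=1)      # assignment step
        updated = np.vstack([
            X[labels == c].mean(axis=0) if np.any(labels == c) else centers[c]
            for c in range(n_clusters)
        ])                                               # update step: mean functions
        if np.allclose(updated, centers):
            break
        centers = updated
    return labels, centers

# Simulated example: two groups of noisy curves (arbitrary illustrative choices).
rng = np.random.default_rng(2)
t = np.linspace(0.0, 1.0, 101)
group_a = np.sin(2 * np.pi * t) + 0.2 * rng.normal(size=(30, t.size))
group_b = np.cos(2 * np.pi * t) + 0.2 * rng.normal(size=(30, t.size))
X = np.vstack([group_a, group_b])

labels, centers = functional_kmeans(X, t, n_clusters=2)
```

With sparsely or irregularly observed curves the distance computation above no longer applies directly, and the curves would first need to be smoothed or represented by basis coefficients.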

Time warping

Motivations

In addition to amplitude variation^[6], time variation may also be assumed to be present in functional data. Time variation occurs when the subject-specific timing of certain events of interest varies among subjects. One classical example is the Berkeley Growth Study data^[99], where the amplitude variation is the growth rate and the time variation explains the difference in children's biological age at which the pubertal and the pre-pubertal growth spurts occurred. In the presence of time variation, the cross-sectional mean function may not be an efficient estimate, as peaks and troughs are located randomly and thus meaningful signals may be distorted or hidden.

Time warping, also known as curve registration^[100], curve alignment or time synchronization, aims to identify and separate amplitude variation and time variation. If both components are assumed to exist, then the functional data are viewed as the result of an underlying template function being "warped" nonlinearly in time by a smooth random process, known as the time warping function. Let $Y_{i}$ denote the observed functions with both phase and amplitude variability and $X_{i}$ the underlying functions where only the amplitude variation is present. The time warping function $h_{i}$ removes the time variation such that $Y_{i}(t)=X_{i}[h_{i}^{-1}(t)],\ t\in [0,1]$, where the $h_{i}$ are realizations of an underlying time warping function that transforms the subject-specific time to the time scale of the template. The time warping functions $h_{i}$ are assumed to be invertible and to equal the identity on average, $\mathbb {E} (h(t))=t$. In most cases, time is assumed to flow forward, so the $h_{i}$ are assumed to be strictly monotonically increasing. One exception, where time can flow backward, has been presented in the context of modeling declines in house prices as time reversals^[101].

The simplest case of a family of warping functions to specify phase variation is the linear transformation, that is, $h(t)=\delta +\gamma t$, which warps the time of an underlying template function by a subject-specific shift and scale. A more general class of warping functions includes diffeomorphisms of the domain to itself, that is, loosely speaking, a class of invertible functions that map the compact domain to itself such that both the function and its inverse are smooth. The set of linear transformations is contained in the set of diffeomorphisms^[102]. One challenge in time warping is the identifiability of amplitude and phase variation. Breaking this non-identifiability requires specific assumptions.
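The relation $Y(t)=X[h^{-1}(t)]$ can be made concrete with a short Python sketch. It is only an illustration under assumptions that are not part of the source: the template is an arbitrary bump, the warping function comes from a hypothetical one-parameter family of strictly increasing maps of $[0,1]$ onto itself, and function composition and inversion are approximated by linear interpolation on a grid.

```python
import numpy as np

t = np.linspace(0.0, 1.0, 201)

def warp(s, a):
    """Strictly increasing map of [0, 1] onto itself (hypothetical family, a != 0)."""
    return np.expm1(a * s) / np.expm1(a)

template = np.exp(-((t - 0.5) / 0.1) ** 2)   # X(t): a bump peaking at t = 0.5
a = 1.5                                      # arbitrary warping parameter
h = warp(t, a)                               # h evaluated on the grid
h_inv = np.interp(t, h, t)                   # inverse of h, obtained by swapping axes
Y = np.interp(h_inv, t, template)            # warped curve Y(t) = X(h^{-1}(t))

# Registration with the known warp: Y(h(s)) recovers X(s) up to interpolation error.
X_back = np.interp(h, t, Y)
max_err = np.max(np.abs(X_back - template))
peak_Y = t[np.argmax(Y)]                     # the peak moves from 0.5 to h(0.5)
```

In practice $h$ is unknown and has to be estimated from the sample of curves, which is where the identifiability assumptions mentioned above come in.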

Methods

Earlier approaches include dynamic time warping (DTW) for applications such as speech recognition^[103]. Another traditional method for time warping is landmark registration^[104]^[105], which aligns special features such as peak locations to an average location. Other relevant warping methods include pairwise warping^[106], registration using ${\mathcal {L}}^{2}$ distance^[102] and elastic warping^[107].

Dynamic time warping

The template function is determined through an iterative process: starting from the cross-sectional mean, one performs registration and recalculates the cross-sectional mean of the warped curves, expecting convergence after a few iterations. DTW minimizes a cost function through a dynamic programming algorithm. Problems of warps that are not smoothly differentiable, or of greedy computation, in DTW can be resolved by adding a regularization term to the cost function.
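The dynamic programming step of DTW can be sketched in Python as follows. This is the basic textbook recursion for two discretely sampled curves, with squared differences as the local cost (an assumption made here) and without the regularization term mentioned above. The function name `dtw` is chosen for illustration.

```python
import numpy as np

def dtw(x, y):
    """Basic DTW: minimal cumulative squared difference and one optimal alignment path."""
    n, m = len(x), len(y)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (x[i - 1] - y[j - 1]) ** 2               # local cost
            # Best of the three admissible predecessors.
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])

    # Backtrack from the end to recover the alignment as index pairs.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        step = np.argmin([cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1]])
        if step == 0:
            i, j = i - 1, j - 1
        elif step == 1:
            i -= 1
        else:
            j -= 1
    return cost[n, m], path[::-1]

# y repeats the first value of x, so the two sequences align at zero cost.
x = np.array([0.0, 1.0, 2.0, 1.0, 0.0])
y = np.array([0.0, 0.0, 1.0, 2.0, 1.0, 0.0])
distance, path = dtw(x, y)
```

The resulting path is a step function in time, which is one way to see why the unregularized warps are not smooth.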

Landmark registration

Landmark registration (or feature alignment) assumes that well-expressed features are present in all sample curves and uses the locations of such features as a gold standard. Special features, such as peak or valley locations in the observed sample functions or their derivatives, are aligned to their average locations on the template function^[102]. The warping function is then introduced through a smooth transformation from the average location to the subject-specific locations. A problem of landmark registration is that the features may be missing or hard to identify due to noise in the data.
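A minimal Python sketch of landmark registration with a single landmark is given below. It makes simplifying assumptions that are not in the source: the only landmark is the location of the global maximum, the curves are noise-free and observed on a common grid, and the warping function is piecewise linear rather than smooth. The function name `landmark_register` is hypothetical.

```python
import numpy as np

def landmark_register(Y, t):
    """Align the peak of every curve to the average peak location."""
    locs = t[np.argmax(Y, axis=1)]          # subject-specific landmark locations
    target = locs.mean()                    # average location on the template scale
    registered = np.empty_like(Y)
    for i, (y, loc) in enumerate(zip(Y, locs)):
        # Piecewise-linear warp fixing the endpoints and mapping target to loc.
        h = np.interp(t, [t[0], target, t[-1]], [t[0], loc, t[-1]])
        registered[i] = np.interp(h, t, y)  # registered curve s -> Y_i(h_i(s))
    return registered, locs, target

# Simulated curves: identical bumps with shifted peak locations (arbitrary choices).
t = np.linspace(0.0, 1.0, 201)
centers = np.array([0.3, 0.4, 0.5, 0.6, 0.7])
Y = np.exp(-((t[None, :] - centers[:, None]) / 0.08) ** 2)

registered, locs, target = landmark_register(Y, t)
new_locs = t[np.argmax(registered, axis=1)]
```

With noisy curves the `argmax` step would typically be replaced by landmark detection on smoothed curves, and with several landmarks the warp would interpolate all of them.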

Extensions

So far we have considered a scalar-valued stochastic process, $\{X(t)\}_{t\in {\mathcal {T}}}$, defined on a one-dimensional time domain.

Extension 1: The dimension of time domain

The time domain of $X(t)$ can be extended from one dimension to multiple dimensions.

Extension 2: The range of the stochastic process

The range set of the stochastic process may be extended from scalar values to vector values and further to nonlinear manifolds^[108] and then to Hilbert spaces^[109] and eventually to metric spaces^[110].

R packages

While the presentation of the models above assumes fully observed functions, software for fitting the models with discretely observed functions is available, for example in R.

References

^ Cardot, H; Ferraty, F; Sarda, P. (2003). "Spline estimators for the functional linear model". Statistica Sinica. 13 (3): 571–591.
^ Hilgert, N; Mas, A; Verzelen, N. (2013). "Minimax adaptive tests for the functional linear model". Annals of Statistics. 41: 838–869.
^ Kong, D; Xue, K; Yao, F; Zhang, HH. (2016). "Partially functional linear regression in high dimensions". Biometrika. 103 (1): 147–159.
^ Hu, Z; Wang, N; Carroll, RJ. (2004). "Profile‐kernel versus backfitting in the partially linear models for longitudinal/clustered data". Biometrika. 91 (2): 251–262.
^ Horváth, L; Kokoszka, P. (2012). Inference for functional data with applications. Springer Series in Statistics. Springer-Verlag.
^ Wang, JL; Chiou, JM; Müller, HG. (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3 (1): 257–295.
^ Ramsay, J; Silverman, BW. (2005). Functional Data Analysis, 2nd ed. Springer.
^ Ramsay, JO; Dalzell, CJ. (1991). "Some tools for functional data analysis". Journal of the Royal Statistical Society: Series B (Methodological). 53 (3): 539–561.
^ Malfait, N; Ramsay, JO. (2003). "The historical functional linear model". The Canadian Journal of Statistics. 31 (2): 115–128.
^ He, G; Müller, HG; Wang, JL. (2003). "Functional canonical analysis for square integrable stochastic processes". Journal of Multivariate Analysis. 85 (1): 54–77.
^ Yao, F; Müller, HG; Wang, JL. (2005). "Functional data analysis for sparse longitudinal data". Journal of the American Statistical Association. 100 (470): 577–590.
^ He, G; Müller, HG; Wang, JL; Yang, WJ. (2010). "Functional linear regression via canonical analysis". Bernoulli. 16 (3): 705–729.
^ Fan, J; Zhang, W. (1999). "Statistical estimation in varying coefficient models". The Annals of Statistics. 27 (5): 1491–1518.
^ Wu, CO; Yu, KF. (2002). "Nonparametric varying-coefficient models for the analysis of longitudinal data". International Statistical Review. 70 (3): 373–393.
^ Huang, JZ; Wu, CO; Zhou, L. (2002). "Varying-coefficient models and basis function approximations for the analysis of repeated measurements". Biometrika. 89 (1): 111–128.
^ Huang, JZ; Wu, CO; Zhou, L. (2004). "Polynomial spline estimation and inference for varying coefficient models with longitudinal data". Statistica Sinica. 14 (3): 763–788.
^ Şentürk, D; Müller, HG. (2010). "Functional varying coefficient models for longitudinal data". Journal of the American Statistical Association. 105 (491): 1256–1264.
^ Eggermont, PPB; Eubank, RL; LaRiccia, VN. (2010). "Convergence rates for smoothing spline estimators in varying coefficient models". Journal of Statistical Planning and Inference. 140 (2): 369–381.
^ Abraham, C; Cornillon, PA; Matzner‐Løber, E; Molinari, N. (2003). "Unsupervised curve clustering using B-splines". Scandinavian Journal of Statistics. 30 (3): 581–595.
^ Serban, N; Wasserman, L. (2005). "CATS: Clustering after transformation and smoothing". Journal of the American Statistical Association. 100 (471): 990–999.
^ Coffey, N; Hinde, J; Holian, E. (2014). "Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data". Computational Statistics & Data Analysis. 71 (C): 14–29.
^ Kayano, M; Dozono, K; Konishi, S. (2010). "Functional cluster analysis via orthonormalized Gaussian basis expansions and Its application". Journal of Classification. 27 (2): 211–230.
^ Giacofci, M; Lambert‐Lacroix, S; Marot, G; Picard, F. (2013). "Wavelet-based clustering for mixed-effects functional models in high dimension". Biometrics. 69 (1): 31–40.
^ Peng, J; Müller, HG. (2008). "Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions". The Annals of Applied Statistics. 2 (3): 1056–1077.
^ Chiou, JM; Li, PL. (2007). "Functional clustering and identifying substructures of longitudinal data". Journal of the Royal Statistical Society: Series B (Statistical Methodology). 69 (4): 679–699.
^ Banfield, JD; Raftery, AE. (1993). "Model-based Gaussian and non-Gaussian clustering". Biometrics. 49 (3): 803–821.
^ James, GM; Sugar, CA. (2003). "Clustering for sparsely sampled functional data". Journal of the American Statistical Association. 98 (462): 397–408.
^ Jacques, J; Preda, C. (2013). "Funclust: A curves clustering method using functional random variables density approximation". Neurocomputing. 112: 164–171.
^ Jacques, J; Preda, C. (2014). "Model-based clustering for multivariate functional data". Computational Statistics & Data Analysis. 71 (C): 92–106.
^ Heinzl, F; Tutz, G. (2014). "Clustering in linear-mixed models with a group fused lasso penalty". Biometrical Journal. 56 (1): 44–68.
^ Angelini, C; Canditiis, DD; Pensky, M. (2012). "Clustering time-course microarray data using functional Bayesian infinite mixture model". Journal of Applied Statistics. 39 (1): 129–149.
^ Rodríguez, A; Dunson, DB; Gelfand, AE. (2009). "Bayesian nonparametric functional data analysis through density estimation". Biometrika. 96 (1): 149–162.
^ Petrone, S; Guindani, M; Gelfand, AE. (2009). "Hybrid Dirichlet mixture models for functional data". Journal of the Royal Statistical Society. 71 (4): 755–782.
^ Heinzl, F; Tutz, G. (2013). "Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm". Statistical Modelling. 13 (1): 41–67.
^ Abraham, C; Cornillon, PA; Matzner‐Løber, E; Molinari, N. (2003). "Unsupervised curve clustering using B-splines". Scandinavian Journal of Statistics. 30 (3): 581–595.
^ Serban, N; Wasserman, L. (2005). "CATS: Clustering after transformation and smoothing". Journal of the American Statistical Association. 100 (471): 990–999.
^ Coffey, N; Hinde, J; Holian, E. (2014). "Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data". Computational Statistics & Data Analysis. 71 (C): 14–29.
^ Kayano, M; Dozono, K; Konishi, S. (2010). "Functional cluster analysis via orthonormalized Gaussian basis expansions and Its application". Journal of Classification. 27 (2): 211–230.
^ Giacofci, M; Lambert‐Lacroix, S; Marot, G; Picard, F. (2013). "Wavelet-based clustering for mixed-effects functional models in high dimension". Biometrics. 69 (1): 31–40.
^ Peng, J; Müller, HG. (2008). "Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions". The Annals of Applied Statistics. 2 (3): 1056–1077.
^ Chiou, JM; Li, PL. (2007). "Functional clustering and identifying substructures of longitudinal data". Journal of the Royal Statistical Society: Series B (Statistical Methodology). 69 (4): 679–699.
^ Banfield, JD; Raftery, AE. (1993). "Model-based Gaussian and non-Gaussian clustering". Biometrics. 49 (3): 803–821.
^ James, GM; Sugar, CA. (2003). "Clustering for sparsely sampled functional data". Journal of the American Statistical Association. 98 (462): 397–408.
^ Jacques, J; Preda, C. (2013). "Funclust: A curves clustering method using functional random variables density approximation". Neurocomputing. 112: 164–171.
^ Jacques, J; Preda, C. (2014). "Model-based clustering for multivariate functional data". Computational Statistics & Data Analysis. 71 (C): 92–106.
^ Heinzl, F; Tutz, G. (2014). "Clustering in linear-mixed models with a group fused lasso penalty". Biometrical Journal. 56 (1): 44–68.
^ Angelini, C; Canditiis, DD; Pensky, M. (2012). "Clustering time-course microarray data using functional Bayesian infinite mixture model". Journal of Applied Statistics. 39 (1): 129–149.
^ Rodríguez, A; Dunson, DB; Gelfand, AE. (2009). "Bayesian nonparametric functional data analysis through density estimation". Biometrika. 96 (1): 149–162.
^ Petrone, S; Guindani, M; Gelfand, AE. (2009). "Hybrid Dirichlet mixture models for functional data". Journal of the Royal Statistical Society. 71 (4): 755–782.
^ Heinzl, F; Tutz, G. (2013). "Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm". Statistical Modelling. 13 (1): 41–67.
^ Grenander, U. (1950). "Stochastic processes and statistical inference". Arkiv för Matematik. 1 (3): 195–277.
^ Rice, JA; Silverman, BW. (1991). "Estimating the mean and covariance structure nonparametrically when the data are curves". Journal of the Royal Statistical Society. 53 (1): 233–243.
^ Müller, HG. (2016). "Peter Hall, functional data analysis and random objects". Annals of Statistics. 44 (5): 1867–1887.
^ Karhunen, K (1946). Zur Spektraltheorie stochastischer Prozesse. Annales Academiae scientiarum Fennicae.
^ Kleffe, J. (1973). "Principal components of random variables with values in a seperable hilbert space". Mathematische Operationsforschung und Statistik. 4 (5): 391–406.
^ Dauxois, J; Pousse, A; Romain, Y. (1982). "Asymptotic theory for the principal component analysis of a vector random function: Some applications to statistical inference". Journal of Multivariate Analysis. 12 (1): 136–154.
^ Ramsay, J; Silverman, BW. (2005). Functional Data Analysis, 2nd ed. Springer.
^ Hsing, T; Eubank, R (2015). Theoretical Foundations of Functional Data Analysis, with an Introduction to Linear Operators. Wiley Series in Probability and Statistics.
^ Shi, M; Weiss, RE; Taylor, JMG. (1996). "An analysis of paediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves". Journal of the Royal Statistical Society. Series C (Applied Statistics). 45 (2): 151–163.
^ Hilgert, N; Mas, A; Verzelen, N. (2013). "Minimax adaptive tests for the functional linear model". Annals of Statistics. 41: 838–869.
^ Kong, D; Xue, K; Yao, F; Zhang, HH. (2016). "Partially functional linear regression in high dimensions". Biometrika. 103 (1): 147–159.
^ Horváth, L; Kokoszka, P. (2012). Inference for functional data with applications. Springer Series in Statistics. Springer-Verlag.
^ Wang, JL; Chiou, JM; Müller, HG. (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3 (1): 257–295.
^ Ramsay, J; Silverman, BW. (2005). Functional Data Analysis, 2nd ed. Springer.
^ Ramsay, JO; Dalzell, CJ. (1991). "Some tools for functional data analysis". Journal of the Royal Statistical Society: Series B (Methodological). 53 (3): 539–561.
^ Malfait, N; Ramsay, JO. (2003). "The historical functional linear model". The Canadian Journal of Statistics. 31 (2): 115–128.
^ He, G; Müller, HG; Wang, JL. (2003). "Functional canonical analysis for square integrable stochastic processes". Journal of Multivariate Analysis. 85 (1): 54–77.
^ Yao, F; Müller, HG; Wang, JL. (2005). "Functional data analysis for sparse longitudinal data". Journal of the American Statistical Association. 100 (470): 577–590.
^ He, G; Müller, HG; Wang, JL; Yang, WJ. (2010). "Functional linear regression via canonical analysis". Bernoulli. 16 (3): 705–729.
^ Fan, J; Zhang, W. (1999). "Statistical estimation in varying coefficient models". The Annals of Statistics. 27 (5): 1491–1518.
^ Wu, CO; Yu, KF. (2002). "Nonparametric varying-coefficient models for the analysis of longitudinal data". International Statistical Review. 70 (3): 373–393.
^ Huang, JZ; Wu, CO; Zhou, L. (2002). "Varying-coefficient models and basis function approximations for the analysis of repeated measurements". Biometrika. 89 (1): 111–128.
^ Huang, JZ; Wu, CO; Zhou, L. (2004). "Polynomial spline estimation and inference for varying coefficient models with longitudinal data". Statistica Sinica. 14 (3): 763–788.
^ Şentürk, D; Müller, HG. (2010). "Functional varying coefficient models for longitudinal data". Journal of the American Statistical Association. 105 (491): 1256–1264.
^ Eggermont, PPB; Eubank, RL; LaRiccia, VN. (2010). "Convergence rates for smoothing spline estimators in varying coefficient models". Journal of Statistical Planning and Inference. 140 (2): 369–381.
^ Yao, F; Müller, HG. (2010). "Functional quadratic regression". Biometrika. 97 (1): 49–64.
^ Horváth, L; Reeder, R. (2013). "A test of significance in functional quadratic regression". Bernoulli. 19 (5A): 2120–2151.
^ Chen, D; Hall, P; Müller, HG. (2011). "Single and multiple index functional regression models with nonparametric link". The Annals of Statistics. 39 (3): 1720–1747.
^ Jiang, CR; Wang, JL. (2011). "Functional single index models for longitudinal data". The Annals of Statistics. 39 (1): 362–388.
^ Wang, JL; Chiou, JM; Müller, HG. (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3 (1): 257–295.
^ Müller, HG; Wu, Y; Yao, F. (2013). "Continuously additive models for nonlinear functional regression". Biometrika. 100 (3): 607–622.
^ Müller, HG; Stadtmüller, U. (2005). "Generalized Functional Linear Models". The Annals of Statistics. 33 (2): 774–805.
^ Chiou, JM; Li, PL. (2007). "Functional clustering and identifying substructures of longitudinal data". Journal of the Royal Statistical Society: Series B (Statistical Methodology). 69 (4): 679–699.
^ Banfield, JD; Raftery, AE. (1993). "Model-based Gaussian and non-Gaussian clustering". Biometrics. 49 (3): 803–821.
^ James, GM; Sugar, CA. (2003). "Clustering for sparsely sampled functional data". Journal of the American Statistical Association. 98 (462): 397–408.
^ Jacques, J; Preda, C. (2013). "Funclust: A curves clustering method using functional random variables density approximation". Neurocomputing. 112: 164–171.
^ Jacques, J; Preda, C. (2014). "Model-based clustering for multivariate functional data". Computational Statistics & Data Analysis. 71 (C): 92–106.
^ Heinzl, F; Tutz, G. (2014). "Clustering in linear-mixed models with a group fused lasso penalty". Biometrical Journal. 56 (1): 44–68.
^ Angelini, C; Canditiis, DD; Pensky, M. (2012). "Clustering time-course microarray data using functional Bayesian infinite mixture model". Journal of Applied Statistics. 39 (1): 129–149.
^ Rodríguez, A; Dunson, DB; Gelfand, AE. (2009). "Bayesian nonparametric functional data analysis through density estimation". Biometrika. 96 (1): 149–162.
^ Petrone, S; Guindani, M; Gelfand, AE. (2009). "Hybrid Dirichlet mixture models for functional data". Journal of the Royal Statistical Society. 71 (4): 755–782.
^ Heinzl, F; Tutz, G. (2013). "Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm". Statistical Modelling. 13 (1): 41–67.
^ Leng, X; Müller, HG. (2006). "Classification using functional data analysis for temporal gene expression data". Bioinformatics. 22 (1): 68–76.
^ James, GM; Hastie, TJ. (2001). "Functional linear discriminant analysis for irregularly sampled curves". Journal of the Royal Statistical Society. 63 (3): 533–550.
^ Hall, P; Poskitt, DS; Presnell, B. (2001). "A Functional Data—Analytic Approach to Signal Discrimination". Technometrics. 43 (1): 1–9.
^ Ferraty, F; Vieu, P. (2003). "Curves discrimination: a nonparametric functional approach". Computational Statistics & Data Analysis. 44 (1–2): 161–173.
^ Chang, C; Chen, Y; Ogden, RT. (2014). "Functional data classification: a wavelet approach". Computational Statistics. 29 (6): 1497–1513.
^ Zhu, H; Brown, PJ; Morris, JS. (2012). "Robust Classification of Functional and Quantitative Image Data Using Functional Mixed Models". Biometrics. 68 (4): 1260–1268.
^ Gasser, T; Müller, HG; Kohler, W; Molinari, L; Prader, A. (1984). "Nonparametric regression analysis of growth curves". The Annals of Statistics. 12 (1): 210–229.
^ Ramsay, JO; Li, X. (1998). "Curve registration". Journal of the Royal Statistical Society: Series B. 60 (2): 351–363.
^ Peng, J; Paul, D; Müller, HG. (2014). "Time-warped growth processes, with applications to the modeling of boom-bust cycles in house prices". The Annals of Applied Statistics. 8 (3): 1561–1582.
^ Marron, JS; Ramsay, JO; Sangalli, LM; Srivastava, A. (2015). "Functional data analysis of amplitude and phase variation". Statistical Science. 30 (4): 468–484.
^ Sakoe, H; Chiba, S. (1978). "Dynamic programming algorithm optimization for spoken word recognition". IEEE Transactions on Acoustics, Speech, and Signal Processing. 26: 43–49.
^ Kneip, A; Gasser, T (1992). "Statistical tools to analyze data representing a sample of curves". Annals of Statistics. 20: 1266–1305.
^ Gasser, T; Kneip, A (1995). "Searching for structure in curve sample". Journal of the American Statistical Association. 90 (432): 1179–1188.
^ Tang, R; Müller, HG. (2008). "Pairwise curve synchronization for functional data". Biometrika. 95: 875–889.
^ Anirudh, R; Turaga, P; Su, J; Srivastava, A. (2015). "Elastic functional coding of human actions: From vector-fields to latent variables". Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition: 3147–3155.
^ Dai, X; Müller, HG (2018). "Principal component analysis for functional data on Riemannian manifolds and spheres". The Annals of Statistics. 46 (6B): 3334–3361.
^ Chen, K; Delicado, P; Müller, HG (2017). "Modelling function-valued stochastic processes, with applications to fertility dynamics". Journal of the Royal Statistical Society. Series B (Statistical Methodology). 79 (1): 177–196.
^ Dubey, P; Müller, HG (2021). "Modeling Time-Varying Random Objects and Dynamic Networks". Journal of the American Statistical Association. 0 (0): 1–16.
^ Yao, F; Müller, HG; Wang, JL. (2005). "Functional data analysis for sparse longitudinal data". Journal of the American Statistical Association. 100 (470): 577–590.

Category:Regression analysis

[1] Cardot, H; Ferraty, F; Sarda, P. (2003). "Spline estimators for the functional linear model". Statistica Sinica. 13 (3): 571–591.

[2] Hilgert, N; Mas, A; Verzelen, N. (2013). "Minimax adaptive tests for the functional linear model". Annals of Statistics. 41: 838–869.

[3] Kong, D; Xue, K; Yao, F; Zhang, HH. (2016). "Partially functional linear regression in high dimensions". Biometrika. 103 (1): 147–159.

[4] Hu, Z; Wang, N; Carroll, RJ. (2004). "Profile‐kernel versus backfitting in the partially linear models for longitudinal/clustered data". Biometrika. 91 (2): 251–262.

[5] Horváth, L; Kokoszka, P. (2012). Inference for functional data with applications. Springer Series in Statistics. Springer-Verlag.

[wang:16-6] Wang, JL; Chiou, JM; Müller, HG. (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3 (1): 257–295. Cite error: The named reference "wang:16" was defined multiple times with different content (see the help page).

[7] Ramsay, J; Silverman, BW. (2005). Functional Data Analysis, 2nd ed. Springer.

[8] Ramsay, JO; Dalzell, CJ. (1991). "Some tools for functional data analysis". Journal of the Royal Statistical Society: Series B (Methodological). 53 (3): 539–561.

[9] Malfait, N; Ramsay, JO. (2003). "The historical functional linear model". The Canadian Journal of Statistics. 31 (2): 115–128.

[10] He, G; Müller, HG; Wang, JL. (2003). "Functional canonical analysis for square integrable stochastic processes". Journal of Multivariate Analysis. 85 (1): 54–77.

[11] Yao, F; Müller, HG; Wang, JL. (2005). "Functional data analysis for sparse longitudinal data". Journal of the American Statistical Association. 100 (470): 577–590.

[12] He, G; Müller, HG; Wang, JL; Yang, WJ. (2010). "Functional linear regression via canonical analysis". Journal of Multivariate Analysis. 16 (3): 705–729.

[13] Fan, J; Zhang, W. (1999). "Statistical estimation in varying coefficient models". The Annals of Statistics. 27 (5): 1491–1518.

[14] Wu, CO; Yu, KF. (2002). "Nonparametric varying-coefficient models for the analysis of longitudinal data". International Statistical Review. 70 (3): 373–393.

[15] Huang, JZ; Wu, CO; Zhou, L. (2002). "Varying-coefficient models and basis function approximations for the analysis of repeated measurements". Biometrika. 89 (1): 111–128.

[16] Huang, JZ; Wu, CO; Zhou, L. (2004). "Polynomial spline estimation and inference for varying coefficient models with longitudinal data". Statistica Sinica. 14 (3): 763–788.

[17] Şentürk, D; Müller, HG. (2010). "Functional varying coefficient models for longitudinal data". Journal of the American Statistical Association. 105 (491): 1256–1264.

[18] Eggermont, PPB; Eubank, RL; LaRiccia, VN. (2010). "Convergence rates for smoothing spline estimators in varying coefficient models". Journal of Statistical Planning and Inference. 140 (2): 369–381.

[19] Abraham, C; Cornillon, PA; Matzner‐Løber, E; Molinari, N. (2003). "Unsupervised curve clustering using B-splines". Scandinavian Journal of Statistics. 30 (3): 581–595.

[20] Serban, N; Wasserman, L. (2005). "CATS: Clustering after transformation and smoothing". Journal of the American Statistical Association. 100 (471): 990–999.

[:0-21] Coffey, N; Hinde, J; Holian, E. (2014). "Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data". Computational Statistics & Data Analysis. 71 (C): 14–29. Cite error: The named reference ":0" was defined multiple times with different content (see the help page).

[22] Kayano, M; Dozono, K; Konishi, S. (2010). "Functional cluster analysis via orthonormalized Gaussian basis expansions and its application". Journal of Classification. 27 (2): 211–230.

[23] Giacofci, M; Lambert‐Lacroix, S; Marot, G; Picard, F. (2013). "Wavelet-based clustering for mixed-effects functional models in high dimension". Biometrics. 69 (1): 31–40.

[24] Peng, J; Müller, HG. (2008). "Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions". The Annals of Applied Statistics. 2 (3): 1056–1077.

[25] Chiou, JM; Li, PL. (2007). "Functional clustering and identifying substructures of longitudinal data". Journal of the Royal Statistical Society: Series B (Statistical Methodology). 69 (4): 679–699.

[26] Banfield, JD; Raftery, AE. (1993). "Model-based Gaussian and non-Gaussian clustering". Biometrics. 49 (3): 803–821.

[27] James, GM; Sugar, CA. (2003). "Clustering for sparsely sampled functional data". Journal of the American Statistical Association. 98 (462): 397–408.

[28] Jacques, J; Preda, C. (2013). "Funclust: A curves clustering method using functional random variables density approximation". Neurocomputing. 112: 164–171.

[29] Jacques, J; Preda, C. (2014). "Model-based clustering for multivariate functional data". Computational Statistics & Data Analysis. 71 (C): 92–106.

[30] Heinzl, F; Tutz, G. (2014). "Clustering in linear-mixed models with a group fused lasso penalty". Biometrical Journal. 56 (1): 44–68.

[31] Angelini, C; Canditiis, DD; Pensky, M. (2012). "Clustering time-course microarray data using functional Bayesian infinite mixture model". Journal of Applied Statistics. 39 (1): 129–149.

[32] Rodríguez, A; Dunson, DB; Gelfand, AE. (2009). "Bayesian nonparametric functional data analysis through density estimation". Biometrika. 96 (1): 149–162.

[33] Petrone, S; Guindani, M; Gelfand, AE. (2009). "Hybrid Dirichlet mixture models for functional data". Journal of the Royal Statistical Society. 71 (4): 755–782.

[34] Heinzl, F; Tutz, G. (2013). "Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm". Statistical Modelling. 13 (1): 41–67.

[35] Abraham, C; Cornillon, PA; Matzner‐Løber, E; Molinari, N. (2003). "Unsupervised curve clustering using B-splines". Scandinavian Journal of Statistics. 30 (3): 581–595.

[36] Serban, N; Wasserman, L. (2005). "CATS: Clustering after transformation and smoothing". Journal of the American Statistical Association. 100 (471): 990–999.

[37] Coffey, N; Hinde, J; Holian, E. (2014). "Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data". Computational Statistics & Data Analysis. 71 (C): 14–29.

[38] Kayano, M; Dozono, K; Konishi, S. (2010). "Functional cluster analysis via orthonormalized Gaussian basis expansions and its application". Journal of Classification. 27 (2): 211–230.

[39] Giacofci, M; Lambert‐Lacroix, S; Marot, G; Picard, F. (2013). "Wavelet-based clustering for mixed-effects functional models in high dimension". Biometrics. 69 (1): 31–40.

[40] Peng, J; Müller, HG. (2008). "Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions". The Annals of Applied Statistics. 2 (3): 1056–1077.

[41] Chiou, JM; Li, PL. (2007). "Functional clustering and identifying substructures of longitudinal data". Journal of the Royal Statistical Society: Series B (Statistical Methodology). 69 (4): 679–699.

[42] Banfield, JD; Raftery, AE. (1993). "Model-based Gaussian and non-Gaussian clustering". Biometrics. 49 (3): 803–821.

[43] James, GM; Sugar, CA. (2003). "Clustering for sparsely sampled functional data". Journal of the American Statistical Association. 98 (462): 397–408.

[44] Jacques, J; Preda, C. (2013). "Funclust: A curves clustering method using functional random variables density approximation". Neurocomputing. 112: 164–171.

[45] Jacques, J; Preda, C. (2014). "Model-based clustering for multivariate functional data". Computational Statistics & Data Analysis. 71 (C): 92–106.

[46] Heinzl, F; Tutz, G. (2014). "Clustering in linear-mixed models with a group fused lasso penalty". Biometrical Journal. 56 (1): 44–68.

[47] Angelini, C; Canditiis, DD; Pensky, M. (2012). "Clustering time-course microarray data using functional Bayesian infinite mixture model". Journal of Applied Statistics. 39 (1): 129–149.

[48] Rodríguez, A; Dunson, DB; Gelfand, AE. (2009). "Bayesian nonparametric functional data analysis through density estimation". Biometrika. 96 (1): 149–162.

[49] Petrone, S; Guindani, M; Gelfand, AE. (2009). "Hybrid Dirichlet mixture models for functional data". Journal of the Royal Statistical Society. 71 (4): 755–782.

[50] Heinzl, F; Tutz, G. (2013). "Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm". Statistical Modelling. 13 (1): 41–67.

[51] Grenander, U. (1950). "Stochastic processes and statistical inference". Arkiv för Matematik. 1 (3): 195–277.

[52] Rice, JA; Silverman, BW. (1991). "Estimating the mean and covariance structure nonparametrically when the data are curves". Journal of the Royal Statistical Society. 53 (1): 233–243.

[53] Müller, HG. (2016). "Peter Hall, functional data analysis and random objects". Annals of Statistics. 44 (5): 1867–1887.

[54] Karhunen, K. (1946). Zur Spektraltheorie stochastischer Prozesse. Annales Academiae scientiarum Fennicae.

[55] Kleffe, J. (1973). "Principal components of random variables with values in a separable Hilbert space". Mathematische Operationsforschung und Statistik. 4 (5): 391–406.

[56] Dauxois, J; Pousse, A; Romain, Y. (1982). "Asymptotic theory for the principal component analysis of a vector random function: Some applications to statistical inference". Journal of Multivariate Analysis. 12 (1): 136–154.

[57] Ramsay, J; Silverman, BW. (2005). Functional Data Analysis, 2nd ed. Springer.

[58] Hsing, T; Eubank, R. (2015). Theoretical Foundations of Functional Data Analysis, with an Introduction to Linear Operators. Wiley Series in Probability and Statistics.

[59] Shi, M; Weiss, RE; Taylor, JMG. (1996). "An analysis of paediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves". Journal of the Royal Statistical Society. Series C (Applied Statistics). 45 (2): 151–163.

[60] Hilgert, N; Mas, A; Verzelen, N. (2013). "Minimax adaptive tests for the functional linear model". Annals of Statistics. 41: 838–869.

[61] Kong, D; Xue, K; Yao, F; Zhang, HH. (2016). "Partially functional linear regression in high dimensions". Biometrika. 103 (1): 147–159.

[62] Horváth, L; Kokoszka, P. (2012). Inference for functional data with applications. Springer Series in Statistics. Springer-Verlag.

[63] Wang, JL; Chiou, JM; Müller, HG. (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3 (1): 257–295.

[64] Ramsay, J; Silverman, BW. (2005). Functional Data Analysis, 2nd ed. Springer.

[65] Ramsay, JO; Dalzell, CJ. (1991). "Some tools for functional data analysis". Journal of the Royal Statistical Society: Series B (Methodological). 53 (3): 539–561.

[66] Malfait, N; Ramsay, JO. (2003). "The historical functional linear model". The Canadian Journal of Statistics. 31 (2): 115–128.

[67] He, G; Müller, HG; Wang, JL. (2003). "Functional canonical analysis for square integrable stochastic processes". Journal of Multivariate Analysis. 85 (1): 54–77.

[68] Yao, F; Müller, HG; Wang, JL. (2005). "Functional data analysis for sparse longitudinal data". Journal of the American Statistical Association. 100 (470): 577–590.

[69] He, G; Müller, HG; Wang, JL; Yang, WJ. (2010). "Functional linear regression via canonical analysis". Bernoulli. 16 (3): 705–729.

[70] Fan, J; Zhang, W. (1999). "Statistical estimation in varying coefficient models". The Annals of Statistics. 27 (5): 1491–1518.

[71] Wu, CO; Yu, KF. (2002). "Nonparametric varying-coefficient models for the analysis of longitudinal data". International Statistical Review. 70 (3): 373–393.

[72] Huang, JZ; Wu, CO; Zhou, L. (2002). "Varying-coefficient models and basis function approximations for the analysis of repeated measurements". Biometrika. 89 (1): 111–128.

[73] Huang, JZ; Wu, CO; Zhou, L. (2004). "Polynomial spline estimation and inference for varying coefficient models with longitudinal data". Statistica Sinica. 14 (3): 763–788.

[74] Şentürk, D; Müller, HG. (2010). "Functional varying coefficient models for longitudinal data". Journal of the American Statistical Association. 105 (491): 1256–1264.

[75] Eggermont, PPB; Eubank, RL; LaRiccia, VN. (2010). "Convergence rates for smoothing spline estimators in varying coefficient models". Journal of Statistical Planning and Inference. 140 (2): 369–381.

[76] Yao, F; Müller, HG. (2010). "Functional quadratic regression". Biometrika. 97 (1): 49–64.

[77] Horváth, L; Reeder, R. (2013). "A test of significance in functional quadratic regression". Bernoulli. 19 (5A): 2120–2151.

[78] Chen, D; Hall, P; Müller, HG. (2011). "Single and multiple index functional regression models with nonparametric link". The Annals of Statistics. 39 (3): 1720–1747.

[79] Jiang, CR; Wang, JL. (2011). "Functional single index models for longitudinal data". The Annals of Statistics. 39 (1): 362–388.

[80] Wang, JL; Chiou, JM; Müller, HG. (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3 (1): 257–295.

[81] Müller, HG; Wu, Y; Yao, F. (2013). "Continuously additive models for nonlinear functional regression". Biometrika. 100 (3): 607–622.

[82] Müller, HG; Stadtmüller, U. (2005). "Generalized Functional Linear Models". The Annals of Statistics. 33 (2): 774–805.

[83] Chiou, JM; Li, PL. (2007). "Functional clustering and identifying substructures of longitudinal data". Journal of the Royal Statistical Society: Series B (Statistical Methodology). 69 (4): 679–699.

[84] Banfield, JD; Raftery, AE. (1993). "Model-based Gaussian and non-Gaussian clustering". Biometrics. 49 (3): 803–821.

[85] James, GM; Sugar, CA. (2003). "Clustering for sparsely sampled functional data". Journal of the American Statistical Association. 98 (462): 397–408.

[86] Jacques, J; Preda, C. (2013). "Funclust: A curves clustering method using functional random variables density approximation". Neurocomputing. 112: 164–171.

[87] Jacques, J; Preda, C. (2014). "Model-based clustering for multivariate functional data". Computational Statistics & Data Analysis. 71 (C): 92–106.

[88] Heinzl, F; Tutz, G. (2014). "Clustering in linear-mixed models with a group fused lasso penalty". Biometrical Journal. 56 (1): 44–68.

[89] Angelini, C; Canditiis, DD; Pensky, M. (2012). "Clustering time-course microarray data using functional Bayesian infinite mixture model". Journal of Applied Statistics. 39 (1): 129–149.

[90] Rodríguez, A; Dunson, DB; Gelfand, AE. (2009). "Bayesian nonparametric functional data analysis through density estimation". Biometrika. 96 (1): 149–162.

[91] Petrone, S; Guindani, M; Gelfand, AE. (2009). "Hybrid Dirichlet mixture models for functional data". Journal of the Royal Statistical Society. 71 (4): 755–782.

[92] Heinzl, F; Tutz, G. (2013). "Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm". Statistical Modelling. 13 (1): 41–67.

[93] Leng, X; Müller, HG. (2006). "Classification using functional data analysis for temporal gene expression data". Bioinformatics. 22 (1): 68–76.

[94] James, GM; Hastie, TJ. (2001). "Functional linear discriminant analysis for irregularly sampled curves". Journal of the Royal Statistical Society. 63 (3): 533–550.

[95] Hall, P; Poskitt, DS; Presnell, B. (2001). "A Functional Data-Analytic Approach to Signal Discrimination". Technometrics. 43 (1): 1–9.

[96] Ferraty, F; Vieu, P. (2003). "Curves discrimination: a nonparametric functional approach". Computational Statistics & Data Analysis. 44 (1–2): 161–173.

[97] Chang, C; Chen, Y; Ogden, RT. (2014). "Functional data classification: a wavelet approach". Computational Statistics. 29 (6): 1497–1513.

[98] Zhu, H; Brown, PJ; Morris, JS. (2012). "Robust Classification of Functional and Quantitative Image Data Using Functional Mixed Models". Biometrics. 68 (4): 1260–1268.

[99] Gasser, T; Müller, HG; Kohler, W; Molinari, L; Prader, A. (1984). "Nonparametric regression analysis of growth curves". The Annals of Statistics. 12 (1): 210–229.

[100] Ramsay, JO; Li, X. (1998). "Curve registration". Journal of the Royal Statistical Society: Series B. 60 (2): 351–363.

[101] Peng, J; Paul, D; Müller, HG. (2014). "Time-warped growth processes, with applications to the modeling of boom-bust cycles in house prices". The Annals of Applied Statistics. 8 (3): 1561–1582.

[102] Marron, JS; Ramsay, JO; Sangalli, LM; Srivastava, A. (2015). "Functional data analysis of amplitude and phase variation". Statistical Science. 30 (4): 468–484.

[103] Sakoe, H; Chiba, S. (1978). "Dynamic programming algorithm optimization for spoken word recognition". IEEE Transactions on Acoustics, Speech, and Signal Processing. 26: 43–49.

[104] Kneip, A; Gasser, T. (1992). "Statistical tools to analyze data representing a sample of curves". Annals of Statistics. 20: 1266–1305.

[105] Gasser, T; Kneip, A. (1995). "Searching for structure in curve sample". Journal of the American Statistical Association. 90 (432): 1179–1188.

[106] Tang, R; Müller, HG. (2008). "Pairwise curve synchronization for functional data". Biometrika. 95: 875–889.

[107] Anirudh, R; Turaga, P; Su, J; Srivastava, A. (2015). "Elastic functional coding of human actions: From vector-fields to latent variables". Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition: 3147–3155.

[108] Dai, X; Müller, HG. (2018). "Principal component analysis for functional data on Riemannian manifolds and spheres". The Annals of Statistics. 46 (6B): 3334–3361.

[109] Chen, K; Delicado, P; Müller, HG. (2017). "Modelling function-valued stochastic processes, with applications to fertility dynamics". Journal of the Royal Statistical Society. Series B (Statistical Methodology). 79 (1): 177–196.

[110] Dubey, P; Müller, HG (2021). "Modeling Time-Varying Random Objects and Dynamic Networks". Journal of the American Statistical Association. 0 (0): 1–16.

[111] Yao, F; Müller, HG; Wang, JL. (2005). "Functional data analysis for sparse longitudinal data". Journal of the American Statistical Association. 100 (470): 577–590.
