-
Background: We previously presented GraphVar as a user-friendly MATLAB
toolbox for comprehensive graph analyses of functional brain connectivity. Here
we introduce a comprehensive extension of the toolbox allowing users to
seamlessly explore easily customizable decoding models across functional
connectivity measures as well as additional features.
New Method: GraphVar 2.0 provides machine learning (ML) model construction,
validation and exploration. Machine learning can be performed across any
combination of network measures and additional variables, allowing for
flexibility in neuroimaging applications.
Results: In addition to previously integrated functionalities, such as
network construction and graph-theoretical analyses of brain connectivity with
a high-speed general linear model (GLM), users can now perform customizable ML
across connectivity matrices, network metrics and additionally imported
variables. The new extension also provides parametric and nonparametric testing
of classifier and regressor performance, data export, and figure generation
with high-quality export.
Comparison with existing methods: Compared to other existing toolboxes,
GraphVar 2.0 offers (1) comprehensive customization, (2) an all-in-one
user-friendly interface, (3) customizable model design and manual hyperparameter
entry, (4) interactive results exploration and data export, (5) automated
queuing for modelling multiple outcome variables within the same session, and
(6) an easy-to-follow introductory review.
Conclusions: GraphVar 2.0 allows comprehensive, user-friendly exploration of
encoding (GLM) and decoding (ML) modelling approaches on functional
connectivity measures, making big data neuroscience readily accessible to a
broader audience of neuroimaging investigators.
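
As an illustration of what nonparametric testing of classifier performance can look like, the sketch below runs a label-permutation test on classification accuracy. It is written in Python rather than MATLAB and is not GraphVar code; the function names, the toy labels, and the choice of a permutation test are assumptions made for this example. For brevity the predictions are held fixed while the labels are shuffled, whereas a full test would refit the model on every permutation.

```python
import random


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def permutation_p_value(y_true, y_pred, n_permutations=1000, seed=0):
    """Label-permutation test for classification accuracy.

    Simplification (assumption): predictions stay fixed and only the labels
    are shuffled; a full test would refit the model on each permutation.
    """
    rng = random.Random(seed)
    observed = accuracy(y_true, y_pred)
    shuffled = list(y_true)
    at_least_as_good = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if accuracy(shuffled, y_pred) >= observed:
            at_least_as_good += 1
    # Add-one correction keeps the p-value strictly above zero.
    p_value = (at_least_as_good + 1) / (n_permutations + 1)
    return observed, p_value


# Toy example (hypothetical data): 20 subjects, 18 classified correctly.
labels = [0, 1] * 10
predictions = list(labels)
predictions[0], predictions[1] = 1, 0  # two deliberate errors
obs_acc, p = permutation_p_value(labels, predictions)
print(f"accuracy={obs_acc:.2f}, p={p:.4f}")
```

The p-value is the share of shuffled labelings that score at least as well as the observed one, so a small value suggests the accuracy is unlikely to arise by chance.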
-
As very large studies of complex neuroimaging phenotypes become more common,
human quality assessment of MRI-derived data remains one of the last major
bottlenecks. Few attempts have so far been made to address this issue with
machine learning. In this work, we optimize predictive models of quality for
meshes representing deep brain structure shapes. We use standard vertex-wise
and global shape features computed homologously across 19 cohorts and over 7500
human-rated subjects, training kernelized Support Vector Machine and Gradient
Boosted Decision Trees classifiers to detect meshes of failing quality. Our
models generalize across datasets and diseases, reducing human workload by
30-70%, or equivalently hundreds of human rater hours for datasets of
comparable size, with recall rates approaching inter-rater reliability.
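
To make the trade-off between recall and workload concrete, here is a minimal Python sketch. It assumes (this is not stated in the abstract) that a classifier outputs a failure score per mesh, that meshes scoring below a threshold are accepted without human review, and that workload reduction is the fraction of meshes skipped this way. The scores, labels, and function names are hypothetical.

```python
import math


def threshold_for_recall(scores, is_fail, target_recall):
    """Highest score threshold that still flags the requested share of
    truly failing meshes (higher score = more likely to fail)."""
    fail_scores = sorted((s for s, f in zip(scores, is_fail) if f), reverse=True)
    if not fail_scores:
        raise ValueError("need at least one failing mesh to set a threshold")
    # Number of failing meshes that must score at or above the threshold.
    k = max(1, math.ceil(target_recall * len(fail_scores)))
    return fail_scores[k - 1]


def workload_reduction(scores, threshold):
    """Fraction of meshes below the threshold, i.e. not sent to a human rater."""
    return sum(1 for s in scores if s < threshold) / len(scores)


# Hypothetical classifier scores for ten meshes and their human ratings.
scores = [0.9, 0.8, 0.7, 0.4, 0.35, 0.3, 0.2, 0.1, 0.05, 0.02]
is_fail = [True, True, False, True, False, False, False, False, False, False]

threshold = threshold_for_recall(scores, is_fail, target_recall=1.0)
reduction = workload_reduction(scores, threshold)
print(f"threshold={threshold}, workload reduction={reduction:.0%}")
```

Lowering the target recall raises the threshold and skips more meshes, which is the kind of trade-off a workload range such as the one quoted above would reflect.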
-
Large-scale collaborative analysis of brain imaging data, in psychiatry and
neurology, offers a new source of statistical power to discover features that
boost accuracy in disease classification, differential diagnosis, and outcome
prediction. However, due to data privacy regulations or limited accessibility
to large datasets across the world, it is challenging to efficiently integrate
distributed information. Here we propose a novel classification framework
through multi-site weighted LASSO: each site performs an iterative weighted
LASSO for feature selection separately. Within each iteration, the
classification result and the selected features are collected to update the
weighting parameters for each feature. This new weight is used to guide the
LASSO process at the next iteration. Only the features that help to improve
the classification accuracy are preserved. In tests on data from five sites
(299 patients with major depressive disorder (MDD) and 258 normal controls),
our method boosted classification accuracy for MDD by 4.9% on average. This
result shows the potential of the proposed new strategy as an effective and
practical collaborative platform for machine learning on large-scale
distributed imaging and biobank data.
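
The iterative scheme described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' implementation: the penalty update rule (accuracy-weighted selection frequency), the use of training accuracy, the penalty floor, and the toy data are all hypothetical. Only each site's accuracy and selected-feature mask are shared; raw data stay local.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the LASSO proximal step)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def weighted_lasso(X, y, penalties, lam=0.1, n_sweeps=50):
    """Coordinate descent for
    (1/2n)*||y - Xb||^2 + lam * sum_j penalties[j] * |b_j|."""
    n, d = len(X), len(X[0])
    coef = [0.0] * d
    for _ in range(n_sweeps):
        for j in range(d):
            rho = z = 0.0
            for row, target in zip(X, y):
                pred = sum(c * v for c, v in zip(coef, row))
                rho += row[j] * (target - pred + coef[j] * row[j])
                z += row[j] ** 2
            shrunk = soft_threshold(rho / n, lam * penalties[j])
            coef[j] = shrunk / (z / n) if z else 0.0
    return coef


def site_report(X, y, penalties, lam=0.1):
    """Local step at one site: fit, then report accuracy and selected features.
    Training accuracy is used for brevity (assumption)."""
    coef = weighted_lasso(X, y, penalties, lam)
    preds = [1 if sum(c * v for c, v in zip(coef, row)) > 0 else -1 for row in X]
    acc = sum(p == t for p, t in zip(preds, y)) / len(y)
    return acc, [abs(c) > 1e-8 for c in coef]


def update_penalties(reports, floor=0.1):
    """ASSUMED rule, not from the source: a feature selected by accurate
    sites gets a smaller penalty in the next round."""
    total = sum(acc for acc, _ in reports)
    d = len(reports[0][1])
    if total == 0:
        return [1.0] * d
    return [max(floor, 1.0 - sum(a for a, sel in reports if sel[j]) / total)
            for j in range(d)]


# Two hypothetical sites; feature 0 carries the label, feature 1 is noise.
site_a = ([[1, 0.2], [1, -0.2], [-1, 0.2], [-1, -0.2]], [1, 1, -1, -1])
site_b = ([[2, 0.5], [2, -0.5], [-2, 0.5], [-2, -0.5]], [1, 1, -1, -1])
penalties = [1.0, 1.0]
for _ in range(3):
    reports = [site_report(X, y, penalties) for X, y in (site_a, site_b)]
    penalties = update_penalties(reports)
print(penalties)
```

In this toy run the informative feature's penalty drops to the floor while the noise feature keeps its full penalty, which mirrors the idea of guiding later LASSO rounds toward features that support classification.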
-
The field of neuroimaging has truly become data rich, and novel analytical
methods capable of gleaning meaningful information from large stores of imaging
data are in high demand. Those methods that might also be applicable on the
level of individual subjects, and thus potentially useful clinically, are of
special interest. In the present study, we introduce just such a method, called
nonlinear functional mapping (NFM), and demonstrate its application in the
analysis of resting-state fMRI from a 242-subject subset of the IMAGEN project,
a European study of adolescents that includes longitudinal phenotypic,
behavioral, genetic, and neuroimaging data. NFM employs a computational
technique inspired by biological evolution to discover and mathematically
characterize interactions among regions of interest (ROIs), without making
linear or univariate assumptions. We show that statistics of the resulting
interaction relationships comport with recent independent work, constituting a
preliminary cross-validation. Furthermore, nonlinear terms are ubiquitous in
the models generated by NFM, suggesting that some of the interactions
characterized here are not discoverable by standard linear methods of analysis.
We discuss one such nonlinear interaction in the context of a direct comparison
with a procedure involving pairwise correlation, designed to be an analogous
linear version of functional mapping. We find another such interaction that
suggests a novel distinction in brain function between drinking and
non-drinking adolescents: a tighter coupling of ROIs associated with emotion,
reward, and interoceptive processes such as thirst, among drinkers. Finally, we
outline many improvements and extensions of the methodology to reduce
computational expense, complement other analytical tools like graph-theoretic
analysis, and allow for voxel-level NFM to eliminate the necessity of ROI
selection.
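
The claim that some interactions are invisible to pairwise correlation can be illustrated with a toy Python example. This is not NFM and uses no imaging data; the two "ROI" signals are hypothetical and chosen so that one is an exact quadratic function of the other.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical signals: roi_b depends on roi_a only through a squared term.
roi_a = [-2.0, -1.0, 0.0, 1.0, 2.0]
roi_b = [a * a for a in roi_a]

linear_r = pearson(roi_a, roi_b)                       # linear view of the pair
nonlinear_r = pearson([a * a for a in roi_a], roi_b)   # after a squared term
print(linear_r, nonlinear_r)
```

Here the pairwise correlation is zero even though one signal fully determines the other, while a model that includes the squared term recovers the relationship exactly.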