
While many statistical models and methods are now available for network
analysis, resampling network data remains a challenging problem.
Crossvalidation is a useful general tool for model selection and parameter
tuning, but is not directly applicable to networks since splitting network
nodes into groups requires deleting edges and destroys some of the network
structure. Here we propose a new network resampling strategy based on splitting
node pairs rather than nodes applicable to crossvalidation for a wide range of
network model selection tasks. We provide a theoretical justification for our
method in a general setting and examples of how our method can be used in
specific network model selection and parameter tuning tasks. Numerical results
on simulated networks and on a citation network of statisticians show that this
crossvalidation approach works well for model selection.

Prediction algorithms typically assume the training data are independent
samples, but in many modern applications samples come from individuals
connected by a network. For example, in adolescent health studies of
risktaking behaviors, information on the subjects' social network is often
available and plays an important role through network cohesion, the empirically
observed phenomenon of friends behaving similarly. Taking cohesion into account
in prediction models should allow us to improve their performance. Here we
propose a networkbased penalty on individual node effects to encourage
similarity between predictions for linked nodes, and show that incorporating it
into prediction leads to improvement over traditional models both theoretically
and empirically when network cohesion is present. The penalty can be used with
many lossbased prediction methods, such as regression, generalized linear
models, and Cox's proportional hazard model. Applications to predicting levels
of recreational activity and marijuana usage among teenagers from the AddHealth
study based on both demographic covariates and friendship networks are
discussed in detail and show that our approach to taking friendships into
account can significantly improve predictions of behavior while providing
interpretable estimates of covariate effects.

Abstraction and realization are bilateral processes that are key in deriving
intelligence and creativity. In many domains, the two processes are approached
through rules: highlevel principles that reveal invariances within similar yet
diverse examples. Under a probabilistic setting for discrete input spaces, we
focus on the rule realization problem which generates input sample
distributions that follow the given rules. More ambitiously, we go beyond a
mechanical realization that takes whatever is given, but instead ask for
proactively selecting reasonable rules to realize. This goal is demanding in
practice, since the initial rule set may not always be consistent and thus
intelligent compromises are needed. We formulate both rule realization and
selection as two strongly connected components within a single and symmetric
biconvex problem, and derive an efficient algorithm that works at large scale.
Taking music compositional rules as the main example throughout the paper, we
demonstrate our model's efficiency in not only music realization (composition)
but also music interpretation and understanding (analysis).

While graphical models for continuous data (Gaussian graphical models) and
discrete data (Ising models) have been extensively studied, there is little
work on graphical models linking both continuous and discrete variables (mixed
data), which are common in many scientific applications. We propose a novel
graphical model for mixed data, which is simple enough to be suitable for
highdimensional data, yet flexible enough to represent all possible graph
structures. We develop a computationally efficient regressionbased algorithm
for fitting the model by focusing on the conditional loglikelihood of each
variable given the rest. The parameters have a natural group structure, and
sparsity in the fitted graph is attained by incorporating a group lasso
penalty, approximated by a weighted $\ell_1$ penalty for computational
efficiency. We demonstrate the effectiveness of our method through an extensive
simulation study and apply it to a music annotation data set (CAL500),
obtaining a sparse and interpretable graphical model relating the continuous
features of the audio signal to categorical variables such as genre, emotions,
and usage associated with particular songs. While we focus on binary discrete
variables, we also show that the proposed methodology can be easily extended to
general discrete variables.

A very simple interpretation of matrix completion problem is introduced based
on statistical models. Combined with the wellknown results from missing data
analysis, such interpretation indicates that matrix completion is still a valid
and principled estimation procedure even without the missing completely at
random (MCAR) assumption, which almost all of the current theoretical studies
of matrix completion assume.

In this paper, we consider Gaussian models Markov with respect to an
arbitrary DAG. We first construct a family of conjugate priors for the Cholesky
parametrization of the covariance matrix of such models. This family has as
many shape parameters as the DAG has vertices, and naturally extends the work
of Geiger and Heckerman [8]. From these distributions, we derive prior
distributions for the covariance and precision parameters of the Gaussian DAG
Markov models. Our works thus extends the work of Dawid and Lauritzen [5] and
Letac and Massam [16] for Gaussian models Markov with respect to a decomposable
graph to arbitrary DAGs. For this reason, we call our distributions DAGWishart
distributions. An advantage of these distributions is that they possess strong
hyper Markov properties and thus allow for explicit estimation of the
covariance and precision parameters, regardless of the dimension of the
problem. They also allow us to develop methodology for model selection and
covariance estimation in the space of DAGMarkov models. We demonstrate via
several numerical examples that the proposed method scales well to
highdimensions.