• Studying caricature recognition is fundamentally important to understanding of face perception. However, little research has been conducted in the computer vision community, largely due to the shortage of suitable datasets. In this paper, a new caricature dataset is built, with the objective to facilitate research in caricature recognition. All the caricatures and face images were collected from the Web. Compared with two existing datasets, this dataset is much more challenging, with a much greater number of available images, artistic styles and larger intra-personal variations. Evaluation protocols are also offered together with their baseline performances on the dataset to allow fair comparisons. Besides, a framework for caricature face recognition is presented to make a thorough analyze of the challenges of caricature recognition. By analyzing the challenges, the goal is to show problems that worth to be further investigated. Additionally, based on the evaluation protocols and the framework, baseline performances of various state-of-the-art algorithms are provided. A conclusion is that there is still a large space for performance improvement and the analyzed problems still need further investigation.
  • Due to low tissue contrast, irregular object appearance, and unpredictable location variation, segmenting the objects from different medical imaging modalities (e.g., CT, MR) is considered as an important yet challenging task. In this paper, we present a novel method for interactive medical image segmentation with the following merits. (1) Our design is fundamentally different from previous pure patch-based and image-based segmentation methods. We observe that during delineation, the physician repeatedly check the inside-outside intensity changing to determine the boundary, which indicates that comparison in an inside-outside manner is extremely important. Thus, we innovatively model our segmentation task as learning the representation of the bi-directional sequential patches, starting from (or ending in) the given central point of the object. This can be realized by our proposed ConvRNN network embedded with a gated memory propagation unit. (2) Unlike previous interactive methods (requiring bounding box or seed points), we only ask the physician to merely click on the rough central point of the object before segmentation, which could simultaneously enhance the performance and reduce the segmentation time. (3) We utilize our method in a multi-level framework for better performance. We systematically evaluate our method in three different segmentation tasks including CT kidney tumor, MR prostate, and PROMISE12 challenge, showing promising results compared with state-of-the-art methods. The code is available here: \href{https://github.com/sunalbert/Sequential-patch-based-segmentation}{Sequential-patch-based-segmentation}.
  • Due to the irregular motion, similar appearance and diverse shape, accurate segmentation of kidney tumor in CT images is a difficult and challenging task. To this end, we present a novel automatic segmentation method, termed as Crossbar-Net, with the goal of accurate segmenting the kidney tumors. Firstly, considering that the traditional learning-based segmentation methods normally employ either whole images or squared patches as the training samples, we innovatively sample the orthogonal non-squared patches (namely crossbar patches), to fully cover the whole kidney tumors in either horizontal or vertical directions. These sampled crossbar patches could not only represent the detailed local information of kidney tumor as the traditional patches, but also describe the global appearance from either horizontal or vertical direction using contextual information. Secondly, with the obtained crossbar patches, we trained a convolutional neural network with two sub-models (i.e., horizontal sub-model and vertical sub-model) in a cascaded manner, to integrate the segmentation results from two directions (i.e., horizontal and vertical). This cascaded training strategy could effectively guarantee the consistency between sub-models, by feeding each other with the most difficult samples, for a better segmentation. In the experiment, we evaluate our method on a real CT kidney tumor dataset, collected from 94 different patients including 3,500 images. Compared with the state-of-the-art segmentation methods, the results demonstrate the superior results of our method on dice ratio score, true positive fraction, centroid distance and Hausdorff distance. Moreover, we have extended our crossbar-net to a different task: cardiac segmentation, showing the promising results for the better generalization.
  • There are two kinds of atomic vapor cell gyroscopes, one is nuclear-magnetic-resonance (NMR) gyroscope and the other is spin-exchange-relaxation-free (SERF) gyroscope. We demonstrate that there is a common model for these two kinds of gyroscope. The output signals of NMR and SERF gyroscopes are compared directly, which provides an important guidance for the scheme choosing and optimization of atomic gyroscope. Although this expose of the equivalence cannot solve the current problems of atomic gyroscopes, such as the narrow response bandwidth of SERF gyroscope and the drift due to the different Fermi-contact interaction coefficient in NMR gyroscope, it provides a simple understanding of atomic gyroscopes and may inspire new schemes combining advantages of both NMR and SERF gyroscopes.
  • In this work, our statements are based on the progress of current research on superatomic clusters. Combining the new trend of materials and device manufacture at the atomic level, we analyzed the opportunities for the development based on the use of superatomic clusters as units of functional materials, and presented a foresight of this new branch of science with relevant studies on superatoms.
  • In this paper, we propose the Cross-Domain Adversarial Auto-Encoder (CDAAE) to address the problem of cross-domain image inference, generation and transformation. We make the assumption that images from different domains share the same latent code space for content, while having separate latent code space for style. The proposed framework can map cross-domain data to a latent code vector consisting of a content part and a style part. The latent code vector is matched with a prior distribution so that we can generate meaningful samples from any part of the prior space. Consequently, given a sample of one domain, our framework can generate various samples of the other domain with the same content of the input. This makes the proposed framework different from the current work of cross-domain transformation. Besides, the proposed framework can be trained with both labeled and unlabeled data, which makes it also suitable for domain adaptation. Experimental results on data sets SVHN, MNIST and CASIA show the proposed framework achieved visually appealing performance for image generation task. Besides, we also demonstrate the proposed method achieved superior results for domain adaptation. Code of our experiments is available in https://github.com/luckycallor/CDAAE.
  • In this paper, a novel mask based deep ranking neural network with skipped fusing layer (MaskReID) is proposed for person re-identification (Re-ID). For person Re-ID, there are multiple challenges co-exist throughout the re-identification process, including cluttered background, appearance variations (illumination, pose, occlusion, etc.) among different camera views and interference of samples of similar appearance. A compact framework is proposed to address these problems. Firstly, to address the problem of cluttered background, masked images which are the image segmentations of the original images are incorporated as input in the proposed neural network. Then, to remove the appearance variations so as to obtain more discriminative feature, a new network structure is proposed which fuses feature of different layers as the final feature. This makes the final feature a combination of all the low, middle and high level feature, which is more informative. Lastly, as person Re-ID is a special image retrieval task, a novel ranking loss is designed to optimize the whole network. The ranking loss relieved the interference problem of similar samples while producing ranking results. The experimental results demonstrate that the proposed method consistently outperforms the state-of-the-art methods on many person Re-ID datasets, especially large-scale datasets, such as, CUHK03, Market1501 and DukeMTMC-reID.
  • Using the semiclassical theory of electron dynamics, we derive a gauge-invariant expression for the spin toroidization in a periodical crystal. We show that the spin toroidization is comprised of two contributions: one is due to the configuration of a classical spin array, while the other comes from the coordinate shift of the electron as spin carrier in response to the inhomogeneous magnetic field. We then establish a direct and elengant relation between our spin toroidization and the antisymmetric magnetoelectric polarizability in insulators. Finally, we demonstrate our spin toroidization in a tight-binding model and show that it is a genuine bulk quantity.
  • We present a microscopic theory of the magnetic quadrupole moment density $\Qij$ in periodic crystals using the semiclassical framework of electron dynamics. We obtain a gauge-invariant expression with clear physical interpretation and demonstrate the typical behaviour of $\Qij$ in a minimal two-band model that hosts a tilted Dirac cone. We then show that $\Qij$ leads to an intrinsic nonlinear anomalous thermoelectric current. This effect can be used to probe systems with combined time reversal and inversion symmetry. As an example, we calculate the nonlinear Nernst and Hall current in the loop-current model for cuprate superconductors, and find they are greatly enhanced around Dirac points and saddle points of the energy band, respectively.
  • The combination of argumentation and probability paves the way to new accounts of qualitative and quantitative uncertainty, thereby offering new theoretical and applicative opportunities. Due to a variety of interests, probabilistic argumentation is approached in the literature with different frameworks, pertaining to structured and abstract argumentation, and with respect to diverse types of uncertainty, in particular the uncertainty on the credibility of the premises, the uncertainty about which arguments to consider, and the uncertainty on the acceptance status of arguments or statements. Towards a general framework for probabilistic argumentation, we investigate a labelling-oriented framework encompassing a basic setting for rule-based argumentation and its (semi-) abstract account, along with diverse types of uncertainty. Our framework provides a systematic treatment of various kinds of uncertainty and of their relationships and allows us to back or question assertions from the literature.
  • This paper presents CAPE, a method to extract planes and cylinder segments from organized point clouds, which processes 640x480 depth images on a single CPU core at an average of 300 Hz, by operating on a grid of planar cells. While, compared to state-of-the-art plane extraction, the latency of CAPE is more consistent and 4-10 times faster, depending on the scene, we also demonstrate empirically that applying CAPE to visual odometry can improve trajectory estimation on scenes made of cylindrical surfaces (e.g. tunnels), whereas using a plane extraction approach that is not curve-aware deteriorates performance on these scenes. To use these geometric primitives in visual odometry, we propose extending a probabilistic RGB-D odometry framework based on points, lines and planes to cylinder primitives. Following this framework, CAPE runs on fused depth maps and the parameters of cylinders are modelled probabilistically to account for uncertainty and weight accordingly the pose optimization residuals.
  • Many state-of-the-art general object detection methods make use of shared full-image convolutional features (as in Faster R-CNN). This achieves a reasonable test-phase computation time while enjoys the discriminative power provided by large Convolutional Neural Network (CNN) models. Such designs excel on benchmarks which contain natural images but which have very unnatural distributions, i.e. they have an unnaturally high-frequency of the target classes and a bias towards a "friendly" or "dominant" object scale. In this paper we present further study of the use and adaptation of the Faster R-CNN object detection method for datasets presenting natural scale distribution and unbiased real-world object frequency. In particular, we show that better alignment of the detector scale sensitivity to the extant distribution improves vehicle detection performance. We do this by modifying both the selection of Region Proposals, and through using more scale-appropriate full-image convolution features within the CNN model. By selecting better scales in the region proposal input and by combining feature maps through careful design of the convolutional neural network, we improve performance on smaller objects. We significantly increase detection AP for the KITTI dataset car class from 76.3% on our baseline Faster R-CNN detector to 83.6% in our improved detector.
  • Voice impersonation is not the same as voice transformation, although the latter is an essential element of it. In voice impersonation, the resultant voice must convincingly convey the impression of having been naturally produced by the target speaker, mimicking not only the pitch and other perceivable signal qualities, but also the style of the target speaker. In this paper, we propose a novel neural network based speech quality- and style- mimicry framework for the synthesis of impersonated voices. The framework is built upon a fast and accurate generative adversarial network model. Given spectrographic representations of source and target speakers' voices, the model learns to mimic the target speaker's voice quality and style, regardless of the linguistic content of either's voice, generating a synthetic spectrogram from which the time domain signal is reconstructed using the Griffin-Lim method. In effect, this model reframes the well-known problem of style-transfer for images as the problem of style-transfer for speech signals, while intrinsically addressing the problem of durational variability of speech sounds. Experiments demonstrate that the model can generate extremely convincing samples of impersonated speech. It is even able to impersonate voices across different genders effectively. Results are qualitatively evaluated using standard procedures for evaluating synthesized voices.
  • Robust real-world learning should benefit from both demonstrations and interactions with the environment. Current approaches to learning from demonstration and reward perform supervised learning on expert demonstration data and use reinforcement learning to further improve performance based on the reward received from the environment. These tasks have divergent losses which are difficult to jointly optimize and such methods can be very sensitive to noisy demonstrations. We propose a unified reinforcement learning algorithm, Normalized Actor-Critic (NAC), that effectively normalizes the Q-function, reducing the Q-values of actions unseen in the demonstration data. NAC learns an initial policy network from demonstrations and refines the policy in the environment, surpassing the demonstrator's performance. Crucially, both learning from demonstration and interactive refinement use the same objective, unlike prior approaches that combine distinct supervised and reinforcement losses. This makes NAC robust to suboptimal demonstration data since the method is not forced to mimic all of the examples in the dataset. We show that our unified reinforcement learning algorithm can learn robustly and outperform existing baselines when evaluated on several realistic driving games.
  • This work proposes a robust visual odometry method for structured environments that combines point features with line and plane segments, extracted through an RGB-D camera. Noisy depth maps are processed by a probabilistic depth fusion framework based on Mixtures of Gaussians to denoise and derive the depth uncertainty, which is then propagated throughout the visual odometry pipeline. Probabilistic 3D plane and line fitting solutions are used to model the uncertainties of the feature parameters and pose is estimated by combining the three types of primitives based on their uncertainties. Performance evaluation on RGB-D sequences collected in this work and two public RGB-D datasets: TUM and ICL-NUIM show the benefit of using the proposed depth fusion framework and combining the three feature-types, particularly in scenes with low-textured surfaces, dynamic objects and missing depth measurements.
  • Galaxy integrated H{\alpha} star formation rate-stellar mass relation, or SFR(global)-M*(global) relation, is crucial for understanding star formation history and evolution of galaxies. However, many studies have dealt with SFR using unresolved measurements, which makes it difficult to separate out the contamination from other ionizing sources, such as active galactic nuclei and evolved stars. Using the integral field spectroscopic observations from SDSS-IV MaNGA, we spatially disentangle the contribution from different H{\alpha} powering sources for ~1000 galaxies. We find that, when including regions dominated by all ionizing sources in galaxies, the spatially-resolved relation between H{\alpha} surface density ({\Sigma}H{\alpha}(all)) and stellar mass surface density ({\Sigma}*(all)) progressively turns over at high {\Sigma}*(all) end for increasing M*(global) and bulge dominance (bulge-to-total light ratio, B/T). This in turn leads to the flattening of the integrated H{\alpha}(global)-M*(global) relation in the literature. By contrast, there is no noticeable flattening in both integrated H{\alpha}(HII)-M*(HII) and spatially-resolved {\Sigma}H{\alpha}(HII)-{\Sigma}*(HII) relations when only regions where star formation dominates the ionization are considered. In other words, the flattening can be attributed to the increasing regions powered by non-star-formation sources, which generally have lower ionizing ability than star formation. Analysis of the fractional contribution of non-star-formation sources to total H{\alpha} luminosity of a galaxy suggests a decreasing role of star formation as an ionizing source toward high-mass, high-B/T galaxies and bulge regions. This result indicates that the appearance of the galaxy integrated SFR-M* relation critically depends on their global properties (M*(global) and B/T) and relative abundances of various ionizing sources within the galaxies.
  • Atomically thin graphene exhibits fascinating mechanical properties, although its hardness and transverse stiffness are inferior to those of diamond. To date, there hasn't been any practical demonstration of the transformation of multi-layer graphene into diamond-like ultra-hard structures. Here we show that at room temperature and after nano-indentation, two-layer graphene on SiC(0001) exhibits a transverse stiffness and hardness comparable to diamond, resisting to perforation with a diamond indenter, and showing a reversible drop in electrical conductivity upon indentation. Density functional theory calculations suggest that upon compression, the two-layer graphene film transforms into a diamond-like film, producing both elastic deformations and sp2-to-sp3 chemical changes. Experiments and calculations show that this reversible phase change is not observed for a single buffer layer on SiC or graphene films thicker than 3 to 5 layers. Indeed, calculations show that whereas in two-layer graphene layer-stacking configuration controls the conformation of the diamond-like film, in a multilayer film it hinders the phase transformation.
  • We formulate a quasiclassical theory ($\omega_c\tau \lesssim 1$ with $\omega_c$ as the cyclotron frequency and $\tau$ as the relaxation time) to study the influence of magnetic field on electron-impurity scattering process in the two-dimensional electron gas. We introduce a general recipe based on an abstraction of the detailed impurity scattering process to define the scattering parameter such as the incoming and outgoing momentum and coordinate jump. In this picture, we can conveniently describe the skew scattering and coordinate jump, which will eventually modify the Boltzmann equation. We find an anomalous Hall resistivity different from the conventional Boltzmann-Drude result and a negative magnetoresistivity parabolic in magnetic field. The origin of these results has been analyzed. The relevance between our theory and recent simulation and experimental works is also discussed. Our theory dominates in dilute impurity system where the correlation effect is negligible.
  • SDSS Collaboration: Franco D. Albareti, Carlos Allende Prieto, Andres Almeida, Friedrich Anders, Scott Anderson, Brett H. Andrews, Alfonso Aragon-Salamanca, Maria Argudo-Fernandez, Eric Armengaud, Eric Aubourg, Vladimir Avila-Reese, Carles Badenes, Stephen Bailey, Beatriz Barbuy, Kat Barger, Jorge Barrera-Ballesteros, Curtis Bartosz, Sarbani Basu, Dominic Bates, Giuseppina Battaglia, Falk Baumgarten, Julien Baur, Julian Bautista, Timothy C. Beers, Francesco Belfiore, Matthew Bershady, Sara Bertran de Lis, Jonathan C. Bird, Dmitry Bizyaev, Guillermo A. Blanc, Michael Blanton, Michael Blomqvist, Adam S. Bolton, J. Borissova, Jo Bovy, William Nielsen Brandt, Jonathan Brinkmann, Joel R. Brownstein, Kevin Bundy, Etienne Burtin, Nicolas G. Busca, Hugo Orlando Camacho Chavez, M. Cano Diaz, Michele Cappellari, Ricardo Carrera, Yanping Chen, Brian Cherinka, Edmond Cheung, Cristina Chiappini, Drew Chojnowski, Chia-Hsun Chuang, Haeun Chung, Rafael Fernando Cirolini, Nicolas Clerc, Roger E. Cohen, Julia M. Comerford, Johan Comparat, Marie-Claude Cousinou, Kevin Covey, Jeffrey D. Crane, Rupert Croft, Katia Cunha, Luiz da Costa, Gabriele da Silva Ilha, Jeremy Darling, James W. Davidson Jr., Kyle Dawson, Nathan De Lee, Axel de la Macorra, Sylvain de la Torre, Alice Deconto Machado, Timothee Delubac, Aleksandar M. Diamond-Stanic, John Donor, Juan Jose Downes, Niv Drory, Helion du Mas des Bourboux, Cheng Du, Tom Dwelly, Garrett Ebelke, Arthur Eigenbrot, Daniel J. Eisenstein, Yvonne P. Elsworth, Eric Emsellem, Michael Eracleous, Stephanie Escoffier, Michael L. Evans, Jesus Falcon-Barroso, Xiaohui Fan, Ginevra Favole, Emma Fernandez-Alvar, J. G. Fernandez-Trincado, Diane Feuillet, Scott W. Fleming, Andreu Font-Ribera, Gordon Freischlad, Peter Frinchaboy, Hai Fu, Yang Gao, D. A. Garcia-Hernandez, Ana E. Garcia Perez, Rafael A. Garcia, R. Garcia-Dias, Patrick Gaulme, Junqiang Ge, Douglas Geisler, Hector Gil Marin, Bruce Gillespie, Leo Girardi, Daniel Goddard, Yilen Gomez Maqueo Chew, Violeta Gonzalez-Perez, Kathleen Grabowski, Paul Green, Catherine J. Grier, Thomas Grier, Hong Guo, Julien Guy, Alex Hagen, Matt Hall, Paul Harding, R. E. Harley, Sten Hasselquist, Suzanne Hawley, Christian R. Hayes, Fred Hearty, Saskia Hekker, Hector Hernandez Toledo, Shirley Ho, David W. Hogg, Kelly Holley-Bockelmann, Jon A. Holtzman, Parker H. Holzer, Jian Hu, Daniel Huber, Timothy Alan Hutchinson, Ho Seong Hwang, Hector J. Ibarra-Medel, Inese I. Ivans, KeShawn Ivory, Kurt Jaehnig, Trey W. Jensen, Jennifer A. Johnson, Amy Jones, Eric Jullo, T. Kallinger, Karen Kinemuchi, David Kirkby, Mark Klaene, Jean-Paul Kneib, Juna A. Kollmeier, Ivan Lacerna, Richard R. Lane, Dustin Lang, Pierre Laurent, David R. Law, Jean-Marc Le Goff, Alexie Leauthaud, Cheng Li, Ran Li, Chen Li, Niu Li, Fu-Heng Liang, Yu Liang, Marcos Lima, Lihwai Lin, Lin Lin, Yen-Ting Lin, Dan Long, Sara Lucatello, Nicholas MacDonald, Chelsea L. MacLeod, J. Ted Mackereth, Suvrath Mahadevan, Marcio Antonio-Geimba Maia, Roberto Maiolino, Steven R. Majewski, Olena Malanushenko, Nicolas Dullius Mallmann, Arturo Manchado, Claudia Maraston, Rui Marques-Chaves, Inma Martinez Valpuesta, Karen L. Masters, Savita Mathur, Ian D. McGreer, Andrea Merloni, Michael R. Merrifield, Szabolcs Meszaros, Andres Meza, Andrea Miglio, Ivan Minchev, Karan Molaverdikhani, Antonio D. Montero-Dorta, Benoit Mosser, Demitri Muna, Adam Myers, Preethi Nair, Kirpal Nandra, Melissa Ness, Jeffrey A. Newman, Robert C. Nichol, David L. Nidever, Christian Nitschelm, Julia O'Connell, Audrey Oravetz, Nelson Padilla, Nathalie Palanque-Delabrouille, Kaike Pan, John Parejko, Isabelle Paris, John A. Peacock, Sebastien Peirani, Marcos Pellejero-Ibanez, Samantha Penny, Will J. Percival, Jeffrey W. Percival, Ismael Perez-Fournon, Patrick Petitjean, Matthew Pieri, Marc H. Pinsonneault, Alice Pisani, Francisco Prada, Abhishek Prakash, Natalie Price-Jones, M. Jordan Raddick, Mubdi Rahman, Anand Raichoor, Sandro Barboza Rembold, A. M. Reyna, James Rich, Hannah Richstein, Jethro Ridl, Rogerio Riffel, Rogemar A. Riffel, Hans-Walter Rix, Annie C. Robin, Constance M. Rockosi, Sergio Rodriguez-Torres, Thaise S. Rodrigues, Natalie Roe, A. Roman Lopes, Carlos Roman-Zuniga, Ashley J. Ross, Graziano Rossi, John Ruan, Rossana Ruggeri, Jessie C. Runnoe, Salvador Salazar-Albornoz, Mara Salvato, Ariel G. Sanchez, Sebastian F. Sanchez, Jose R. Sanchez-Gallego, Basilio Xavier Santiago, Ricardo Schiavon, Jaderson S. Schimoia, Eddie Schlafly, David J. Schlegel, Donald P. Schneider, Ralph Schoenrich, Mathias Schultheis, Axel Schwope, Hee-Jong Seo, Aldo Serenelli, Branimir Sesar, Zhengyi Shao, Matthew Shetrone, Michael Shull, Victor Silva Aguirre, M. F. Skrutskie, Anže Slosar, Michael Smith, Verne V. Smith, Jennifer Sobeck, Garrett Somers, Diogo Souto, David V. Stark, Keivan G. Stassun, Matthias Steinmetz, Dennis Stello, Thaisa Storchi Bergmann, Michael A. Strauss, Alina Streblyanska, Guy S. Stringfellow, Genaro Suarez, Jing Sun, Manuchehr Taghizadeh-Popp, Baitian Tang, Charling Tao, Jamie Tayar, Mita Tembe, Daniel Thomas, Jeremy Tinker, Rita Tojeiro, Christy Tremonti, Nicholas Troup, Jonathan R. Trump, Eduardo Unda-Sanzana, O. Valenzuela, Remco van den Bosch, Mariana Vargas-Magana, Jose Alberto Vazquez, Sandro Villanova, M. Vivek, Nicole Vogt, David Wake, Rene Walterbos, Yuting Wang, Enci Wang, Benjamin Alan Weaver, Anne-Marie Weijmans, David H. Weinberg, Kyle B. Westfall, David G. Whelan, Eric Wilcots, Vivienne Wild, Rob A. Williams, John Wilson, W. M. Wood-Vasey, Dominika Wylezalek, Ting Xiao, Renbin Yan, Meng Yang, Jason E. Ybarra, Christophe Yeche, Fang-Ting Yuan, Nadia Zakamska, Olga Zamora, Gail Zasowski, Kai Zhang, Cheng Zhao, Gong-Bo Zhao, Zheng Zheng, Zheng Zheng, Zhi-Min Zhou, Guangtun Zhu, Joel C. Zinn, Hu Zou
    Sept. 25, 2017 astro-ph.GA
    The fourth generation of the Sloan Digital Sky Survey (SDSS-IV) began observations in July 2014. It pursues three core programs: APOGEE-2, MaNGA, and eBOSS. In addition, eBOSS contains two major subprograms: TDSS and SPIDERS. This paper describes the first data release from SDSS-IV, Data Release 13 (DR13), which contains new data, reanalysis of existing data sets and, like all SDSS data releases, is inclusive of previously released data. DR13 makes publicly available 1390 spatially resolved integral field unit observations of nearby galaxies from MaNGA, the first data released from this survey. It includes new observations from eBOSS, completing SEQUELS. In addition to targeting galaxies and quasars, SEQUELS also targeted variability-selected objects from TDSS and X-ray selected objects from SPIDERS. DR13 includes new reductions of the SDSS-III BOSS data, improving the spectrophotometric calibration and redshift classification. DR13 releases new reductions of the APOGEE-1 data from SDSS-III, with abundances of elements not previously included and improved stellar parameters for dwarf stars and cooler stars. For the SDSS imaging data, DR13 provides new, more robust and precise photometric calibrations. Several value-added catalogs are being released in tandem with DR13, in particular target catalogs relevant for eBOSS, TDSS, and SPIDERS, and an updated red-clump catalog for APOGEE. This paper describes the location and format of the data now publicly available, as well as providing references to the important technical papers that describe the targeting, observing, and data reduction. The SDSS website, http://www.sdss.org, provides links to the data, tutorials and examples of data access, and extensive documentation of the reduction and analysis procedures. DR13 is the first of a scheduled set that will contain new data and analyses from the planned ~6-year operations of SDSS-IV.
  • Active depth cameras suffer from several limitations, which cause incomplete and noisy depth maps, and may consequently affect the performance of RGB-D Odometry. To address this issue, this paper presents a visual odometry method based on point and line features that leverages both measurements from a depth sensor and depth estimates from camera motion. Depth estimates are generated continuously by a probabilistic depth estimation framework for both types of features to compensate for the lack of depth measurements and inaccurate feature depth associations. The framework models explicitly the uncertainty of triangulating depth from both point and line observations to validate and obtain precise estimates. Furthermore, depth measurements are exploited by propagating them through a depth map registration module and using a frame-to-frame motion estimation method that considers 3D-to-2D and 2D-to-3D reprojection errors, independently. Results on RGB-D sequences captured on large indoor and outdoor scenes, where depth sensor limitations are critical, show that the combination of depth measurements and estimates through our approach is able to overcome the absence and inaccuracy of depth measurements.
  • Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or a simulation environment. We advocate learning a generic vehicle motion model from large scale crowd-sourced video data, and develop an end-to-end trainable architecture for learning to predict a distribution over future vehicle egomotion from instantaneous monocular camera observations and previous vehicle state. Our model incorporates a novel FCN-LSTM architecture, which can be learned from large-scale crowd-sourced vehicle action data, and leverages available scene segmentation side tasks to improve performance under a privileged learning paradigm.
  • Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to "alpha-pooling", allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions made by these approaches. We identify parts of training images having the highest influence on the prediction of a given test image. It allows for justifying decisions to users and also for analyzing the influence of semantic parts. For example, we can show that the higher capacity VGG16 model focuses much more on the bird's head than, e.g., the lower-capacity VGG-M model when recognizing fine-grained bird categories. Both contributions allow us to analyze the difference when moving between average and bilinear pooling. In addition, experiments show that our generalized approach can outperform both across a variety of standard datasets.
  • Spherical hydrodynamic models with a polytropic equation of state (EoS) for forming protostars are revisited in order to investigate the so-called luminosity conundrum highlighted by observations. For a molecular cloud (MC) core with such an EoS with polytropic index $\gamma$ >1, the central mass accretion rate (MAR) decreases with increasing time as a protostar emerges, offering a sensible solution to this luminosity problem. As the MAR decreases, the protostellar luminosity also decreases, meaning that it is invalid to infer the star formation time from the currently observed luminosity using an isothermal model. Furthermore, observations of radial density profiles and the radio continua of numerous MC cores evolving towards protostars also suggest that polytropic dynamic spheres of $\gamma$ > 1 should be used in physical models.
  • Thermal-diffusional pulsation behaviors in planar as well as outwardly and inwardly propagating white dwarf carbon flames are systematically studied. In the 1D numerical simulation, the asymptotic degenerate equation of state and simplified one-step reaction rates for nuclear reactions are used to study the flame propagation and pulsation in white dwarfs. The numerical critical Zel'dovich numbers of planar flames at different densities ($\rho=2$, 3 and 4$\times 10^7$~g/cm$^3$) and of spherical flames (with curvature $c=$-0.01, 0, 0.01 and 0.05) at a particular density ($\rho=2\times 10^7$~g/cm$^3$) are presented. Flame front pulsation in different environmental densities and temperatures are obtained to form the regime diagram of pulsation, showing that carbon flames pulsate in the typical density of $2\times10^7~{\rm g/cm^3}$ and temperature of $0.6\times 10^9~{\rm K}$. While being stable at higher temperatures, at relatively lower temperatures the amplitude of the flame pulsation becomes larger. In outwardly propagating spherical flames the pulsation instability is enhanced and flames are also easier to quench due to pulsation at small radius, while the inwardly propagating flames are more stable.
  • This work proposes a visual odometry method that combines points and plane primitives, extracted from a noisy depth camera. Depth measurement uncertainty is modelled and propagated through the extraction of geometric primitives to the frame-to-frame motion estimation, where pose is optimized by weighting the residuals of 3D point and planes matches, according to their uncertainties. Results on an RGB-D dataset show that the combination of points and planes, through the proposed method, is able to perform well in poorly textured environments, where point-based odometry is bound to fail.