• Ultrahigh-power terahertz (THz) radiation sources are essential for many applications, such as nonlinear THz physics and THz-wave-based compact accelerators. However, until now none of the THz sources reported, whether based on large-scale accelerators or high-power lasers, has produced THz pulses with energies above the millijoule (mJ) barrier. Here we report on the efficient generation of low-frequency (<3 THz) pulses with unprecedentedly high energies of over 50 mJ. The THz radiation is produced by coherent transition radiation from an ultra-bright bunch of relativistic electrons accelerated from a solid target by a picosecond laser. Such high-energy THz pulses can not only trigger various nonlinear dynamics in matter, but also open up a new research field of relativistic THz optics.
  • When processing large amounts of data, the rate at which reading and writing can take place is a critical factor. High energy physics data processing relying on ROOT is no exception. The recent parallelisation of the LHC experiments' software frameworks and the analysis of the ever-increasing amount of collision data collected by the experiments have further emphasized this issue, underlining the need to increase the implicit parallelism expressed within ROOT I/O. In this contribution we highlight the improvements to the ROOT I/O subsystem that target satisfactory scaling behaviour in a multithreaded context. The effect of parallelism on the individual steps that ROOT chains together to read and write data, namely (de)compression, (de)serialisation, and access to the storage backend, is discussed. Performance measurements are presented through real-life examples from CMS production workflows on traditional server platforms and on highly parallel architectures such as Intel Xeon Phi.
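As a rough illustration of why the decompression step lends itself to parallelism, the sketch below decompresses several independent buffers on a thread pool. It uses only Python's standard library and does not use or imitate the ROOT API; the buffer contents, the helper name `decompress_all`, and the worker count are assumptions made for the example.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor


def decompress_all(buffers, max_workers=4):
    """Decompress independent zlib buffers on a thread pool.

    Hypothetical helper: each buffer stands in for one independently
    compressed block of data. Results come back in input order.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(zlib.decompress, buffers))


# Made-up payloads: eight independent chunks, compressed one by one.
chunks = [bytes([i]) * 10000 for i in range(8)]
compressed = [zlib.compress(chunk) for chunk in chunks]
restored = decompress_all(compressed)
```

Because each buffer is self-contained, no coordination between workers is needed beyond collecting the results in order.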
  • The dependence of the mean kinetic energy of laser-accelerated electrons on the laser intensity, the so-called ponderomotive scaling, was derived theoretically by considering the motion of a single electron in oscillating laser fields. This scaling explains well the experimental results obtained with high-intensity pulses of durations shorter than a picosecond; however, it is no longer applicable to experiments at multi-picosecond (multi-ps) facilities. Here, we experimentally clarified the generation of super-ponderomotive relativistic electrons (SP-REs) through multi-ps relativistic laser-plasma interactions using prepulse-free LFEX laser pulses that were realized using a plasma mirror (PM). The SP-REs are produced by direct laser acceleration assisted by the self-generated quasi-static electric field and by loop-injected direct acceleration by the self-generated quasi-static magnetic field, both of which grow in a blowout plasma heated by a multi-ps laser pulse. Finally, we theoretically derive the threshold pulse duration to boost the acceleration of REs, which provides an important insight into the determination of laser pulse duration at kilojoule-petawatt laser facilities.
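For orientation, one commonly quoted form of the ponderomotive scaling is T = m_e c^2 (sqrt(1 + I λ^2 / 1.37×10^18) − 1), with I in W cm^-2 and λ in µm. The abstract does not state which form the authors use, so the sketch below is an assumption for illustration only; the function name and the rounded rest energy are also choices made here.

```python
import math

# Electron rest energy m_e c^2 in keV (rounded value, assumed for this sketch).
ELECTRON_REST_ENERGY_KEV = 511.0


def ponderomotive_energy_kev(intensity_w_cm2: float, wavelength_um: float) -> float:
    """Mean electron energy from a commonly quoted ponderomotive scaling.

    T = m_e c^2 * (sqrt(1 + I * lambda^2 / 1.37e18) - 1),
    with I in W/cm^2 and lambda in micrometres.
    """
    i_lambda_sq = intensity_w_cm2 * wavelength_um ** 2
    return ELECTRON_REST_ENERGY_KEV * (math.sqrt(1.0 + i_lambda_sq / 1.37e18) - 1.0)


# Example evaluation at two intensities for a 1 um wavelength.
low = ponderomotive_energy_kev(1.0e18, 1.0)
high = ponderomotive_energy_kev(1.0e19, 1.0)
```

Electrons with energies well above this estimate are the "super-ponderomotive" population the abstract refers to.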
  • In this paper, we present the PerceptIn Visual Inertial Odometry (PI-VIO), a tightly-coupled filtering-based stereo VIO system using both points and lines. Line features help improve system robustness in challenging scenarios where point features cannot be reliably detected or tracked, e.g. low-texture environments or lighting changes. In addition, we propose a new lightweight filtering-based loop closing technique to reduce accumulated drift without global bundle adjustment. We formulate loop closure as EKF updates to optimally relocate the current sliding window maintained by the filter to past keyframes. We also present the PerceptIn Ironsides dataset, a new visual-inertial dataset featuring high-quality synchronized stereo camera and IMU data from the Ironsides sensor, with various motion types and textures and millimeter-accuracy groundtruth. To validate the performance of the proposed system, we conduct an extensive comparison with state-of-the-art approaches (OKVIS, VINS-MONO and S-MSCKF) using both the public EuRoC dataset and the PerceptIn Ironsides dataset.
  • Enabling full robotic workloads with diverse behaviors on mobile systems with stringent resource and energy constraints remains a challenge. In recent years, attempts have been made to deploy single-accelerator-based computing platforms (such as GPU, DSP, or FPGA) to address this challenge, but with little success. The core problem is two-fold: firstly, different robotic tasks require different accelerators, and secondly, managing multiple accelerators simultaneously is overwhelming for developers. In this paper, we propose PIRT, the first robotic runtime framework to efficiently manage dynamic task executions on mobile systems with multiple accelerators as well as on the cloud to achieve better performance and energy savings. With PIRT, we enable a robot to simultaneously perform autonomous navigation with 25 FPS of localization, obstacle detection with 3 FPS, route planning, large map generation, and scene understanding, traveling at a max speed of 5 miles per hour, all within an 11W computing power envelope.
  • In this paper, we present the PerceptIn Robotics Vision System (PIRVS), a visual-inertial computing hardware platform with an embedded simultaneous localization and mapping (SLAM) algorithm. The PIRVS hardware is equipped with a multi-core processor, a global-shutter stereo camera, and an IMU with precise hardware synchronization. The PIRVS software features a novel and flexible sensor fusion approach that not only tightly integrates visual measurements with inertial measurements but also loosely couples with additional sensor modalities. It runs in real time on both PC and the PIRVS hardware. We perform a thorough evaluation of the proposed system using multiple public visual-inertial datasets. Experimental results demonstrate that our system reaches accuracy comparable to that of state-of-the-art visual-inertial algorithms on PC, while being more efficient on the PIRVS hardware.
  • Big Data query systems represent data in a columnar format for fast, selective access, and in some cases (e.g. Apache Drill), perform calculations directly on the columnar data without row materialization, avoiding runtime costs. However, many analysis procedures cannot be easily or efficiently expressed as SQL. In High Energy Physics, the majority of data processing requires nested loops with complex dependencies. When faced with tasks like these, the conventional approach is to convert the columnar data back into an object form, usually with a performance price. This paper describes a new technique to transform procedural code so that it operates on columnar data natively, without row materialization. It can be viewed as a compiler pass on the typed abstract syntax tree, rewriting references to objects as columnar array lookups. We will also present performance comparisons between transformed code and conventional object-oriented code in a High Energy Physics context.
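To make the idea of rewriting object references as columnar array lookups concrete, the sketch below shows the same nested loop written twice: once over row-style objects and once over a flat attribute array with an offsets array. It is a hand-written illustration, not the compiler pass described in the abstract; the names `muon_pt` and `offsets` and the sample values are hypothetical.

```python
# Row (object) form: each event holds a list of muon records.
events = [
    [{"pt": 31.5}, {"pt": 12.0}],
    [],
    [{"pt": 45.2}],
]


def max_pt_objects(events):
    """Highest muon pt per event, written against objects."""
    out = []
    for event in events:
        best = None
        for muon in event:
            if best is None or muon["pt"] > best:
                best = muon["pt"]
        out.append(best)
    return out


# Columnar form: one flat array per attribute, plus offsets marking
# where each event's muons start and stop in that array.
muon_pt = [31.5, 12.0, 45.2]
offsets = [0, 2, 2, 3]


def max_pt_columnar(muon_pt, offsets):
    """Same logic, with `muon["pt"]` rewritten as the lookup `muon_pt[j]`."""
    out = []
    for i in range(len(offsets) - 1):
        best = None
        for j in range(offsets[i], offsets[i + 1]):
            if best is None or muon_pt[j] > best:
                best = muon_pt[j]
        out.append(best)
    return out
```

The loop structure is unchanged; only the object references become index arithmetic, so no per-event objects need to be materialized.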
  • We present a novel subset scan method to detect if a probabilistic binary classifier has statistically significant bias -- over or under predicting the risk -- for some subgroup, and identify the characteristics of this subgroup. This form of model checking and goodness-of-fit test provides a way to interpretably detect the presence of classifier bias or regions of poor classifier fit. This allows consideration of not just subgroups of a priori interest or small dimensions, but the space of all possible subgroups of features. To address the difficulty of considering these exponentially many possible subgroups, we use subset scan and parametric bootstrap-based methods. Extending this method, we can penalize the complexity of the detected subgroup and also identify subgroups with high classification errors. We demonstrate these methods and find interesting results on the COMPAS crime recidivism and credit delinquency data.
  • ROOT provides a flexible format used throughout the HEP community. The number of use cases, from an archival data format to end-stage analysis, has required a number of tradeoffs to be exposed to the user. For example, a high "compression level" in the traditional DEFLATE algorithm will result in a smaller file (saving disk space) at the cost of slower decompression (costing CPU time when read). At the scale of the LHC experiments, poor design choices can result in terabytes of wasted space or wasted CPU time. We explore and attempt to quantify some of these tradeoffs. Specifically, we explore: the use of alternative compression algorithms to optimize for read performance; an alternative method of compressing individual events to allow efficient random access; and a new approach to whole-file compression. Quantitative results are given, as well as guidance on how to make compression decisions for different use cases.
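The "compression level" knob can be explored with any DEFLATE implementation. The sketch below uses Python's standard `zlib` module, not ROOT, to record the compressed size at a few levels; the payload is made up for the example and `compare_levels` is a hypothetical helper. Timing could be added with `time.perf_counter` in the same loop.

```python
import zlib


def compare_levels(payload: bytes, levels=(1, 6, 9)) -> dict:
    """Return the compressed size in bytes for each DEFLATE level."""
    sizes = {}
    for level in levels:
        compressed = zlib.compress(payload, level)
        # Round-trip check: every level must restore the original bytes.
        assert zlib.decompress(compressed) == payload
        sizes[level] = len(compressed)
    return sizes


# Made-up, text-like payload standing in for serialized event data.
payload = b"".join(
    f"event={i};pt={i % 97}.{i % 13};eta={i % 5}\n".encode() for i in range(20000)
)
sizes = compare_levels(payload)
```

How much the size changes between levels depends strongly on the data, which is why measurements on representative files are needed before choosing a setting.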
  • The rise of robotic applications has led to the generation of a huge volume of unstructured data, whereas the current cloud infrastructure was designed to process limited amounts of structured data. To address this problem, we propose a learn-memorize-recall-reduce paradigm for robotic cloud computing. The learning stage converts incoming unstructured data into structured data; the memorization stage provides effective storage for the massive amount of data; the recall stage provides efficient means to retrieve the raw data; while the reduction stage provides means to make sense of this massive amount of unstructured data with limited computing resources.
  • We describe the computing tasks involved in autonomous driving and examine existing autonomous driving computing platform implementations. To enable autonomous driving, the computing stack needs to simultaneously provide high performance, low power consumption, and low thermal dissipation, at low cost. We discuss possible approaches to designing computing platforms that will meet these needs.
  • Nuclear fusion reactions are the most important processes in nature to power stars and produce new elements, and lie at the center of the understanding of nucleosynthesis in the universe. It is critically important to study the reactions in full plasma environments that are close to true astrophysical conditions. By using laser-driven counter-streaming collisionless plasmas, we studied the fusion reaction $\mathrm{D} + \mathrm{D} \rightarrow n + {}^{3}\mathrm{He}$ in a Gamow-like window around 27 keV. The results show that the astrophysical nuclear reaction yield can be modulated significantly by the self-generated electromagnetic fields and the collective motion of the plasma. This plasma-version mini-collider may provide a novel tool for studying nuclear reactions of astrophysical interest in plasma with tunable energies in Earth-based laboratories.
  • Two-dimensional (2D) semiconductor materials of transition-metal dichalcogenides (TMDCs) manifest many peculiar physical phenomena in light-matter interaction. Owing to their ultrathin nature, strong interaction with light, and robust excitons at room temperature, they provide a perfect platform for studying the physics of strong coupling in low dimensions and at room temperature. Here we report strong coupling between 2D semiconductor excitons and Tamm plasmon polaritons (TPPs). We observe a Rabi splitting of about 54 meV at room temperature by measuring the angle-resolved differential reflectivity spectra, and we reproduce the results theoretically using the transfer matrix method. Our results will promote the realization of TPP-based ultrathin polariton devices at room temperature.
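A Rabi splitting of this kind is often interpreted with a two-coupled-oscillator model. The block below is a textbook sketch, not the transfer matrix calculation used in the abstract. The symbols $E_X$ (exciton energy), $E_T$ (TPP energy), $g$ (coupling strength), and $\delta$ (detuning) are introduced here for illustration, and linewidths are neglected.

```latex
E_{\pm} = \frac{E_X + E_T}{2} \pm \sqrt{g^{2} + \frac{\delta^{2}}{4}},
\qquad \delta = E_T - E_X .
% At zero detuning the two polariton branches are separated by
\hbar\Omega_R = E_{+} - E_{-} \Big|_{\delta = 0} = 2g ,
% so a splitting of about 54 meV would correspond to g of about 27 meV
% in this simplified, lossless picture.
```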
  • A new, simple mechanism for producing strong magnetic fields through cold electron flow is proposed. A 600-T magnetic field is generated in free space at a laser intensity of $5.7 \times 10^{15}$ W cm$^{-2}$. Theoretical analysis indicates that the magnetic field strength is proportional to the laser intensity. Such a strong magnetic field offers a new experimental test bed for studying laser-plasma physics, in particular fast-ignition laser fusion research and laboratory astrophysics.
  • As the Internet changes from a network of data into a network of functionalities, a federated Internet of applications, in which every application can cooperate smoothly with the others, is a natural trend. However, existing integration techniques do not pay enough attention to the multiple control domains of the participants, i.e. application providers and end users. In this study, we advocate a global cooperation model in which every participant counts. In particular, we propose a hybrid model to manage the cooperation among applications and achieve a better allocation of effort, so that users perform lighter actions and application providers deal with less uncontrollable information. In addition, we implement the required system and present a case study that demonstrates the effectiveness of this model.
  • An unusual reentrant phenomenon is observed in the anisotropic 3-state Potts model on a generalized Kagome lattice. By employing the linearized tensor renormalization group method, we find that the reentrance can appear not only under a partially ordered phase, as is commonly known, but also under a phase without a local order parameter, which is found to fall into the universality class of the Kosterlitz-Thouless (KT) type. The region of the reentrance depends strongly on the ratios of the next-nearest couplings $\alpha = J_2/|J_1|$ and $\beta = J_3/|J_1|$. The phase diagrams in the plane of temperature versus $\beta$ for different $\alpha$ are obtained. Through extensive calculations, it is also revealed that the quasi-entanglement entropy can be used to accurately detect the KT transition temperature.
  • We propose a novel algorithm based on the optimized decimation of tensor networks with super-orthogonalization (ODTNS) that can efficiently and accurately simulate not only the thermodynamic but also the ground-state properties of two-dimensional (2D) quantum lattice models. By transforming the 2D quantum model into a three-dimensional (3D) closed tensor network (TN) comprising the tensor product density operator and a 3D brick-wall TN, the free energy of the system can be calculated with imaginary time evolution, in which the network Tucker decomposition is suggested for the first time to obtain the optimal lower-dimensional approximation on the bond space by transforming the TN into a super-orthogonal form. The efficiency and accuracy of this algorithm are verified and found to be comparable with those of quantum Monte Carlo calculations. In addition, the present ODTNS scheme is also applicable to 2D frustrated quantum spin models with good efficiency.