-
Crowd employment is a new form of short-term and flexible employment which
has emerged during the past decade. In order to understand this new form of
employment, it is crucial to illuminate the underlying motivations of the
workforce involved in it. This paper introduces the Multidimensional
Crowdworker Motivation Scale (MCMS), a scale for measuring the motivation of
crowdworkers on micro-task platforms. The MCMS is theoretically grounded in
self-determination theory and tailored specifically to the context of paid
crowdsourced micro-labor. The scale measures the motivation of crowdworkers
along six motivational dimensions, ranging from amotivation to intrinsic
motivation. We validated the MCMS on data collected in ten countries and three
income groups. Factor analyses demonstrated that the MCMS's six dimensions
showed good model fit, validity, and reliability. Furthermore, our measurement
invariance tests showed that motivations measured with the MCMS are comparable
across countries and income groups, and we present a first cross-country
comparison of crowdworker motivations. This work constitutes an important first
step towards understanding the motivations of the international crowd
workforce.
-
Ubiquitous technology platforms have been created to track and improve health
and fitness; similar technologies can help individuals monitor and reduce their
carbon footprints. This paper proposes CarbonKit, a platform combining
technology, markets, and incentives to empower and reward people for reducing
their carbon footprint. We argue that a goal-and-reward behavioral feedback
loop can be combined with the Big Data available from tracked activities, apps,
and social media to make CarbonKit an integral part of individuals' daily lives.
CarbonKit comprises five modules that link personal carbon tracking, health and
fitness, social media, and economic incentives. Protocols for safeguarding
security, privacy, and individuals' control over their own data are essential to
the design of CarbonKit. Initially, CarbonKit would operate on a voluntary
basis, but such a system can also serve as part of a mandatory region-wide
initiative. We use the example of British Columbia to illustrate the
regulatory framework and participating stakeholders that would be required to
support CarbonKit in specific jurisdictions.
-
Characterizing human values is a topic deeply interwoven with the sciences,
humanities, art, and many other human endeavors. In recent years, a number of
thinkers have argued that accelerating trends in computer science, cognitive
science, and related disciplines foreshadow the creation of intelligent
machines which meet and ultimately surpass the cognitive abilities of human
beings, thereby entangling an understanding of human values with future
technological development. Contemporary research accomplishments suggest that
sophisticated AI systems will become widespread and responsible for managing
many aspects of the modern world, from preemptively planning users' travel
schedules and logistics, to fully autonomous vehicles, to domestic robots
assisting in daily living. The extrapolation of these trends has been most forcefully
described in the context of a hypothetical "intelligence explosion," in which
the capabilities of an intelligent software agent would rapidly increase due to
the presence of feedback loops unavailable to biological organisms. The
possibility of superintelligent agents, or simply the widespread deployment of
sophisticated, autonomous AI systems, highlights an important theoretical
problem: the need to separate the cognitive and rational capacities of an agent
from the fundamental goal structure, or value system, which constrains and
guides the agent's actions. The "value alignment problem" is to specify a goal
structure for autonomous agents compatible with human values. In this brief
article, we suggest that recent ideas from affective neuroscience and related
disciplines aimed at characterizing neurological and behavioral universals in
the mammalian class provide important conceptual foundations relevant to
describing human values. We argue that the notion of "mammalian value systems"
points to a potential avenue for fundamental research in AI safety and AI
ethics.
-
We propose a camera-based assistive text reading framework to help blind
persons read text labels and product packaging from hand-held objects in their
daily life. To isolate the object from cluttered backgrounds or other surrounding
objects in the camera view, we first propose an efficient and effective
motion-based method to define a region of interest (ROI) in the video by asking
the user to shake the object. This method extracts the moving object region using
a mixture-of-Gaussians-based background subtraction technique. In the extracted
ROI, text localization and recognition are conducted to acquire text information.
To automatically localize the text regions within the object ROI, we propose a
novel text localization algorithm that learns gradient features of stroke
orientations and distributions of edge pixels in an Adaboost model. Text
characters in the localized text regions are then binarized and recognized by
off-the-shelf optical character recognition (OCR) software. The recognized text
codes are converted into audio output for the blind users. The performance of the
proposed text localization algorithm is quantitatively evaluated on the
ICDAR-2003 and ICDAR-2011 Robust Reading Datasets. Experimental results
demonstrate that our algorithm achieves state-of-the-art performance. The
proof-of-concept prototype is also evaluated on a dataset collected by ten
blind persons to assess the effectiveness of the system. We explore the user
interface issues and robustness of the algorithm in extracting and reading text
from different objects with complex backgrounds.
-
Visual query systems (VQSs) empower users to interactively search for line
charts with desired visual patterns, typically specified using intuitive
sketch-based interfaces. Despite decades of past work on VQSs, these efforts
have not translated to adoption in practice, possibly because VQSs are largely
evaluated in unrealistic lab-based settings. To remedy this gap in adoption, we
collaborated with experts from three diverse domains---astronomy, genetics, and
material science---via a year-long user-centered design process to develop a
VQS that supports their workflow and analytical needs, and evaluate how VQSs
can be used in practice. Our study results reveal that ad-hoc sketch-only
querying is not as commonly used as prior work suggests, since analysts are
often unable to precisely express their patterns of interest. In addition, we
characterize three essential sensemaking processes supported by our enhanced
VQS. We discover that participants employ all three processes, but in different
proportions, depending on the analytical needs in each domain. Our findings
suggest that all three sensemaking processes must be integrated in order to
make future VQSs useful for a wide range of analytical inquiries.
-
Jitter is an inevitable by-product of gaze detection. Because of this, gaze
typing tends to be a slow and frustrating process. In this paper, we propose
SliceType, a soft keyboard that is optimized for gaze input. Our main design
objective is to use the screen area more efficiently by allocating a larger
area to the target keys. We achieve this by determining the keys that will not
be used for the next input, and allocating their space to the adjacent keys
with a merging animation. Larger keys are faster to navigate towards and easier
to dwell on in the presence of eye tracking jitter. As a result, the user types
faster and more comfortably. In addition, we employ a word completion scheme
that complements gaze typing mechanics. A character and a related prediction are
displayed on each key. Dwelling on a key enters the character, and
double-dwelling enters the prediction. While dwelling on a key to enter a
character, the user reads the related prediction effortlessly. The improvements
provided by these features are quantified using Fitts' law. The performance
of the proposed keyboard is compared with two other soft keyboards designed for
gaze typing, Dasher and GazeTalk. Thirty-seven novice users gaze-typed a piece of
text using all three keyboards. The results of the experiment show that the
proposed keyboard allows faster typing and is preferred by the users.
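
As a rough illustration of why larger keys help, the sketch below computes the index of difficulty under the commonly used Shannon formulation of Fitts' law. The abstract does not state which formulation or constants were used, so the formula choice, the function names, and the example distances and widths are all assumptions.

```python
import math


def index_of_difficulty(distance: float, width: float) -> float:
    """Shannon formulation of Fitts' index of difficulty, in bits."""
    if distance < 0 or width <= 0:
        raise ValueError("distance must be >= 0 and width must be > 0")
    return math.log2(distance / width + 1.0)


def movement_time(distance: float, width: float, a: float, b: float) -> float:
    """Predicted movement time MT = a + b * ID.

    a and b are empirical constants fitted per device and user; any values
    passed in here are hypothetical.
    """
    return a + b * index_of_difficulty(distance, width)


# Hypothetical layout: the target key is 7 key-widths away.
id_single = index_of_difficulty(distance=7.0, width=1.0)  # log2(8) = 3 bits
id_merged = index_of_difficulty(distance=7.0, width=2.0)  # key doubled by merging
```

With the same travel distance, doubling the key width lowers the index of difficulty, which is the effect the merging of unused keys aims for.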
-
We introduce a novel approach to visualizing temporal clickstream behaviour
in the context of a degree-satisfying online course, Habitable Worlds, offered
through Arizona State University. The current practice for visualizing
behaviour within a digital learning environment has been to generate plots
based on hand-engineered or coded features using domain knowledge. While this
approach has been effective in relating behaviour to known phenomena, features
crafted from domain knowledge are unlikely to be well suited to making unfamiliar
phenomena salient and thus can preclude discovery. We introduce a methodology
for organically surfacing behavioural regularities from clickstream data,
conducting an expert-in-the-loop hyperparameter search, and identifying
anticipated as well as newly discovered patterns of behaviour. While these
visualization techniques have been used before in the broader machine learning
community to better understand neural networks and relationships between word
vectors, we apply them to online behavioural learner data and go a step
further, exploring the impact of the model's parameters on producing
tangible, non-trivial observations of behaviour that suggest
pedagogical improvements to the course designers and instructors. The
methodology introduced in this paper led to an improved understanding of
passing and non-passing student behaviour in the course and is widely
applicable to other datasets of clickstream activity where investigators and
stakeholders wish to organically surface principal patterns of behaviour.
-
Detecting motor activities from sensor datasets is becoming increasingly
common in a wide range of applications with the rapid commoditization of
wearable sensors. To detect activities, data scientists iteratively experiment
with different classifiers before deciding on a single model. Evaluating,
comparing, and reasoning about prediction results of alternative classifiers is
a crucial step in the process of iterative model development. However, standard
aggregate performance metrics (such as accuracy score) and textual displays of
individual event sequences lack the granularity and scalability needed to
perform this critical step effectively.
To ameliorate these limitations, we introduce Track Xplorer, an interactive
visualization system to query, analyze and compare the classification output of
activity detection in multi-sensor data. Track Xplorer visualizes the results
of different classifiers as well as the ground truth labels and the video of
activities as temporally-aligned linear tracks. Through coordinated track
visualizations, Track Xplorer enables users to interactively explore and
compare the results of different classifiers and to assess their accuracy with
respect to the ground truth labels and video. Users can brush arbitrary regions
of any classifier track, zoom in and out with ease, and play back the
corresponding video segment to contextualize the performance of the classifier
within the selected region.
Track Xplorer also contributes an algebra over track representations to
filter, compose, and compare classification outputs, enabling users to
effectively reason about the performance of classifiers. We demonstrate how our
tool helps data scientists debug misclassifications and improve the prediction
performance in developing activity classifiers for real-world, multi-sensor
data gathered from Parkinson's patients.
-
In mobile crowdsourcing (MCS), mobile users accomplish outsourced human
intelligence tasks. MCS requires an appropriate task assignment strategy, since
different workers may have different performance in terms of acceptance rate
and quality. Task assignment is challenging, since a worker's performance (i)
may fluctuate, depending on both the worker's current personal context and the
task context, and (ii) is not known a priori, but has to be learned over time.
Moreover, learning context-specific worker performance requires access to
context information, which may not be available at a central entity due to
communication overhead or privacy concerns. Additionally, evaluating worker
performance might require costly quality assessments. In this paper, we propose
a context-aware hierarchical online learning algorithm addressing the problem
of performance maximization in MCS. In our algorithm, a local controller (LC)
in the mobile device of a worker regularly observes the worker's context,
her/his decisions to accept or decline tasks and the quality in completing
tasks. Based on these observations, the LC regularly estimates the worker's
context-specific performance. The mobile crowdsourcing platform (MCSP) then
selects workers based on performance estimates received from the LCs. This
hierarchical approach enables the LCs to learn context-specific worker
performance and it enables the MCSP to select suitable workers. In addition,
our algorithm preserves worker context locally, and it keeps the number of
required quality assessments low. We prove that our algorithm converges to the
optimal task assignment strategy. Moreover, the algorithm outperforms simpler
task assignment strategies in experiments based on synthetic and real data.
-
With the mounting global interest in optical see-through head-mounted
displays (OST-HMDs) across medical, industrial and entertainment settings, many
systems with different capabilities are rapidly entering the market. Despite
such variety, they all require display calibration to create a proper mixed
reality environment. With the aid of tracking systems, it is possible to
register rendered graphics with tracked objects in the real world. We propose a
calibration procedure to properly align the coordinate system of a 3D virtual
scene that the user sees with that of the tracker. Our method takes a black-box
approach towards the HMD calibration, where the tracker's data is the input and
the 3D coordinates of a virtual object in the observer's eye are the output; the
objective is thus to find the 3D projection that aligns the virtual content
with its real counterpart. In addition, a faster and more intuitive version of
this calibration is introduced in which the user simultaneously aligns multiple
points of a single virtual 3D object with its real counterpart; this reduces
the number of required repetitions in the alignment from 20 to only 4, which
leads to a much easier calibration task for the user. In this paper, both
internal (HMD camera) and external tracking systems are studied. We perform
experiments with Microsoft HoloLens, taking advantage of its self localization
and spatial mapping capabilities to eliminate the requirement for line of sight
from the HMD to the object or external tracker. The experimental results
indicate an accuracy of up to 4 mm in the average reprojection error based on
two separate evaluation methods. We further perform experiments with the
internal tracking on the Epson Moverio BT-300 to demonstrate that the method
can provide similar results with other HMDs.
-
A large number of statistical decision problems in the social sciences and
beyond can be framed as a (contextual) multi-armed bandit problem. However, it
is notoriously hard to develop and evaluate policies that tackle these types of
problems, and to use such policies in applied studies. To address this issue,
this paper introduces StreamingBandit, a Python web application for developing
and testing bandit policies in field studies. StreamingBandit can sequentially
select treatments using (online) policies in real time. Once StreamingBandit is
implemented in an applied context, different policies can be tested, altered,
nested, and compared. StreamingBandit makes it easy to apply a multitude of
bandit policies for sequential allocation in field experiments, and allows for
the quick development and re-use of novel policies. In this article, we detail
the implementation logic of StreamingBandit and provide several examples of its
use.
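
To make the idea of an online policy that selects a treatment and then learns from the observed outcome more concrete, here is a minimal epsilon-greedy sketch. It is a generic illustration, not StreamingBandit's API: the class name, method names, and parameters are hypothetical.

```python
import random
from typing import List, Optional


class EpsilonGreedy:
    """Minimal epsilon-greedy policy over a fixed set of treatments (arms)."""

    def __init__(self, n_arms: int, epsilon: float = 0.1,
                 seed: Optional[int] = None) -> None:
        self.epsilon = epsilon
        self.counts: List[int] = [0] * n_arms
        self.means: List[float] = [0.0] * n_arms
        self._rng = random.Random(seed)

    def select(self) -> int:
        """Explore a random arm with probability epsilon, else exploit."""
        if self._rng.random() < self.epsilon:
            return self._rng.randrange(len(self.means))
        return max(range(len(self.means)), key=lambda arm: self.means[arm])

    def update(self, arm: int, reward: float) -> None:
        """Incrementally update the running mean reward of the chosen arm."""
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]


# Sequential allocation: one select/update pair per incoming observation.
policy = EpsilonGreedy(n_arms=3, epsilon=0.0, seed=1)
policy.update(1, 1.0)
policy.update(1, 0.0)
policy.update(2, 0.25)
chosen = policy.select()  # purely greedy here because epsilon is 0
```

Splitting the policy into a selection step and an update step mirrors the sequential setting described above, where the outcome of a treatment may arrive some time after the treatment was chosen.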
-
We present an approach for the verification and validation (V&V) of robot
assistants in the context of human-robot interactions (HRI), to demonstrate
their trustworthiness through corroborative evidence of their safety and
functional correctness. Key challenges include the complex and unpredictable
nature of the real world in which assistant and service robots operate, the
limitations on available V&V techniques when used individually, and the
consequent lack of confidence in the V&V results. Our approach, called
corroborative V&V, addresses these challenges by combining several different
V&V techniques; in this paper we use formal verification (model checking),
simulation-based testing, and user validation in experiments with a real robot.
We demonstrate our corroborative V&V approach through a handover task, the most
critical part of a complex cooperative manufacturing scenario, for which we
propose some safety and liveness requirements to verify and validate. We
construct formal models, simulations and an experimental test rig for the HRI.
To capture requirements we use temporal logic properties, assertion checkers
and textual descriptions. This combination of approaches allows V&V of the HRI
task at different levels of modelling detail and thoroughness of exploration,
thus overcoming the individual limitations of each technique. Should the
resulting V&V evidence present discrepancies, an iterative process between the
different V&V techniques takes place until corroboration between the V&V
techniques is gained from refining and improving the assets (i.e., system and
requirement models) to represent the HRI task in a more truthful manner.
Therefore, corroborative V&V affords a systematic approach to 'meta-V&V,' in
which different V&V techniques can be used to corroborate and check one
another, increasing the level of certainty in the results of V&V.
-
Time series and signals are attracting growing attention across statistics,
machine learning, and pattern recognition because they appear widely in industry,
especially in sensor- and IoT-related research and applications. However, few
advances have been achieved in effective time series visual analytics and
interaction, owing to their temporal dimensionality and complex dynamics. Inspired
by recent efforts to use network metrics to characterize time series for
classification, we present an approach to visualize time series as complex
networks based on a first-order Markov process over their temporal ordering. In
contrast to classical bar charts, line plots, and other statistics-based
graphs, our approach delivers a more intuitive visualization that better preserves
both the temporal dependency and frequency structures. It provides a natural
inverse operation to map the graph back to raw signals, making it possible to
use graph statistics to characterize time series for better visual exploration
and statistical analysis. Our experimental results suggest that the approach is
effective on various tasks, such as pattern discovery and classification, on
both synthetic and real time series and sensor data.
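
One way to read the first-order Markov construction is sketched below: values are quantized into bins that act as nodes, and each directed edge carries the empirical transition probability between consecutive bins. The abstract does not specify the binning scheme, so the equal-width bins and the function names are assumptions.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple


def quantize(series: Sequence[float], n_bins: int) -> List[int]:
    """Map each value to one of n_bins equal-width bins (the graph's nodes)."""
    lo, hi = min(series), max(series)
    if hi == lo:
        return [0] * len(series)
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it to the last bin.
    return [min(int((x - lo) / width), n_bins - 1) for x in series]


def transition_graph(states: Sequence[int]) -> Dict[Tuple[int, int], float]:
    """Directed weighted edges: (i, j) -> empirical P(next = j | current = i)."""
    pair_counts = Counter(zip(states, states[1:]))
    out_counts = Counter(states[:-1])
    return {(i, j): c / out_counts[i] for (i, j), c in pair_counts.items()}


series = [0.0, 1.0, 0.0, 1.0, 1.0, 0.0]
states = quantize(series, n_bins=2)
edges = transition_graph(states)
```

Because each node's outgoing weights sum to one, a random walk on the graph can generate a bin sequence, which gives one intuition for how a graph might be mapped back to a signal.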
-
As more scholarly content is born digital or converted to a digital format,
digital libraries are becoming increasingly vital to researchers seeking to
leverage scholarly big data for scientific discovery. Although scholarly
products are available in abundance, especially in environments created by the
advent of social networking services, little is known about international
scholarly information needs, information-seeking behavior, or information use.
The purpose of this paper is to address these gaps via an in-depth analysis of
the information needs and information-seeking behavior of researchers, both
students and faculty, at two universities, one in the U.S. and the other in
Qatar. Based on this analysis, the study identifies and describes new behavior
patterns on the part of researchers as they engage in the information-seeking
process. The analysis reveals that the use of academic social networks has
notable effects on various scholarly activities. Further, this study identifies
differences between students and faculty members in regard to their use of
academic social networks, and it identifies differences between researchers
according to discipline. Although the researchers who participated in the
present study represent a range of disciplinary and cultural backgrounds, the
study reports a number of similarities in terms of the researchers' scholarly
activities.
-
Non-linear dimensionality reduction (NDR) methods such as LLE and t-SNE are
popular with visualization researchers and experienced data analysts, but
present serious problems of interpretation. In this paper, we present
DimReader, a technique that recovers readable axes from such techniques.
DimReader is based on analyzing infinitesimal perturbations of the dataset with
respect to variables of interest. The perturbations define exactly how we want
to change each point in the original dataset and we measure the effect that
these changes have on the projection. The recovered axes are in direct analogy
with the axis lines (grid lines) of traditional scatterplots. We also present
methods for discovering perturbations on the input data that change the
projection the most. The calculation of the perturbations is efficient and
easily integrated into programs written in modern programming languages. We
present results of DimReader on a variety of NDR methods and datasets, both
synthetic and real-life, and show how it can be used to compare different NDR
methods. Finally, we discuss limitations of our proposal and situations where
further research is needed.
-
We present a method to improve the accuracy of a foot-mounted,
zero-velocity-aided inertial navigation system (INS) by varying estimator
parameters based on a real-time classification of motion type. We train a
support vector machine (SVM) classifier using inertial data recorded by a
single foot-mounted sensor to differentiate between six motion types (walking,
jogging, running, sprinting, crouch-walking, and ladder-climbing) and report
mean test classification accuracy of over 90% on a dataset with five different
subjects. From these motion types, we select two of the most common (walking
and running), and describe a method to compute optimal zero-velocity detection
parameters tailored to both a specific user and motion type by maximizing the
detector F-score. By combining the motion classifier with a set of optimal
detection parameters, we show how we can reduce INS position error during mixed
walking and running motion. We evaluate our adaptive system on a total of 5.9
km of indoor pedestrian navigation performed by five different subjects moving
along a 130 m path with surveyed ground truth markers.
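
The idea of tuning detection parameters by maximizing the detector F-score can be illustrated with a simple grid search over one threshold. The abstract does not describe the actual detector or its parameter set, so the scalar test statistic, the "stationary if below threshold" rule, and all names and values here are hypothetical.

```python
from typing import List, Sequence, Tuple


def f_score(predicted: Sequence[bool], truth: Sequence[bool],
            beta: float = 1.0) -> float:
    """F-beta score of binary predictions against ground-truth labels."""
    tp = sum(p and t for p, t in zip(predicted, truth))
    fp = sum(p and not t for p, t in zip(predicted, truth))
    fn = sum(t and not p for p, t in zip(predicted, truth))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def best_threshold(statistic: Sequence[float], truth: Sequence[bool],
                   candidates: Sequence[float]) -> Tuple[float, float]:
    """Return (threshold, F-score) for the candidate with the highest F-score.

    A sample is declared zero-velocity when its statistic is below the threshold.
    """
    scored: List[Tuple[float, float]] = [
        (f_score([s < th for s in statistic], truth), th) for th in candidates
    ]
    best_f, best_th = max(scored)
    return best_th, best_f


# Toy per-sample statistic (e.g. a motion-energy measure) with labels.
statistic = [0.1, 0.2, 0.9, 1.5, 0.3, 2.0]
truth = [True, True, False, False, True, False]
result = best_threshold(statistic, truth, candidates=[0.15, 0.5, 1.0])
```

Running the same search separately on labelled data for each user and motion type would yield one parameter set per (user, motion type) pair, which a motion classifier could then switch between at run time.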
-
This document is meant to help individuals use the Cerebral Signal Phase
Analysis toolbox, which implements different methods for estimating the
instantaneous phase and frequency of a signal and calculating some related
popular quantities. The toolbox, which is distributed under the terms of the
GNU General Public License as a set of MATLAB routines, can be downloaded at
the address http://oset.ir/category.php?dir=Tools. The purpose of this toolbox
is to calculate the instantaneous phase and frequency sequences of cerebral
signals (EEG, MEG, etc.) and some related popular features and quantities in
brain studies and Neuroscience such as Phase Shift, Phase Resetting, Phase
Locking Value (PLV), Phase Difference and more, to help researchers in these
fields.
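
For readers unfamiliar with the Phase Locking Value, the sketch below computes it from two instantaneous phase sequences using the textbook definition, the magnitude of the mean unit phasor of the phase difference. It is written in Python for illustration only and is not one of the toolbox's MATLAB routines; the function name is hypothetical.

```python
import cmath
import math
from typing import Sequence


def phase_locking_value(phase_a: Sequence[float],
                        phase_b: Sequence[float]) -> float:
    """PLV = |mean(exp(i * (phase_a - phase_b)))|, a value between 0 and 1."""
    if len(phase_a) != len(phase_b) or len(phase_a) == 0:
        raise ValueError("phase sequences must be non-empty and of equal length")
    total = sum(cmath.exp(1j * (a - b)) for a, b in zip(phase_a, phase_b))
    return abs(total) / len(phase_a)


# Constant phase lag: the two signals are perfectly locked.
phase_x = [0.1 * k for k in range(100)]
phase_y = [p - 0.7 for p in phase_x]
plv_locked = phase_locking_value(phase_x, phase_y)

# Phase differences spread evenly around the circle: no locking.
spread = [0.0, math.pi / 2, math.pi, 3 * math.pi / 2]
plv_spread = phase_locking_value(spread, [0.0] * 4)
```

A constant lag gives a PLV close to 1 regardless of the size of the lag, while phase differences spread evenly around the circle give a PLV close to 0.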
-
We developed a simulation game to study the effectiveness of decision-makers
in overcoming two complexities in building cybersecurity capabilities:
potential delays in capability development; and uncertainties in predicting
cyber incidents. Analyzing 1,479 simulation runs, we compared the performance
of a group of experienced professionals with that of an inexperienced control
group. Experienced subjects did not understand the mechanisms of delays any
better than inexperienced subjects; however, experienced subjects were better
able to learn the need for proactive decision-making through an iterative
process. Both groups exhibited similar errors when dealing with the uncertainty
of cyber incidents. Our findings highlight the importance of training for
decision-makers with a focus on systems thinking skills, and lay the groundwork
for future research on uncovering mental biases about the complexities of
cybersecurity.
-
Rapport plays an important role in communication because it helps
people understand each other's feelings and ideas and leads to smooth
communication. A computational model of rapport has been proposed in previous
work on theoretical grounds, but it lacks solid verification. In this paper, we
apply structural equation modeling (SEM) to the theoretical model for both friend
and stranger dyads. The results indicate some unfavorable paths. Based on these
results and additional literature, we modify the original model to integrate more
nonverbal behaviors, including gaze and smile. Fit indices and other
tests indicate that our new models fit well and can give us more insight
into rapport management during conversation.
-
As autonomous service robots become more affordable and thus also available
to the general public, there is a growing need for user-friendly interfaces to
control the robotic system. Currently available control modalities typically
expect users to be able to express their desire through either touch, speech or
gesture commands. While this requirement is fulfilled for the majority of
users, paralyzed users may not be able to use such systems. In this paper, we
present a novel framework that allows these users to interact with a robotic
service assistant in a closed-loop fashion, using only thoughts. The
brain-computer interface (BCI) system is composed of several interacting
components, i.e., non-invasive neuronal signal recording and decoding,
high-level task planning, motion and manipulation planning as well as
environment perception. In various experiments, we demonstrate its
applicability and robustness in real world scenarios, considering
fetch-and-carry tasks and tasks involving human-robot interaction. As our
results demonstrate, our system is capable of adapting to frequent changes in
the environment and reliably completing given tasks within a reasonable amount
of time. Combined with high-level planning and autonomous robotic systems,
interesting new perspectives open up for non-invasive BCI-based human-robot
interactions.
-
In the field of tutoring systems, investigations have shown that there are
many tutoring systems tied to a specific domain that, because of their
static architecture, cannot be adapted to other domains. As a consequence, often
neither methods nor knowledge can be reused. In addition, the knowledge
engineer must have programming skills in order to enhance and evaluate the
system. One particular challenge is to tackle these problems with the
development of a generic tutoring system. AnITA, as a stand-alone application,
has been developed and implemented particularly for this purpose. However, in
the testing phase, we discovered that this architecture did not fully match the
user's intuitive understanding of the use of a learning tool. Therefore, AnITA
has been redesigned to work exclusively as a client/server application and
renamed AnITA2. This paper discusses the changes made to the AnITA
tutoring system, the goal of which is to use generic principles for system
re-use in any domain. Two experiments were conducted, and the results are
presented in this paper.
-
This study examines the acceptance of technology and behavioral intention to
use learning management systems (LMS). Specifically, the aim of this research is
to examine whether students ultimately accept and use educational learning
systems such as e-class and the impact of behavioral intention on their
decision to use them. An extended version of the technology acceptance model is
proposed, employing the System Usability Scale to measure
perceived ease of use. A total of 345 university students participated in the
study, and the data analysis was based on the partial least squares method. The
results confirmed most of the research hypotheses. In particular, social norm,
system access and self-efficacy significantly affect behavioral intention to
use. As a result, it is suggested that e-learning developers and stakeholders
should focus on these factors to increase acceptance and effectiveness of
learning management systems.
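
Since the System Usability Scale is used here as the measure of perceived ease of use, a short sketch of the standard SUS scoring rule may help: ten items on a 1 to 5 scale, with odd items contributing (response - 1), even items contributing (5 - response), and the sum scaled by 2.5. How the study fed SUS responses into its model is not stated, so this shows only the conventional 0 to 100 score; the function name is hypothetical.

```python
from typing import Sequence


def sus_score(responses: Sequence[int]) -> float:
    """Score one 10-item SUS questionnaire (1-5 responses) on a 0-100 scale."""
    if len(responses) != 10 or any(r < 1 or r > 5 for r in responses):
        raise ValueError("expected ten responses, each between 1 and 5")
    total = 0
    for item_number, response in enumerate(responses, start=1):
        if item_number % 2 == 1:
            total += response - 1   # odd items are positively worded
        else:
            total += 5 - response   # even items are negatively worded
    return total * 2.5


neutral = sus_score([3] * 10)        # every item answered with the midpoint
best_case = sus_score([5, 1] * 5)    # strongest agreement with positive items
worst_case = sus_score([1, 5] * 5)   # strongest agreement with negative items
```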
-
User Interfaces (UIs) intensively rely on event-driven programming: widgets
send UI events, which capture users' interactions, to dedicated objects called
controllers. Controllers use several UI listeners that handle these events to
produce UI commands. First, we reveal the presence of design smells in the code
that describes and controls UIs. Second, we demonstrate that specific code
analyses are necessary to analyze and refactor UI code, because of its coupling
with the rest of the code. We conducted an empirical study on four large Java
Swing and SWT open-source software systems. We study to what extent the number
of UI commands that a UI listener can produce has an impact on the change- and
fault-proneness of the UI listener code. We develop a static code analysis for
detecting UI commands in the code. We identify a new type of design smell,
called Blob Listener, which characterizes UI listeners that can produce more than
two UI commands. We propose a systematic static code analysis procedure that
searches for Blob Listeners, which we implement in InspectorGuidget. We conducted
experiments on the four software systems for which we manually identified 53
instances of Blob Listener. InspectorGuidget successfully detected 52 Blob
Listeners out of 53. The results exhibit a precision of 81.25% and a recall of
98.11%. We then developed a semi-automatic, behavior-preserving
refactoring process to remove Blob Listeners. 49.06% of the 53 Blob Listeners
were automatically refactored. Patches for JabRef and FreeCol have been
accepted and merged. Discussions with developers of the four software systems
assess the relevance of the Blob Listener. This work shows that UI code also
suffers from design smells that have to be identified and characterized. We
argue that studies have to be conducted to find other UI design smells and that
tools that analyze UI code must be developed.
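
The reported detection figures can be checked with a few lines of arithmetic. The counts of 52 detected and 53 manually identified instances come from the text; the false-positive count of 12 is not stated and is an assumption, chosen because it is the value that reproduces the reported precision of 81.25%.

```python
def precision(true_positives: int, false_positives: int) -> float:
    """Fraction of reported detections that are real instances."""
    return true_positives / (true_positives + false_positives)


def recall(true_positives: int, false_negatives: int) -> float:
    """Fraction of real instances that were detected."""
    return true_positives / (true_positives + false_negatives)


true_positives = 52            # detected Blob Listeners (from the text)
false_negatives = 53 - 52      # one manually identified instance was missed
false_positives = 12           # ASSUMPTION: inferred from the reported precision

p = precision(true_positives, false_positives)   # 52 / 64
r = recall(true_positives, false_negatives)      # 52 / 53
print(f"precision = {p:.2%}, recall = {r:.2%}")
```

Under that assumption the tool would have flagged 64 listeners in total, 52 of them correctly.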
-
Conventional HVAC control systems are usually unaware of the physical
structures and materials of buildings. These systems merely follow pre-set HVAC
control logic based on abstract building thermal response models, which are
rough approximations to true physical models, ignoring dynamic spatial
variations in built environments. To enable more accurate and responsive HVAC
control, this paper introduces the notion of "self-aware" smart buildings, such
that buildings are able to explicitly construct physical models of themselves
(e.g., incorporating building structures and materials, and thermal flow
dynamics). The question is how to enable self-aware buildings that
automatically acquire dynamic knowledge of themselves. This paper presents a
novel approach using "augmented reality". The extensive user-environment
interactions in augmented reality not only can provide intuitive user
interfaces for building systems, but also can capture the physical structures
and possibly materials of buildings accurately to enable real-time building
simulation and control. This paper presents a building system prototype
incorporating augmented reality, and discusses its applications.
-
We investigate grasping of rigid objects in unilateral robot-assisted
minimally invasive surgery (RAMIS) in this paper. We define a human-centered
transparency that quantifies natural action and perception in RAMIS. We
demonstrate this human-centered transparency analysis for different values of
gripper scaling - the scaling between the grasp aperture of the surgeon-side
manipulator and the aperture of the surgical instrument grasper. Thirty-one
participants performed teleoperated grasping and perceptual assessment of rigid
objects in one of three gripper scaling conditions (fine, normal, and quick,
trading off precision and responsiveness). Psychophysical analysis of the
variability of maximal grasping aperture during prehension and of the reported
size of the object revealed that in normal and quick (but not in the fine)
gripper scaling conditions, teleoperated grasping with our system was similar
to natural grasping, and therefore, human-centered transparent. We anticipate
that using motor control and psychophysics for human-centered optimization of
teleoperation control will eventually improve the usability of RAMIS.