


default search action
BigData Conference 2017: Boston, MA, USA
- Jian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda:
2017 IEEE International Conference on Big Data (IEEE BigData 2017), Boston, MA, USA, December 11-14, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-2715-0 - Carla E. Brodley:
Human-in-the-loop applied machine learning. 1 - Alan Edelman:
A more open efficient future for AI development and data science with an introduction to Julia. 2 - John Langford:
Contextual reinforcement learning. 3 - Jure Leskovec:
Large-scale graph representation learning. 4 - Satoshi Matsuoka:
Being "BYTES-oriented" in HPC leads to an open big data/AI ecosystem and further advances into the post-moore era. 5 - ChengXiang Zhai:
TextScope: Enhance human perception via text mining. 6 - Feng Chen, Chunpai Wang, Jin-Hee Cho:
Collective subjective logic: Scalable uncertainty-based opinion inference. 7-16 - Natascha Harth
, Christos Anagnostopoulos
:
Quality-aware aggregation & predictive analytics at the edge. 17-26 - Sheng Li, Yun Fu:
Robust multi-label semi-supervised classification. 27-36 - Xiaoli Li, Sai Nivedita Chandrasekaran, Jun Huan:
Lifelong multi-task multi-view learning using latent spaces. 37-46 - Natalia Ponomareva, Thomas Colthurst, Gilbert Hendry, Salem Haykal, Soroush Radpour:
Compact multi-class boosted trees. 47-56 - Daniel Yue Zhang, Dong Wang, Yang Zhang:
Constraint-aware dynamic truth discovery in big data social media sensing. 57-66 - Peter Baumann
:
Standardizing big earth datacubes. 67-73 - Salima Benbernou, Mourad Ouziri:
Enhancing data quality by cleaning inconsistent big RDF data. 74-79 - Byron J. Gao, Robert Tung, Yong Yang:
Iterative matrix correlation for bisection clustering. 80-87 - Diego Granziol, Stephen J. Roberts:
Entropic determinants of massive matrices. 88-93 - Er-Chen Huang, Hsing-Kuo Pao, Yuh-Jye Lee
:
Big active learning. 94-101 - Hasan Kurban
, Mehmet M. Dalkilic:
A novel approach to optimization of iterative machine learning algorithms: Over heap structure. 102-109 - Sheng Li, Hongfu Liu, Zhiqiang Tao, Yun Fu:
Multi-view graph learning with adaptive label propagation. 110-115 - Christian S. Schmid, Bruce A. Desmarais:
Exponential random graph models with big networks: Maximum pseudolikelihood estimation and the parametric bootstrap. 116-121 - Sam Wood, Rohit Muthyala, Yi Jin, Yixing Qin, Nilaj Rukadikar, Amit Rai, Hua Gao:
Automated industry classification with deep learning. 122-129 - Jonghyun Bae, Hakbeom Jang, Wenjing Jin, Jun Heo, Jaeyoung Jang, Joo Young Hwang, Sangyeun Cho, Jae W. Lee:
Jointly optimizing task granularity and concurrency for in-memory mapreduce frameworks. 130-140 - Nathanael Cheriere
, Gabriel Antoniu:
How fast can one scale down a distributed file system? 141-150 - Thomas Swearingen, Will Drevo, Bennett Cyphers, Alfredo Cuesta-Infante
, Arun Ross, Kalyan Veeramachaneni:
ATM: A distributed, collaborative, scalable system for automated machine learning. 151-162 - Ioannis Giannakopoulos, Dimitrios Tsoumakos, Nectarios Koziris:
A decision tree based approach towards adaptive modeling of big data applications. 163-172 - Shashank Gugnani, Xiaoyi Lu, Houliang Qi, Li Zha, Dhabaleswar K. Panda:
Characterizing and accelerating indexing techniques on distributed ordered tables. 173-182 - Yuki Ito, Ryo Matsumiya, Toshio Endo:
ooc_cuDNN: Accommodating convolutional neural networks over GPU memory capacity. 183-192 - HyeongSik Kim, Padmashree Ravindra, Kemafor Anyanwu
:
A semantics-aware storage framework for scalable processing of knowledge graphs on Hadoop. 193-202 - Konstantinos Lolos, Ioannis Konstantinou
, Verena Kantere, Nectarios Koziris:
Elastic management of cloud applications using adaptive reinforcement learning. 203-212 - Xiaoyi Lu, Haiyang Shi, Dipti Shankar, Dhabaleswar K. Panda:
Performance characterization and acceleration of big data workloads on OpenPOWER system. 213-222 - Diego Marron, Eduard Ayguadé, José R. Herrero, Jesse Read, Albert Bifet
:
Low-latency multi-threaded ensemble learning for dynamic big data streams. 223-232 - Arnab Kumar Paul
, Arpit Goyal, Feiyi Wang, Sarp Oral
, Ali Raza Butt
, Michael J. Brim
, Sangeetha B. Srinivasa:
I/O load balancing for big data HPC applications. 233-242 - Bo Peng, Bingjing Zhang, Langshi Chen, Mihai Avram, Robert Henschel, Craig A. Stewart, Shaojuan Zhu, Emily McCallum, Lisa Smith, Tom Zahniser, Jon Omer, Judy Qiu:
HarpLDA+: Optimizing latent dirichlet allocation for parallel efficiency. 243-252 - Jim Pivarski, Peter Elmer, Brian Bockelman
, Zhe Zhang:
Fast access to columnar, hierarchically nested data via code transformation. 253-262 - Alex Watson, Deepigha Shree Vittal Babu, Suprio Ray:
Sanzu: A data science benchmark. 263-272 - Luna Xu, Seung-Hwan Lim, Min Li, Ali Raza Butt
, Ramakrishnan Kannan:
Scaling up data-parallel analytics platforms: Linear algebraic operation cases. 273-282 - Xiaodong Yu, Kaixi Hou, Hao Wang, Wu-chun Feng:
Robotomata: A framework for approximate pattern matching of big data on an automata processor. 283-292 - Yunming Zhang, Vladimir Kiriansky, Charith Mendis, Saman P. Amarasinghe, Matei Zaharia
:
Making caches work for graph analytics. 293-302 - Bilal Akil, Ying Zhou, Uwe Röhm
:
On the usability of Hadoop MapReduce, Apache Spark & Apache flink for data science. 303-310 - Mohammed M. Alawad
, Hong-Jun Yoon, Georgia D. Tourassi:
Energy efficient stochastic-based deep spiking neural networks for sparse datasets. 311-318 - Lars Arge, Mathias Rav, Svend C. Svendsen, Jakob Truelsen:
External memory pipelining made easy with TPIE. 319-324 - Dapeng Dong
, John Herbert:
Compressed domain-specific data processing and analysis. 325-330 - Celestine Dünner, Thomas P. Parnell, Kubilay Atasu
, Manolis Sifalakis
, Haralampos Pozidis:
Understanding and optimizing the performance of distributed machine learning applications on apache spark. 331-338 - Xiao Meng, Lukasz Golab:
Optimal reducer placement to minimize data transfer in MapReduce-style processing. 339-346 - Michael Mercier, David Glesser, Yiannis Georgiou, Olivier Richard:
Big data and HPC collocation: Using HPC idle resources for Big Data analytics. 347-352 - Axel Oehmichen
, Florian Guitton
, Kai Sun, Jean Grizet, Thomas Heinis, Yike Guo:
eTRIKS analytical environment: A modular high performance framework for medical data analysis. 353-360 - Ilia Pietri
, Yannis Chronis, Yannis E. Ioannidis:
Multi-objective optimization of scheduling dataflows on heterogeneous cloud resources. 361-368 - Md. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Dhabaleswar K. Panda:
NVMD: Non-volatile memory assisted design for accelerating MapReduce and DAG execution frameworks on HPC systems. 369-374 - Xinhui Tian, Yuanqing Guo, Jianfeng Zhan, Lei Wang:
Towards memory and computation efficient graph processing on spark. 375-382 - Alexander Ulanov, Manish Marwah, Mijung Kim, Roshan Dathathri, Carlos Zubieta, Jun Li:
Sandpiper: Scaling probabilistic inferencing to large scale graphical models. 383-388 - Nikos Zacheilas, Stathis Maroulis, Vana Kalogeraki
:
Dione: Profiling spark applications exploiting graph similarity. 389-394 - Mohammad Asghari, Cyrus Shahabi:
On on-line task assignment in spatial crowdsourcing. 395-404 - Ilir Fetai, Alexander Stiemer
, Heiko Schuldt
:
QuAD: A quorum protocol for adaptive data management in the cloud. 405-414 - Valérie Hayot-Sasson, Yongping Gao, Yuhong Yan, Tristan Glatard:
Sequential algorithms to split and merge ultra-high resolution 3D images. 415-424 - Shahab Helmi, Farnoush Banaei Kashani:
Spatiotemporal range pattern queries on large-scale co-movement pattern datasets. 425-434 - Srinivasan Venkatramanan, Sichao Wu, Bowen Shi, Achla Marathe, Madhav V. Marathe, Stephen G. Eubank
, Lalit P. Sah, A. P. Giri, Luke A. Colavito, K. S. Nitin, V. Sridhar, R. Asokan, Rangaswamy Muniappan, G. Norton, Abhijin Adiga:
Towards robust models of food flows and their role in invasive species spread. 435-444 - Juan A. Colmenares, Reza Dorrigiv, Daniel G. Waddington:
A single-node datastore for high-velocity multidimensional sensor data. 445-452 - Isabelle Comyn-Wattiau
, Jacky Akoka:
Model driven reverse engineering of NoSQL property graph databases: The case of Neo4j. 453-458 - Helge Holzmann, Vinay Goel, Emily Novak Gustainis:
Universal distant reading through metadata proxies with archivespark. 459-464 - Md. S. Q. Zulkar Nine, Kemal Guner
, Ziyun Huang
, Xiangyu Wang, Jinhui Xu, Tevfik Kosar
:
Big data transfer optimization based on offline knowledge discovery and adaptive sampling. 465-472 - Ramyar Saeedi, Skyler Norgaard, Assefaw Hadish Gebremedhin:
A closed-loop deep learning architecture for robust activity recognition using wearable sensors. 473-479 - Haiying Shen, Heng Zhou:
CStorage: An efficient classification-based image storage system in cloud datacenters. 480-485 - Dingwen Tao
, Sheng Di, Zizhong Chen
, Franck Cappello:
In-depth exploration of single-snapshot lossy compression techniques for N-body simulations. 486-493 - Xian Wu, Yuxiao Dong, Jun Tao, Chao Huang, Nitesh V. Chawla
:
Reliable fake review detection via modeling temporal and behavioral patterns. 494-499 - Masahiro Yokoyama, Takahiro Hara, Sanjay Kumar Madria:
Efficient diversified set monitoring for mobile sensor stream environments. 500-507 - Yangwen Yu, James Jian Qiao Yu
, Victor O. K. Li, Jacqueline C. K. Lam:
Low-rank singular value thresholding for recovering missing air quality data. 508-513 - Lina Yu, Michael L. Rilee, Yu Pan, Feiyu Zhu, Kwo-Sen Kuo, Hongfeng Yu:
Visual analytics with unparalleled variety scaling for big earth data. 514-521 - Ming Zeng, Tong Yu, Xiao Wang, Le T. Nguyen, Ole J. Mengshoel, Ian R. Lane:
Semi-supervised convolutional neural networks for human activity recognition. 522-529 - Xibo Zhou, Ye Ding, Fengchao Peng, Qiong Luo
, Lionel M. Ni:
Detecting unmetered taxi rides from trajectory data. 530-535 - Giambattista Amati
, Simone Angelini, Giorgio Gambosi, Gianluca Rossi
, Paola Vocca
:
Estimation of distance-based metrics for very large graphs with MinHash Signatures. 536-545 - Philipp Baumann, Dorit S. Hochbaum, Quico Spaen:
High-performance geometric algorithms for sparse computation in big data analytics. 546-555 - Sreyasee Das Bhattacharjee, Ashit Talukder
, Bala Venkatram Balantrapu:
Active learning based news veracity detection with feature weighting and deep-shallow fusion. 556-565 - Chandramani Chaudhary, Poonam Goyal
, Yi-Ping Phoebe Chen
:
Exploiting visual and textual neighborhood information to improve image-tag relevance. 566-575 - Limeng Cui, Jiawei Zhang, Zhensong Chen, Yong Shi, Philip S. Yu:
Inverse extreme learning machine for learning with label proportions. 576-585 - Vachik S. Dave
, Nesreen K. Ahmed, Mohammad Al Hasan:
E-CLoG: Counting edge-centric local graphlets. 586-595 - Bo Dong, Yifan Li, Yang Gao, Ahsanul Haque, Latifur Khan
, Mohammad M. Masud:
Multistream regression with asynchronous concept drift detection. 596-605 - Roohollah Etemadi, Jianguo Lu:
Bias correction in clustering coefficient estimation. 606-615 - Guyue Han, Harish Sethu:
Closed walk sampler: An efficient method for estimating the spectral radius of large graphs. 616-625 - Jun Hu, Yuxin Wang, Ping Li:
Online city-scale hyper-local event detection via analysis of social media and human mobility. 626-635 - Jianfeng Jia, Chen Li, Michael J. Carey:
Drum: A rhythmic approach to interactive analytics on large data. 636-645 - Ryoya Kaneko, Kohei Miyaguchi, Kenji Yamanishi
:
Detecting changes in streaming data with information-theoretic windowing. 646-655 - Foteini Katsarou, Nikos Ntarmos
, Peter Triantafillou:
Hybrid algorithms for subgraph pattern queries in graph databases. 656-665 - Sarasi Lalithsena, Sujan Perera, Pavan Kapanipathi, Amit P. Sheth:
Domain-specific hierarchical subgraph extraction: A recommendation use case. 666-675 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
COEUS: Community detection via seed-set expansion on graph streams. 676-685 - Panagiotis Liakos, Alexandros Ntoulas, Alex Delis:
Rhea: Adaptively sampling authoritative content from social activity streams. 686-695 - Ismini Lourentzou, Alex Morales, ChengXiang Zhai:
Text-based geolocation prediction of social media users with neural networks. 696-705 - Alessandro Lulli, Luca Oneto
, Davide Anguita
:
Crack random forest for arbitrary large datasets. 706-715 - Suchismit Mahapatra
, Varun Chandola:
S-Isomap++: Multi manifold learning from streaming data. 716-725 - Sheikh Motahar Naim, Arnold P. Boedihardjo, Mahmud Shahriar Hossain
:
A scalable model for tracking topical evolution in large document collections. 726-735 - Mehrnaz Najafi, Lifang He
, Philip S. Yu:
Error-robust multi-view clustering. 736-745 - Axel-Cyrille Ngonga Ngomo
, Michael Hoffmann, Ricardo Usbeck
, Kunal Jha:
Holistic and scalable ranking of RDF data. 746-755 - Haekyu Park, Jinhong Jung, U Kang:
A comparative study of matrix factorization and random walk with restart in recommender systems. 756-765 - Chao Shang, Aaron Palmer, Jiangwen Sun, Ko-Shin Chen, Jin Lu
, Jinbo Bi:
VIGAN: Missing view imputation with generative adversarial networks. 766-775 - Lorenzo De Stefani, Erisa Terolli, Eli Upfal:
Tiered sampling: An efficient method for approximate counting sparse motifs in massive graph streams. 776-786 - Cheng-Chin Tu, Mi-Yen Yeh
, Tei-Wei Kuo
:
A fast non-volatile memory aware algorithm for generating random scale-free networks. 787-796 - Nguyen Vo
, Kyumin Lee, Thanh Tran:
MRAttractor: Detecting communities from large-scale graphs. 797-806 - Yueyao Wang, Qinmin Hu, Yang Song, Liang He:
Potentiality of healthcare big data: Improving search by automatic query reformulation. 807-816 - Ichitaro Yamazaki, Stanimire Tomov
, Jack J. Dongarra:
Sampling algorithms to update truncated SVD. 817-826 - Yizhou Yan, Lei Cao
, Elke A. Rundensteiner:
Distributed Top-N local outlier detection in big data. 827-836 - Tong Yang, Binchao Yin, Hang Li, Muhammad Shahzad, Steve Uhlig, Bin Cm, Xiaoming Li:
Rectangular hash table: Bloom filter and bitmap assisted hash table with high speed. 837-846 - Xinli Yu, Zheng Chen, Wei-Shih Yang, Xiaohua Hu, Erjia Yan
, Guangrong Li:
Large-scale joint topic, sentiment & user preference analysis for online reviews. 847-856 - Chuxu Zhang, Lu Yu, Xiangliang Zhang
, Nitesh V. Chawla
:
ImWalkMF: Joint matrix factorization and implicit walk integrative learning for recommendation. 857-866 - Lei Zheng, Bokai Cao, Vahid Noroozi, Philip S. Yu, Nianzu Ma:
Hierarchical collaborative embedding for context-aware recommendations. 867-876 - Ebad Ahmadzadeh, Philip K. Chan
:
Mining pros and cons of actions from social media for decision support. 877-882 - Masato Asahara, Ryohei Fujimaki:
Distributed Bayesian piecewise sparse linear models. 883-888 - Kubilay Atasu
, Thomas P. Parnell, Celestine Dünner, Manolis Sifalakis
, Haralampos Pozidis, Vasileios Vasileiadis, Michail Vlachos
, Cesar Berrospi, Abdel Labbi:
Linear-complexity relaxed word Mover's distance with GPU acceleration. 889-896 - Ricardo Baeza-Yates
, Zeinab Liaghat:
Quality-efficiency trade-offs in machine learning for text processing. 897-904 - Jose Cadena, Saliya Ekanayake, Anil Vullikanti:
Fast graph scan statistics optimization using algebraic fingerprints. 905-910 - Zaineb Chelly Dagdia
, Christine Zarges
, Gaël Beck, Mustapha Lebbah:
A distributed rough set theory based algorithm for an efficient big data pre-processing under the spark framework. 911-916 - Hoang Anh Dau, Diego Furtado Silva, François Petitjean, Germain Forestier
, Anthony J. Bagnall, Eamonn J. Keogh:
Judicious setting of Dynamic Time Warping's window width allows more accurate classification of time series. 917-922 - Alexander Denzler, Michael Kaufmann:
Toward granular knowledge analytics for data intelligence: Extracting granular entity-relationship graphs for knowledge profiling. 923-928 - Ankit Desai, Sanjay Chaudhary:
Distributed decision tree v.2.0. 929-934 - Mohammad M. Ghassemi, Willow Jarvis, Tuka Alhanai
, Emery N. Brown, Roger G. Mark, M. Brandon Westover:
An open-source tool for the transcription of paper-spreadsheet data: Code and supplemental materials available online: Https: //github.com/deskool/images to spreadsheets. 935-941 - Poonam Goyal
, Jagat Sesh Challa, Shivin Shrivastava, Navneet Goyal:
AnyFI: An anytime frequent itemset mining algorithm for data streams. 942-947 - Tatsuru Kobayashi, Shin Matsushima, Taito Lee, Kenji Yamanishi
:
Discovering potential traffic risks in Japan using a supervised learning approach. 948-955 - Martin Koehler, Alex Bogatu, Cristina Civili, Nikolaos Konstantinou, Edward Abel
, Alvaro A. A. Fernandes, John A. Keane, Leonid Libkin
, Norman W. Paton
:
Data context informed data wrangling. 956-963 - Naama Kraus, David Carmel, Idit Keidar:
Fishing in the stream: Similarity search over endless data. 964-969 - Liang Ma, Guohong Cao, Lance M. Kaplan:
Graphical approach for influence maximization in social networks under generic threshold-based non-submodular model. 970-975 - Aritra Mandal, Mohammad Al Hasan:
A distributed k-core decomposition algorithm on spark. 976-981 - Mohammad Hossein Namaki, Peng Lin, Yinghui Wu:
Event pattern discovery by keywords in graph streams. 982-987 - Michael Nelson, Sridhar Radhakrishnan, Amlan Chatterjee, Chandra N. Sekharan:
Queryable compression on streaming social networks. 988-993 - Fengchao Peng, Yudian Ji, Qiong Luo
, Lionel M. Ni:
Event-based non-parametric clustering of team sport trajectories. 994-999 - Sumit Purohit, Sutanay Choudhury, Lawrence B. Holder:
Application-specific graph sampling for frequent subgraph mining and community detection. 1000-1005 - Hung Tran-The, Koji Zettsu:
Discovering co-occurrence patterns of heterogeneous events from unevenly-distributed spatiotemporal data. 1006-1011 - Takeaki Uno, Hiroki Maegawa, Takanobu Nakahara, Yukinobu Hamuro, Ryo Yoshinaka
, Makoto Tatsuta:
Micro-clustering by data polishing. 1012-1018 - Chenwei Zhang, Nan Du, Wei Fan, Yaliang Li, Chun-Ta Lu, Philip S. Yu:
Bringing semantic structures to user intent detection in online medical queries. 1019-1026 - Daniel Yue Zhang, Dong Wang, Hao Zheng, Xin Mu, Qi Li, Yang Zhang:
Large-scale point-of-interest category prediction using natural language processing models. 1027-1032 - Alexander Heifetz, Vaikkunth Mugunthan, Lalana Kagal
:
Shade: A differentially-private wrapper for enterprise big data. 1033-1042 - Balaji Palanisamy, Chao Li
, Prashant Krishnamurthy:
Group privacy-aware disclosure of association graph data. 1043-1052 - Lichao Sun
, Xiaokai Wei, Jiawei Zhang, Lifang He, Philip S. Yu, Witawas Srisa-an
:
Contaminant removal for Android malware detection systems. 1053-1062 - Xi Zhang, Yu Zeng, Xiao-Bo Jin, Zhiwei Yan, Guang-Gang Geng
:
Boosting the phishing detection performance by semantic analysis. 1063-1070 - Robert A. Bridges, Jessie D. Jamieson, Joel W. Reed:
Setting the threshold for high throughput detectors: A mathematical approach for ensembles of dynamic, heterogeneous, probabilistic anomaly detectors. 1071-1078 - Dong Chen, David E. Irwin:
Weatherman: Exposing weather-based privacy threats in big energy data. 1079-1086 - Jiuyong Li
, Jixue Liu, Lin Liu
, Thuc Duy Le, Saisai Ma, Yizhao Han:
Discrimination detection by causal effect estimation. 1087-1094 - Amit Pande, Vishal Ahuja:
WEAC: Word embeddings for anomaly classification from event logs. 1095-1100 - Shuo Wang, Richard O. Sinnott, Surya Nepal
:
Privacy-protected place of activity mining on big location data. 1101-1108 - Shuo Wang, Richard O. Sinnott, Surya Nepal
:
Sensitive gazetteer discovery and protection for mobile social media users. 1109-1116 - Tianqing Zhu, Ping Xiong, Gang Li
, Wanlei Zhou
, Philip S. Yu:
Differentially private query learning: From data publishing to model publishing. 1117-1122 - Eric Breck, Shanqing Cai, Eric Nielsen, Michael Salib, D. Sculley:
The ML test score: A rubric for ML production readiness and technical debt reduction. 1123-1132 - Meng-Fen Chiang, Ee-Peng Lim
, Wang-Chien Lee, Agus Trisnajaya Kwee:
BTCI: A new framework for identifying congestion cascades using bus trajectory data. 1133-1142 - Pankaj Goel
, Aniruddha Datta, M. Sam Mannan:
Application of big data analytics in process safety and risk management. 1143-1152 - Lei Huang, Weijia Xu, Si Liu, Venktesh Pandey
, Natalia Ruiz-Juri:
Enabling versatile analysis of large scale traffic video data with deep learning and HiveQL. 1153-1162 - Hiroshi Inoue:
Fast interpolation of grid data at a non-grid point. 1163-1172 - Xiaowei Jia, Yifan Hu, Ankush Khandelwal, Anuj Karpatne, Vipin Kumar:
Joint sparse auto-encoder: A semi-supervised spatio-temporal approach in mapping large-scale croplands. 1173-1182 - Pasan Karunaratne, Masud Moshtaghi, Shanika Karunasekera, Aaron Harwood, Trevor Cohn:
Multi-step prediction with missing smart sensor data using multi-task Gaussian processes. 1183-1192 - Abhinav Maurya, Rahul Telang:
Bayesian multi-view models for member-job matching and personalized skill recommendations. 1193-1202 - Mai H. Nguyen, Daniel Crawl, Jiaxin Li, Dylan Uys, Ilkay Altintas:
Automated scalable detection of location-specific Santa Ana conditions from weather data using unsupervised learning. 1203-1212 - Haoyu Wang, Jiaqi Gong, Yan Zhuang, Haiying Shen, John C. Lach:
HealthEdge: Task scheduling for edge computing with health emergency and human behavior consideration in smart homes. 1213-1222 - Jingyuan Zhang, Chun-Ta Lu, Bokai Cao, Yi Chang
, Philip S. Yu:
Connecting emerging relationships from news via tensor factorization. 1223-1232 - Yuan Zhang, Chen Lin, Min Chi, Julie S. Ivy, Muge Capan, Jeanne M. Huddleston:
LSTM for septic shock: Adding unreliable labels to reliable predictions. 1233-1242 - Baoxin Zhao, Chengzhong Xu
, Siyuan Liu:
A data-driven congestion diffusion model for characterizing traffic in metrocity scales. 1243-1252 - Allard J. van Altena, Perry D. Moerland, Aeilko H. Zwinderman, Sílvia D. Olabarriaga:
Analysis of the term 'big data': Usage in biomedical publications. 1253-1258 - Marzieh Bakhshandeh, Dennis M. M. Schunselaar, Henrik Leopold, Hajo A. Reijers:
Predicting treatment repetitions in the implant denture therapy process. 1259-1264 - Jian Cao, Fangzhou Yang, Yuchang Xu, Yudong Tan, Quan-Wu Xiao:
Personalized flight recommendations via paired choice modeling. 1265-1270 - Zhitang Chen, Ke He, Jian Li, Yanhui Geng:
Seq2Img: A sequence-to-image based approach towards IP traffic classification using convolutional neural networks. 1271-1276 - Chung Ming Cheung, Palash Goyal
, Viktor K. Prasanna, Arash Saber Tehrani:
OReONet: Deep convolutional network for oil reservoir optimization. 1277-1282 - Giuseppe Cuccu
, Somayeh Danafar, Philippe Cudré-Mauroux
, Martin Gassner, Stefano Bernero, Krzysztof Kryszczuk:
A data-driven approach to predict NOx-emissions of gas turbines. 1283-1288 - Angelo Furno, Nour-Eddin El Faouzi, Rajesh Sharma, Eugenio Zimeo:
Two-level clustering fast betweenness centrality computation for requirement-driven approximation. 1289-1294 - Xueying Guo, George Trimponias, Xiaoxiao Wang, Zhitang Chen, Yanhui Geng, Xin Liu:
Cellular network configuration via online learning and joint optimization. 1295-1300 - Jiankun Huang, Wenjun Wu:
T-BMIRT: Estimating representations of student knowledge and educational components in online education. 1301-1306 - Xinjiang Lu, Zhiwen Yu
, Chuanren Liu
, Yanchi Liu, Hui Xiong, Bin Guo
:
Forecasting the rise and fall of volatile point-of-interests. 1307-1312 - Stanislav Sobolevsky, Emanuele Massaro, Iva Bojic, Juan Murillo Arias, Carlo Ratti:
Predicting regional economic indices using big data of individual bank card transactions. 1313-1318 - Chuishi Meng, Yu Cui, Qing He, Lu Su, Jing Gao:
Travel purpose inference with GPS trajectories, POIs, and geo-tagged social media data. 1319-1324 - Jennifer Sleeman, Milton Halem, Tim Finin, Mark Cane:
Discovering scientific influence using cross-domain dynamic topic modeling. 1325-1332 - Mohiuddin Solaimani, Sayeed Salam, Latifur Khan
, Patrick T. Brandt, Vito D'Orazio:
RePAIR: Recommend political actors in real-time from news websites. 1333-1340 - Xing Su, Yuan Yao, Qing He, Jie Lu, Hanghang Tong
:
Personalized travel mode detection with smartphone sensors. 1341-1348 - Ashish Tapdiya, Daniel Fabbri:
A comparative analysis of state-of-the-art SQL-on-Hadoop systems for interactive analytics. 1349-1356 - Tingyang Xu
, Tan Yan, Dongjin Song, Wei Cheng, Haifeng Chen, Geoff Jiang, Jinbo Bi:
Identifying and quantifying nonlinear structured relationships in complex manufactural systems. 1357-1362 - Yuchang Xu, Jian Cao:
OTPS: A decision support service for optimal airfare Ticket Purchase. 1363-1368 - Hu Xu, Sihong Xie, Lei Shu, Philip S. Yu:
Product function need recognition via semi-supervised attention network. 1369-1374 - Wenbo Zhang, Dheeraj Kumar, Satish V. Ukkusuri:
Exploring the dynamics of surge pricing in mobility-on-demand taxi services. 1375-1380 - Yihua Shi Astle, Xuning Tang, Craig Freeman:
Application of dynamic logistic regression with unscented Kalman filter in predictive coding. 1381-1389 - Mansurul Alam Bhuiyan, Mohammad Al Hasan:
RAVEN: Web-based smart home exploration system through interactive pattern discovery. 1390-1399 - Simon Bin, Patrick Westphal, Jens Lehmann
, Axel Ngonga
:
Implementing scalable structured machine learning for big data in the SAKE project. 1400-1407 - Zheng Chen, Xinli Yu, Chi Zhang, Jin Zhang, Cui Lin, Bo Song, Jianliang Gao, Xiaohua Hu, Wei-Shih Yang, Erjia Yan
:
Fast botnet detection from streaming logs using online lanczos method. 1408-1417 - Yuheng Du, Alexander Herzog, André Luckow, Ramu Nerella, Christopher Gropp, Amy W. Apon:
Representativeness of latent dirichlet allocation topics estimated from data samples with application to common crawl. 1418-1427 - Rishi Chhatwal, Nathaniel Huber-Fliflet, Robert Keeling, Jianping Zhang, Haozhen Zhao:
Empirical evaluations of active learning strategies in legal document review. 1428-1437 - T. F. Kennedy, Robert S. Provence, James L. Broyan, Patrick W. Fink, Phong H. Ngo, Lazaro D. Rodriguez:
Topic models for RFID data modeling and localization. 1438-1446 - Ishita K. Khan, Prathyusha Senthil Kumar, Daniel Miranda, David Goldberg:
What is skipped: Finding desirable items in e-commerce search by discovering the worst title tokens. 1447-1456 - Youngho Kim, Petros Zerfos, Vadim Sheinin, Nancy Greco:
Ranking the importance of ontology concepts using document summarization techniques. 1457-1466 - Lay Wai Kong:
Performance optimization in scale-out storage using design of experiment as heuristic. 1467-1474 - Hyunjong Lee, Youngin Jo, Sanghyuk Chun, Kwangseob Kim:
A study on intelligent personalized push notification with user history. 1475-1482 - Xiaomo Liu, Armineh Nourbakhsh, Quanzhi Li, Sameena Shah, Robert Martin, John Duprey:
Reuters tracer: Toward automated news production using large scale social media data. 1483-1493 - Justin McHugh, Paul E. Cuddihy, Jenny Weisenberg Williams, Kareem S. Aggour
, Vijay S. Kumar, Varish Mulwad:
Integrated access to big data polystores through a knowledge-driven framework. 1494-1503 - Jacob Montiel
, Albert Bifet
, Talel Abdessalem:
Predicting over-indebtedness on batch and streaming data. 1504-1513 - Ye Ouyang
, Zhongyuan Li, Le Su, Wenyuan Lu, Zhenyi Lin:
APP-SON: Application characteristics-driven SON to optimize 4G/5G network performance and quality of experience. 1514-1523 - Karthikeyan Natesan Ramamurthy, Dennis Wei, Emily Ray, Moninder Singh, Vijay S. Iyengar, Dmitriy A. Katz-Rogozhnikov, Jingwei Yang, Kevin N. Tran, Gigi Y. Yuen-Reed:
A configurable, big data system for on-demand healthcare cost prediction. 1524-1533 - Syed Yousaf Shah, Zengwen Yuan, Songwu Lu, Petros Zerfos:
Dependency analysis of cloud applications for performance monitoring using recurrent neural networks. 1534-1543 - Walid Shalaby, BahaaEddin AlAila, Mohammed Korayem, Layla Pournajaf, Khalifeh AlJadda, Shannon Quinn, Wlodek Zadrozny:
Help me find a job: A graph-based approach for job recommendation at scale. 1544-1553 - Derrick C. Spell, Xiao-Han T. Zeng, Jae Young Chung, Bahador Nooraei, Richard T. Shomer, Ling-Yong Wang, James C. Gibson, Daniel Kirsche:
Flux: Groupon's automated, scalable, extensible machine learning platform. 1554-1559 - Nenad Stojanovic, Marko Dinic, Ljiljana Stojanovic:
A data-driven approach for multivariate contextualized anomaly detection: Industry use case. 1560-1569 - Dharmashankar Subramanian, Debarun Bhattacharjya, Ruben Rodriguez Torrado, Jeffrey O. Kephart, Vijil Chenthamarakshan, Jesus Rios:
A cognitive assistant for risk identification and modeling. 1570-1579 - Warut D. Vijitbenjaronk, Jinho Lee
, Toyotaro Suzumura, Gabriel Tanase:
Scalable time-versioning support for property graph databases. 1580-1589 - Xuchao Zhang, Liang Zhao, Zhiqian Chen, Arnold P. Boedihardjo, Jing Dai, Chang-Tien Lu
:
Trendi: Tracking stories in news and microblogs via emerging, evolving and fading topics. 1590-1599 - Zhiwei Zhang, Ning Chen, Jun Wang, Luo Si:
SMART: Sponsored mobile app recommendation by balancing app downloads and appstore profit. 1600-1609 - Wen-Yuan Zhu, Wen-Yueh Shih, Ying-Hsuan Lee, Wen-Chih Peng, Jiun-Long Huang:
A gamma-based regression for winning price estimation in real-time bidding advertising. 1610-1619 - Nirupama Appiktala, Miao Chen, Michael Natkovich, Joshua J. Walters:
Demystifying dark matter for online experimentation. 1620-1626 - Neela Avudaiappan, Alexander Herzog, Sneha Kadam, Yuheng Du, Jason Thatcher, Ilya Safro:
Detecting and summarizing emergent events in microblogs and social media streams by dynamic centralities. 1627-1634 - Russell Chen, Miao Chen, Mahendrasinh Ramsinh Jadav, Joonsuk Bae, Don Matheson:
Faster online experimentation by eliminating traditional A/A validation. 1635-1641 - Ferosh Jacob, Ilamgumaran Karunanithi, Pramod Salian, Ravi Sambhu:
BBC: A DSL for designing cloud-based heterogeneous bigdata pipelines. 1642-1645 - George Mathew:
Architectural considerations for highly scalable computing to support on-demand video analytics. 1646-1649 - Leonardo Maria Millefiori, Paolo Braca, Gianfranco Arcieri:
Scalable distributed change detection and its application to maritime traffic. 1650-1657 - Ankita R. Nambiar, Nikitha Reddy, Debojyoti Dutta:
Connected health: Opportunities and challenges. 1658-1662 - Emmanuel Oyekanlu
:
Predictive edge computing for time series of industrial IoT and large scale critical infrastructure based on open-source software analytic of big data. 1663-1669 - Kevin B. Pratt:
Linking many unusual co-incidences. 1670-1675 - Martin Ringsquandl, Evgeny Kharlamov, Daria Stepanova, Steffen Lamparter, Raffaello Lepratti, Ian Horrocks, Peer Kröger:
On event-driven knowledge graph completion in digital factories. 1676-1681 - Giannis Spiliopoulos
, Konstantinos Chatzikokolakis, Dimitrios Zissis
, Evmorfia Biliri
, Dimitrios Papaspyros, Giannis Tsapelas, Spyros Mouzakitis
:
Knowledge extraction from maritime spatiotemporal data: An evaluation of clustering algorithms on Big Data. 1682-1687 - Xuchao Zhang, Zhiqian Chen, Liang Zhao, Arnold P. Boedihardjo, Chang-Tien Lu
:
TRACES: Generating Twitter stories via shared subspace and temporal smoothness. 1688-1693 - Christine Balili, Aviv Segev, Uichin Lee:
Tracking and predicting the evolution of research topics in scientific literature. 1694-1697 - Gong Cheng, Evgeny Kharlamov:
Towards a semantic keyword search over industrial knowledge graphs (extended abstract). 1698-1700 - Ajay Dholakia, Prasad Venkatachar, Kshitij A. Doshi, Ravikanth Durgavajhala, Stewart Tate, Berni Schiefer, Matthew Sheard, Ramnath Sai Sagar:
Designing a high performance cluster for large-scale SQL-on-hadoop analytics. 1701-1703 - Maurizio Montagnuolo, Alberto Messina, Nicolo Bidotti, Paolo Platter, Alessio Bosca:
Real time semantic enrichment of broadcast content in the big data age. 1704-1708 - Yiran Zhao, Shuochao Yao, Shaohan Hu, Shiyu Chang, Raghu K. Ganti, Mudhakar Srivatsa, Shen Li, Tarek F. Abdelzaher:
On the improvement of classifying EEG recordings using neural networks. 1709-1711 - Zhou Fa, Guang-Gang Geng
, Zhiwei Yan, Xiao-Dong Lee:
A robust internet abuse detection method. 1712-1715 - Alexander Brodsky, Mohan Krishnamoorthy, M. Omar Nachawati, William Z. Bernstein, Daniel A. Menascé:
Manufacturing and contract service networks: Composition, optimization and tradeoff analysis based on a reusable repository of performance models. 1716-1725 - Max Ferguson, Ronay Ak, Yung-Tsun Tina Lee, Kincho H. Law:
Automatic localization of casting defects with convolutional neural networks. 1726-1735 - Yunpeng Li, Heng Zhang, Utpal Roy, Y. Tina Lee:
A data-driven approach for improving sustainability assessment in advanced manufacturing. 1736-1745 - Don Libes, David Lechevalier, Sanjay Jain:
Issues in synthetic data generation for advanced manufacturing. 1746-1754 - Srinivasan Radhakrishnan, Yung-Tsun Tina Lee, Sagar V. Kamarthi:
Estimation of online tool wear in turning processes using recurrence quantification analysis (RQA). 1755-1759 - Heather M. Reed, Richard P. Vinci
, Corbin Robeck, Trevor Verdonik, Michael Pires, Maria Castro, Wojciech Z. Misiolek, Christina Viau Haden:
Statistically-substantiated density characterizations of additively manufactured steel alloys through verification, validation, and uncertainty quantification. 1760-1768 - Thurston Sexton, Michael P. Brundage, Michael Hoffman, K. C. Morris:
Hybrid datafication of maintenance logs from AI-assisted human tags. 1769-1777 - Akinori Abe, Yuki Hayashi:
Data treatment from the viewpoint of granular computing. 1778-1785 - Fatemeh Cheraghchi, Ibrahim Y. Abualhaol, Rafael Falcon, Rami S. Abielmona, Bijan Raahemi, Emil M. Petriu:
Big-data-enabled modelling and optimization of granular speed-based vessel schedule recovery problem. 1786-1794 - Lihao Ge, Teng-Sheng Moh:
Improving text classification with word embedding. 1796-1805 - Marek Grzegorowski
, Andrzej Janusz
, Dominik Slezak
, Marcin S. Szczuka:
On the role of feature space granulation in feature selection processes. 1806-1815 - Tzung-Pei Hong
, Lu-Hung Chen, Shyue-Liang Wang, Chun-Wei Lin
, Bay Vo:
Quasi-erasable itemset mining. 1816-1820 - Tsau Young Lin, Pierre Vachon:
Secure information flow and file movements: A topological theory of discretionary access controls. 1821-1829 - Ahmad M. Mustafa
, Gbadebo Ayoade, Khaled Al-Naami, Latifur Khan
, Kevin W. Hamlen, Bhavani Thuraisingham, Frederico Araujo:
Unsupervised deep embedding for novel class detection over data stream. 1830-1839 - Dominik Slezak
, Agnieszka Chadzynska-Krasowska
, Joel Holland, Piotr Synak, Rick Glick, Marcin Perkowski:
Scalable cyber-security analytics with a new summary-based approximate query engine. 1840-1849 - Shusaku Tsumoto, Tomohiro Kimura, Haruko Iwata, Shoji Hirano:
Mining text for disease diagnosis in hospital information system. 1850-1859 - Shuyin Xia, Guoyin Wang
, Yunsheng Liu, Qun Liu, Hong Yu:
Noise self-filtering K-nearest neighbors algorithms. 1860-1965 - Josh Jia-Ching Ying, Po-Yu Huang, Chih-Kai Chang, Don-Lin Yang:
A preliminary study on deep learning for predicting social insurance payment behavior. 1866-1875 - Hayri Volkan Agun, Sibel Yilmazel, Özgür Yilmazel:
Effects of language processing in Turkish authorship attribution. 1876-1881 - Nora Alkhamees, Maria Fasli:
Event detection from time-series streams using directional change and dynamic thresholds. 1882-1891 - Yusuf Arslan, Aysenur Birturk
, Bekjan Djumabaev, Dilek Küçük
:
Real-time Lexicon-based sentiment analysis experiments on Twitter with a mild (more information, less data) approach. 1892-1897 - Inci Batmaz, Pinar Karagoz
, Gulsah Serdar:
A comparative study on learning to rank with computational methods. 1898-1906 - Belainine Billal, Alexsandro Fonseca, Fatiha Sadat, Hakim Lounis:
Semi-supervised learning and social media text analysis towards multi-labeling categorization. 1907-1916 - Tugce Dongel, Yasemin Timar:
B3SafirBiyo: Genomic variant analysis with big data technologies. 1917-1925 - Vasco Furtado, Elizabeth Furtado
, Carlos Caminha
, André Lopes, Victor Dantas, Caio Ponte, Sofia Cavalcante:
A data-driven approach to help understanding the preferences of public transport users. 1926-1935 - Lovedeep Gondara, Ke Wang:
Recovering loss to followup information using denoising autoencoders. 1936-1945 - Muhittin Isik, Hasan Dag
:
A recommender model based on trust value and time decay: Improve the quality of product rating score in E-commerce platforms. 1946-1955 - Maryam Bahojb Imani, Swarup Chandra, Samuel Ma, Latifur Khan
, Bhavani Thuraisingham:
Focus location extraction from political news reports with bias correction. 1956-1964 - Kishlay Jha
, Guangxu Xun
, Vishrawas Gopalakrishnan, Aidong Zhang:
Augmenting word embeddings through external knowledge-base for biomedical application. 1965-1974 - Shady S. Refaat, Amira Mohamed, Haitham Abu-Rub:
Big data impact on stability and reliability improvement of smart grid. 1975-1982 - Ibrahim Kok
, Mehmet Ulvi Simsek, Suat Özdemir
:
A deep learning model for air quality prediction in smart cities. 1983-1990 - Giannis V. Koumoutsos, Maria Fasli, Ian Lewin, David Milward:
Graph-based information exploration over structured and unstructured data. 1991-2000 - Paula Lauren, Guangzhi Qu, Paul Watta:
Convolutional neural network for clinical narrative categorization. 2001-2008 - Kwan Hui Lim, Shanika Karunasekera, Aaron Harwood:
ClusTop: A clustering-based topic modelling algorithm for twitter using word networks. 2009-2018 - Long Hoang Nguyen, Andrew Salopek, Liang Zhao, Fang Jin
:
A natural language normalization approach to enhance social media text reasoning. 2019-2026 - Mustafa V. Nural, Hao Peng, John A. Miller
:
Using meta-learning for model type selection in predictive big data analytics. 2027-2036 - Aras Can Onal, Omer Berat Sezer, A. Murat Ozbayoglu
, Erdogan Dogdu
:
Weather data analysis and sensor fault detection using an extended IoT framework with semantics, big data, and machine learning. 2037-2046 - Yiming Pan, Xuefeng Peng, Tianran Hu, Jiebo Luo
:
Understanding what affects career progression using linkedin and twitter data. 2047-2055 - Thomas Papastergiou
, Vasileios Megalooikonomou:
A distributed proximal gradient descent method for tensor completion. 2056-2065 - Xuefeng Peng, Yiming Pan, Jiebo Luo
:
Predicting high taxi demand regions using social media check-ins. 2066-2075 - Xuefeng Peng, Jiebo Luo
, Catherine Glenn
, Li-Kai Chi, Jingyao Zhan:
Sleep-deprived fatigue pattern analysis using large-scale selfies from social media. 2076-2084 - Harun Pirim:
Mathematical programming for social network analysis. 2085-2088 - Ali Sekmen, Ahmet Bugra Koku
, Mustafa Parlaktuna, Ayad Abdul-Malek, Nagendrababu Vanamala:
Unsupervised deep learning for subspace clustering. 2089-2094 - Ali Sekmen, Akram Aldroubi
, Ahmet Bugra Koku
, Keaton Hamm:
Principal coordinate clustering. 2095-2101 - Gokberk Serin
, M. Ugur Gudelek
, A. Murat Ozbayoglu
, Hakki Özgür Ünver
:
Estimation of parameters for the free-form machining with deep neural network. 2102-2111 - M. Omair Shafiq, Eric Torunski:
Towards MapReduce based Bayesian deep learning network for monitoring big data applications. 2112-2121 - Walid Shalaby, Wlodek Zadrozny:
Mined semantic analysis: A new concept space model for semantic representation of textual data. 2122-2131 - Adisak Sukul, Baskar Gopalakrishnan, Wallapak Tavanapong, David A. M. Peterson:
Online video ad measurement for political science research. 2132-2140 - Fangzhou Sun
, Abhishek Dubey
, Jules White:
DxNAT - Deep neural networks for explaining non-recurring traffic congestion. 2141-2150 - Imtiaz Ullah
, Qusay H. Mahmoud:
A filter-based feature selection model for anomaly-based intrusion detection systems. 2151-2159 - Imtiaz Ullah
, Qusay H. Mahmoud:
A hybrid model for anomaly-based intrusion detection in SCADA networks. 2160-2167 - Daniel Xie, Jiejun Xu, Tsai-Ching Lu:
What's trending tomorrow, today: Using early adopters to discover popular posts on Tumblr. 2168-2176 - Zhou Yang, Long Hoang Nguyen, Joshua Stuve, Guofeng Cao, Fang Jin
:
Harvey flooding rescue in social media. 2177-2185 - Ozlem Yavanoglu, Murat Aydos
:
A review on cyber security datasets for machine learning algorithms. 2186-2193 - Jianbo Yuan, Han Guo, Zhiwei Jin, Hongxia Jin, Xianchao Zhang, Jiebo Luo
:
One-shot learning for fine-grained relation extraction via convolutional siamese neural network. 2194-2199 - Semih Yumusak
, Riza Emre Aras, Elif Uysal
, Erdogan Dogdu
, Halife Kodaz
, Kasim Oztoprak:
SpEnD portal: Linked data discovery using SPARQL endpoints. 2200-2202 - Philipp Zehnder, Dominik Riemer:
Modeling self-service machine-learning agents for distributed stream processing. 2203-2212 - Bethany G. Anderson
, Christopher J. Prom
, Kevin Hamilton, James A. Hutchinson, Mark Sammons, Alex Dolski:
The cybernetics thought collective project: Using computational methods to reveal intellectual context in archival material. 2213-2218 - Tobias Blanke
, Jon Wilson:
Identifying epochs in text archives. 2219-2224 - Mike Bryant
:
GraphQL for archival metadata: An overview of the EHRI GraphQL API. 2225-2230 - Pascal Dugenie, Nuno Freire, Daan Broeder:
Building new knowledge from distributed scientific corpus: HERBADROP & EUROPEANA: Two concrete case studies for exploring big archival data. 2231-2239 - Todd Richard Goodall, Maria Esteva
, Sandra Sweat, Alan C. Bovik:
Towards automated quality curation of video collections from a realistic perspective. 2240-2245 - Nicola Horsley
:
What can a knowledge complexity approach reveal about big data and archival practice? 2246-2250 - Tim Hutchinson:
Protecting privacy in the archives: Preliminary explorations of topic modeling for born-digital collections. 2251-2255 - Benjamin Charles Germain Lee:
Line detection in binary document scans: A case study with the international tracing service archives. 2256-2261 - Myeong Lee
, Yuheng Zhang, Shiyun Chen, Edel Spencer, Jhon Dela Cruz, Hyeonggi Hong, Richard Marciano:
Heuristics for assessing Computational Archival Science (CAS) research: The case of the human face of big data project. 2262-2270 - Victoria L. Lemieux:
A typology of blockchain recordkeeping solutions and some reflections on their implications for the future of archival preservation. 2271-2278 - Ji-Ping Lin:
An infrastructure and application of computational archival science to enrich and integrate big digital archival data: Using Taiwan Indigenous Peoples Open Research Data (TIPD) as an example. 2279-2287 - Nathaniel Payne, Jason R. Baron:
Auto-categorization methods for digital archives. 2288-2298 - T. D. Smith:
The blockchain litmus test. 2299-2308 - William Underwood, Richard Marciano
, Sandra Laib, Carl Apgar, Luis Beteta, Waleed Falak, Marisa Gilman, Riss Hardcastle, Keona Holden, Yun Huang, David Baasch, Brittni Ballard, Tricia Glaser, Adam Gray, Leigh Plummer
, Zeynep Diker, Mayanka Jha, Aakanksha Singh, Namrata Walanj:
Computational curation of a digitized record series of WWII Japanese-American Internment. 2309-2313 - Darlan Arruda
, Nazim H. Madhavji:
Towards a requirements engineering artefact model in the context of big data software development projects: Research in progress. 2314-2319 - David K. Becker:
Predicting outcomes for big data projects: Big Data Project Dynamics (BDPD): Research in progress. 2320-2330 - Nancy W. Grady, Jason A. Payne, Huntley Parker:
Agile big data analytics: AnalyticsOps for data science. 2331-2339 - Mike Lakoju, Alan Serrano:
Saving costs with a big data strategy framework. 2340-2347 - Jeffrey S. Saltz
, Ivan Shamshurin:
Does pair programming work in a data science context? An initial case study. 2348-2354 - Jeffrey S. Saltz
, Nancy W. Grady:
The ambiguity of data science team roles and the need for a data science workforce framework. 2355-2361 - Toshiyuki Shimono:
Make accumulated data in companies eloquent by SQL statement constructors. 2362-2369 - Shaaban Abbady, Cheng-Yuan Ke, Jennifer Lavergne, Jian Chen
, Vijay V. Raghavan, Ryan Benton
:
Online mining for association rules and collective anomalies in data streams. 2370-2379 - Junzhi Gong, Tong Yang, Yang Zhou, Dongsheng Yang, Shigang Chen, Bin Cui
, Xiaoming Li:
ABC: A practicable sketch framework for non-uniform multisets. 2380-2389 - Vibhuti Gupta
, Rattikorn Hewett:
Harnessing the power of hashtags in tweet analytics. 2390-2395 - Ayae Ichinose, Atsuko Takefusa, Hidemoto Nakada
, Masato Oguchi:
A study of a video analysis framework using Kafka and spark streaming. 2396-2401 - Ovidiu-Cristian Marcu
, Alexandru Costan
, Gabriel Antoniu, María S. Pérez-Hernández, Radu Tudoran, Stefano Bortoli, Bogdan Nicolae
:
Towards a unified storage and ingestion architecture for stream processing. 2402-2407 - Salman Ahmed Shaikh
, Hiroyuki Kitagawa
:
Smart distributed query execution over data streams. 2408-2413 - Georgios Touloupas, Ioannis Konstantinou
, Nectarios Koziris:
RASP: Real-time network analytics with distributed NoSQL stream processing. 2414-2419 - Qian Zhao, Christian Klaue, Chih Lai:
Predicting concept drift via dynamic Naïve Bayes. 2420-2425 - Hadeel Alghamdi, Farhana H. Zulkernine, Patrick Martin:
Leveraging distributed big data storage support in CLAaaS for WINGS workflow management system. 2426-2432 - Hanieh Alipour, Yan Liu:
Online machine learning for cloud resource provisioning of microservice backend systems. 2433-2441 - Chin-Jung Hsu, Vincent W. Freeh, Flavio Villanustre
:
Trilogy: Data placement to improve performance and robustness of cloud computing. 2442-2451 - Bipin Karunakaran, Debdipto Misra, Kyle Marshall, Dhruv Mathrawala, Shravan Kethireddy:
Closing the loop - Finding lung cancer patients using NLP. 2452-2461 - Meike Klettke, Hannes Awolin, Uta Störl
, Daniel Müller, Stefanie Scherzinger:
Uncovering the evolution history of data lakes. 2462-2471 - Joichiro Kon, Naoki Mizusawa, Ayaka Umezawa, Saneyasu Yamaguchi, Jian Tao:
Highly consolidated servers with container-based virtualization. 2472-2479 - Leandro Ordoñez-Ante, Thomas Vanhove, Gregory van Seghbroeck
, Tim Wauters, Bruno Volckaert, Filip De Turck:
Dynamic data transformation for low latency querying in big data systems. 2480-2489 - Marco Vogt
, Alexander Stiemer
, Heiko Schuldt
:
Icarus: Towards a multistore database system. 2490-2499 - Chenxiao Wang, Jason Arenson, Florian Helff, Le Gruenwald, Laurent d'Orazio:
Improving user interaction in mobile-cloud database query processing. 2500-2507 - Kaihui Zhang, Yusuke Tanimura, Hidemoto Nakada
, Hirotaka Ogawa:
Understanding and improving disk-based intermediate data caching in Spark. 2508-2517 - Azim Ahmadzadeh
, Dustin J. Kempton
, Michael A. Schuh, Rafal A. Angryk
:
Improving the functionality of tamura directionality on solar images. 2518-2526 - Sunitha Basodi, Berkay Aydin
, Rafal A. Angryk
:
Parallel computation of magnetic field parameters from HMI active region patches. 2527-2532 - Soukaina Filali Boubrahimi, Berkay Aydin
, Petrus C. Martens, Rafal A. Angryk
:
On the prediction of >100 MeV solar energetic particle events using GOES satellite data. 2533-2542 - Shah Muhammad Hamdi
, Dustin Kempton
, Ruizhe Ma, Soukaina Filali Boubrahimi, Rafal A. Angryk
:
A time series classification-based approach for solar flare prediction. 2543-2551 - Ahmet Küçük, Berkay Aydin
, Rafal A. Angryk
:
Multi-wavelength solar event detection using faster R-CNN. 2552-2558 - Hasan Kurban
, Can Kockan, Mark Jenne, Mehmet M. Dalkilic:
Improving expectation maximization algorithm over stellar data. 2559-2568 - Ruizhe Ma, Soukaina Filali Boubrahimi, Shah Muhammad Hamdi
, Rafal A. Angryk
:
Solar flare prediction using multivariate time series decision trees. 2569-2578 - Simon Marcin, André Csillaghy
:
Accelerating scientific algorithms in array databases with GPUs. 2579-2587 - Adrienne Colborne, Michael Smit
:
Identifying and mitigating risks to the quality of open data in the post-truth era. 2588-2594 - Matthew L. Dering, Conrad S. Tucker:
Generative adversarial networks for increasing the veracity of big data. 2595-2602 - Junhua Ding, XinChuan Li, Venkat N. Gudivada:
Augmentation and evaluation of training data for deep learning. 2603-2611 - Kim Hee:
Is data quality enough for a clinical decision?: Apply machine learning and avoid bias. 2612-2619 - Alina Lazar, Ling Jin, C. Anna Spurlock, Kesheng Wu
, Alex Sim
:
Data quality challenges with missing values and mixed types in joint sequence analysis. 2620-2627 - Daniel Muller, Yiea-Funk Te, Pratiksha Jain:
Improving data quality through high precision gender categorization. 2628-2636 - Tim Marple, Bruce A. Desmarais, Kevin L. Young:
Collapsing corporate confusion: Leveraging network structures for effective entity resolution in relational corporate data. 2637-2643 - Shahab Tayeb, Matin Pirouz, Brittany Cozzens, Richard Huang, Maxwell Jay, Kyle Khembunjong, Sahan Paliskara, Felix Zhan, Mark Zhang, Justin Zhan, Shahram Latifi:
Toward data quality analytics in signature verification using a convolutional neural network. 2644-2651 - Yongle Chen, Hui Li, Kejiao Li, Jiyang Zhang:
An improved P2P file system scheme based on IPFS and Blockchain. 2652-2657 - Hui Li, Jiawei Hu, Huajun Ma, Ting Huang:
The architecture of distributed storage system under mimic defense theory. 2658-2663 - Haopeng Li, Hui Li:
A scheduling strategy based on multi-queues of Cassandra. 2664-2669 - Zhili Lin, Kedan Li, Hanxu Hou, Xin Yang, Hui Li:
MDFS: A mimic defense theory based architecture for distributed file system. 2670-2675 - Jiyang Zhang, Hanxu Hou, Kedan Li, Hui Li:
On the implementation of BRS codes in Ceph. 2676-2681 - Mahsa Badami, Olfa Nasraoui
, Wenlong Sun
, Patrick Shafto:
Detecting polarization in ratings: An automated pipeline and a preliminary quantification on several benchmark data sets. 2682-2690 - Stephen Bonner, John Brennan
, Ibad Kureshi, Georgios Theodoropoulos
, Andrew Stephen McGough, Boguslaw Obara:
Evaluating the quality of graph embeddings via topological feature reconstruction. 2691-2700 - Wei-Lun Chang:
Using sentiment analysis to explore the degree of risk in sharing economy. 2701-2709 - Hsin-Yu Chen, Cheng-Te Li:
PSEISMIC: A personalized self-exciting point process model for predicting tweet popularity. 2710-2713 - Anahita Davoudi, Mainak Chatterjee:
Detection of profile injection attacks in social recommender systems using outlier analysis. 2714-2719 - Benjamin Flesch, Ravi Vatrapu
, Raghava Rao Mukkamala
:
A big social media data study of the 2017 german federal election based on social set analysis of political party Facebook pages with SoSeVi. 2720-2729 - K. M. George:
Using an asset price bubble model in tweet analytics. 2730-2739 - Takako Hashimoto, Hiroshi Okamoto
, Tetsuji Kuboyama, Kilho Shin:
Topic life cycle extraction from big Twitter data based on community detection in bipartite networks. 2740-2745 - Hsiao-Wei Hu, Ching-Han Cheng, Yun-Chu Chung, Chia-Yu Lee:
Ticket-purchase behavior under the effects of marketing campaigns on facebook fan pages. 2746-2751 - Dijana Kosmajac, Vlado Keselj
:
Language identification in multilingual, short and noisy texts using common N-grams. 2752-2759 - Thomas-Joseph Loiseau, Sonia Djebali, Thomas Raimbault, Bérengère Branchet, Gaël Chareyron
:
Characterization of daily tourism behaviors based on place sequence analysis from photo sharing websites. 2760-2765 - Gang Wu, Viswanathan Swaminathan, Saayan Mitra, Ratnesh Kumar:
Digital content recommendation system using implicit feedback data. 2766-2771 - Nadiya Straton
, Raghava Rao Mukkamala
, Ravi Vatrapu
:
Big social data analytics for public health: Comparative methods study and performance indicators of health care content on Facebook. 2772-2777 - Tianqi Xia, Xuan Song, Dou Huang, Satoshi Miyazawa
, Zipei Fan
, Renhe Jiang
, Ryosuke Shibasaki:
Outbound behavior analysis through social network data: A case study of Chinese people in Japan. 2778-2786 - Tariq Abughofa, Farhana H. Zulkernine:
Towards online graph processing with spark streaming. 2787-2794 - Maaike de Boer, Barry Nouwt, Michael van Bekkum:
SUDS: System for uncertainty decision support. 2795-2803 - Giuseppe Bruno, Demetrio Condello, Alberto Falzone, Andrea Luciani:
Big data processing: Is there a framework suitable for economists and statisticians? 2804-2811 - Keren Ouaknine, Michael J. Carey:
A performance study of AsterixDB. 2812-2820 - Sheriffo Ceesay, Adam Barker, Blesson Varghese:
Plug and play bench: Simplifying big data benchmarking using containers. 2821-2828 - Wanghu Chen, Xintian Li, Jing Li, Jianwu Wang:
Enhancing the MapReduce training of BP neural networks based on local weight matrix evolution. 2829-2835 - Wei-Chun Chung, Jan-Ming Ho, Chung-Yen Lin
, D. T. Lee:
CloudEC: A MapReduce-based algorithm for correcting errors in next-generation sequencing big data. 2836-2842 - Rustem Dautov, Salvatore Distefano:
Quantifying volume, velocity, and variety to support (Big) data-intensive application development. 2843-2852 - Janakiram Dharanipragada, Srikant Padala, Balaji Kammili, Vikram Kumar:
Tula: A disk latency aware balancing and block placement strategy for Hadoop. 2853-2858 - Sina Gholamian, Wojciech M. Golab, Paul A. S. Ward:
Efficient incremental data analytics with apache spark. 2859-2868 - Pei Guo, Jianwu Wang, Zhiyuan Chen:
A comparison of big data application programming approaches: A travel companion case study. 2869-2878 - Andrew Halterman, Jill Irvine, Manar Landis, Phanindra Jalla, Yan Liang, Christan Grant
, Mohiuddin Solaimani:
Adaptive scalable pipelines for political event data generation. 2879-2883 - Chengzhi Lu, Kejiang Ye, Guoyao Xu, Cheng-Zhong Xu, Tongxin Bai:
Imbalance in the cloud: An analysis on Alibaba cluster trace. 2884-2892 - Piotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, David J. Keffer
, Jack J. Dongarra:
Scaling point set registration in 3D across thread counts on multicore and hardware accelerator platforms through autotuning for large scale analysis of scientific point clouds. 2893-2902 - Yuri Nishikawa
, Hitoshi Sato, Jun Ozawa:
Performance evaluation of multiple sports player tracking system based on graph optimization. 2903-2910 - Pouria Pirzadeh, Michael J. Carey, Till Westmann:
A performance study of big data analytics platforms. 2911-2920 - Vincent Reniers, Dimitri Van Landuyt
, Ansar Rafique, Wouter Joosen:
Schema design support for semi-structured data: Finding the sweet spot between NF and De-NF. 2921-2930 - Shanshan Huang, Jungang Xu, Renfeng Liu, Husheng Liao:
A novel compression algorithm decision method for spark shuffle process. 2931-2940 - Lili Xu, Edin Muharemagic, Amy W. Apon:
ECL-watch: A big data application performance tuning tool in the HPCC systems platform. 2941-2950 - Huayi Fang, Baijian Yang
, Tonglin Zhang:
Finding the best box-cox transformation from massive datasets on spark. 2951-2960 - Elisa Bertino, Geeth de Mel, Alessandra Russo
, Seraphin B. Calo, Dinesh C. Verma:
Community-based self generation of policies and processes for assets: Concepts and research directions. 2961-2969 - Seraphin B. Calo, Emil Lupu, Elisa Bertino, Saritha Arunkumar, Gregory H. Cirincione, Brian Rivera, Alan Cullen:
Research challenges in dynamic policy-based autonomous security. 2970-2973 - Tiziana Catarci, Monica Scannapieco, Marco Console, Camil Demetrescu:
My (fair) big data. 2974-2979 - Supriyo Chakraborty, Wentao Robin Ouyang, Mani B. Srivastava
:
LightSpy: Optical eavesdropping on displays using light sensors on mobile devices. 2980-2989 - Emre Göynügür, Murat Sensoy
, Geeth de Mel:
Combining semantic web and IoT to reason with health and safety policies. 2990-2997 - Erisa Karafili, Emil C. Lupu, Alan Cullen, Bill Williams, Saritha Arunkumar, Seraphin B. Calo:
Improving data sharing in data rich environments. 2998-3005 - Antara Palit, Mudhakar Srivatsa, Raghu K. Ganti, Christopher Simpkin:
Identifying sensor accesses from service descriptions. 3006-3011 - Seraphin B. Calo, Maroun Touma, Dinesh C. Verma, Alan Cullen:
Edge computing architecture for applying AI to IoT. 3012-3016 - Dinesh C. Verma, Graham A. Bent:
Policy enabled caching for distributed AI. 3017-3023 - Hussain Z. Al-Ajmi:
Case: Big geosciences data validation challenges and achievements. 3024-3030 - Priyaa Thavasimani
, Jacek Cala, Paolo Missier
:
Why-Diff: Explaining differences amongst similar workflow runs by exploiting scientific metadata. 3031-3041 - Benjamin E. Bagozzi, Ore Koren:
Using machine learning methods to identify atrocity perpetrators. 3042-3051 - Shouji Fujimoto
, Atushi Ishikawa, Takayuki Mizuno:
Comparison between spatial distributions of tweet base and population in Japan. 3052-3057 - Masanori Fujita
, Hiroto Inoue, Takao Terano:
Evaluating funding programs through network centrality measures of co-author networks of technical papers. 3058-3063 - Kouki Hayashi, Eiichi Umehara, Yuuki Ogawa:
Analysis of twitter messages about the osaka metropolis plan in Japan. 3064-3070 - Ayae Ide, Kazuya Yamashita, Yoichi Motomura, Takao Terano:
Analyzing regional characteristics of living activities of elderly people from large survey data with probabilistic latent spatial semantic structure modeling. 3071-3077 - Akira Ishii, Takayuki Mizuno, Yasuko Kawahata:
Position-sensitive propagation of information on social media using social physics approach. 3078-3085 - Shotaro Ito, Koji Eguchi:
Time dependent analysis of financial networks using supervised latent feature relational models. 3086-3090 - Mitsuki Murase, Masanori Takano, Reiji Suzuki, Takaya Arita:
A statistical analysis of behavioral bursts occurring in a social networking game. 3091-3097 - Daniel Rajchwald, Natasha Markuzon, Edoardo M. Airoldi:
Bias reduction of peer influence effects with latent coordinates and community membership. 3098-3103 - Takuto Sakamoto
, Hiroki Takikawa:
Cross-national measurement of polarization in political discourse: Analyzing floor debate in the U.S. the Japanese legislatures. 3104-3110 - Yuya Shibuya
:
Mining social media for disaster management: Leveraging social media data for community recovery. 3111-3118 - Jinsei Shima, Mitsuo Yoshida
, Kyoji Umemura:
When do users change their profile information on twitter? 3119-3122 - Nadiya Straton
, Ravi Vatrapu
, Raghava Rao Mukkamala
:
Facebook and public health: A study to understand facebook post performance with organizations' strategy. 3123-3132 - Hirohiko Suwa
, Yuki Ogawa, Eiichi Umehara, Kento Kakigi, Keiichi Yasumoto
, Tatsuo Yamashita, Kota Tsubouchi
:
Develop method to predict the increase in the Nikkei VI index. 3133-3138 - Masanori Takano, Hiroki Mizukami, Fujio Toriumi
, Makoto Takeuchi, Kazuya Wada, Masahiro Yasuda, Ichiro Fukiida:
Analysis of the changes in listening trends of a music streaming service. 3139-3142 - Hiroki Takikawa, Kikuko Nagayoshi:
Political polarization in social media: Analysis of the "Twitter political field" in Japan. 3143-3150 - Toshimichi Wakabayashi, Yasuko Kawahata, Akira Ishii:
Analysis of EXILE TRIBE in the music scene using mathematical model of hit phenomenon. 3151-3155 - Kenta Yamada, Takayuki Mizuno:
Relationships between market impact characteristics and order book properties. 3156-3161 - Kenta Yamada:
Detecting two types of seasonal words using simple autocorrelation analysis. 3162-3167 - Take Yo, Kazutoshi Sasahara
:
Inference of personal attributes from tweets using machine learning. 3168-3174 - Jacob Bolewski, Stavros Papadopoulos:
Managing massive multi-dimensional array data with TileDB: - Invited demo paper. 3175-3176 - Subhasis Dasgupta
, Charles McKay, Amarnath Gupta:
Generating polystore ingestion plans - A demonstration with the AWESOME system. 3177-3179 - Hayden Jananthan, Ziqi Zhou, Vijay Gadepally, Dylan Hutchison, Suna Kim, Jeremy Kepner:
Polystore mathematics of relational algebra. 3180-3189 - Yasar Khan, Antoine Zimmermann, Alokkumar Jha, Dietrich Rebholz-Schuhmann, Ratnesh Sahay:
Querying web polystores. 3190-3195 - Antonios Makris
, Konstantinos Tserpes
, Dimosthenis Anagnostopoulos:
A novel object placement protocol for minimizing the average response time of get operations in distributed key-value stores. 3196-3205 - Jonathan Rivers:
SciDB: An array-native computational database for heterogeneous, multi-dimensional data sets. 3206-3210 - Ran Tan, Rada Chirkova, Vijay Gadepally, Timothy G. Mattson:
Enabling query processing across heterogeneous data models: A survey. 3211-3220 - Ashwin Kumar Vajantri, Kunwar Deep Singh Toor, Edmon Begoli
, Jack Bates:
An apache calcite-based polystore variation for federated querying of heterogeneous healthcare sources. 3221-3227 - Jose Luis, Guerrero Cusumano:
A detection mechanism with text mining cross correlation approach. 3228-3232 - Gürdal Ertek
, Xu Chi, Allan N. Zhang
, Sobhan Asian
:
Text mining analysis of wind turbine accidents: An ontology-based framework. 3233-3241 - Aloysious J. L. Lee, D. Paul, W. J. Yan, Allan N. Zhang
, Mark Goh
:
A model for analysing a disrupted supply chain's time-to-recovery under uncertainty. 3242-3247 - Yong Oh Lee, Jun Jo, Jongwoon Hwang:
Application of deep neural network and generative adversarial network to industrial maintenance: A case study of induction motor fault detection. 3248-3253 - Haoye Lu, Anand Srinivasan, Amiya Nayak
:
Learning automata based method for solving demand and supply problem with periodic behaviors. 3254-3260 - Nigel Pugh, Lauren B. Davis
:
Forecast and analysis of food donations using support vector regression. 3261-3267 - Murat Mustafa Tunç, Alexandru Valcov, Allan N. Zhang
, Wenjing Yan, Rong Wen:
Association analysis of supply chain risk and company sales. 3268-3277 - Rong Wen, Wenjing Yan, Allan N. Zhang
:
Adaptive spatio-temporal mining for route planning and travel time estimation. 3278-3284 - Yi-Hsin Wu, Sheng-De Wang, Li-Jung Chen, Cheng-Juei Yu:
Streaming analytics processing in manufacturing performance monitoring and prediction. 3285-3289 - Dazhi Yang
, Allan N. Zhang
, Wenjing Yan:
Performing literature review using text mining, Part I: Retrieving technology infrastructure using Google Scholar and APIs. 3290-3296 - Dazhi Yang
, Jihoon Hong:
Performing literature review using text mining, Part II: Expanding domain knowledge with abbreviation identification. 3297-3301 - Md. Maksudul Alam
, Kalyan S. Perumalla
:
GPU-based parallel algorithm for generating massive scale-free networks using the preferential attachment model. 3302-3311 - Md Hasanuzzaman Bhuiyan, Maleq Khan, Madhav V. Marathe:
A parallel algorithm for generating a random graph with a prescribed degree sequence. 3312-3321 - Florian Demesmaeker, Amine Ghrab, Siegfried Nijssen
, Sabri Skhiri:
Discovering interesting patterns in large graph cubes. 3322-3331 - Colleen Heinemann, Talita Perciano, Daniela Ushizima, E. Wes Bethel:
Distributed memory parallel Markov random fields using graph partitioning. 3332-3341 - Weiyi Liu, Toyotaro Suzumura, Lingli Chen, Guangmin Hu:
A generalized incremental bottom-up community detection framework for highly dynamic graphs. 3342-3351 - Hannu Reittu
, Ilkka Norros:
Regular decomposition of large graphs and other structures: Scalability and robustness towards missing data. 3352-3357 - Xiangnan Ren, Olivier Curé, Hubert Naacke, Jérémy Lhez, Ke Li:
StriderR: Massive and distributed RDF graph stream reasoning. 3358-3367 - Akira Tanaka, Nozomi Hata, Nariaki Tateiwa, Katsuki Fujisawa
:
Practical approach to evacuation planning via network flow and deep learning. 3368-3377 - Adil Alim, Aparna Joshi, Feng Chen, Catherine T. Lawson:
Techniques for efficient detection of rapid weather changes and analysis of their impacts on a highway network. 3378-3387 - Elena Baralis, Andrea Dalla Valle, Paolo Garza, Claudio Rossi
, Francesco Scullino:
SQL versus NoSQL databases for geospatial applications. 3388-3397 - Savitha Baskaran, Shiaofen Fang, Shenhui Jiang:
Spatiotemporal visualization of traffic paths using color space time curve. 3398-3405 - Peter Baumann
, Eric Hirschorn, Joan Masó, Vlad Merticariu, Dimitar Misev:
All in One: Encoding spatio-temporal big data in XML, JSON, and RDF without information loss. 3406-3415 - Thaleia Dimitra Doudali
, Ioannis Konstantinou
, Nectarios Koziris:
Spaten: A spatio-temporal and textual big data generator. 3416-3421 - Ronald D. Hagan, Charles A. Phillips, Michael A. Langston, Bradley J. Rhodes:
Multiscale graph theoretical tools reveal subtle patterns in big geospatial data. 3422-3425 - Masahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Masaru Kitsuregawa:
Optimal viewpoint finding for 3D visualization of spatio-temporal vehicle trajectories on caution crossroads detected from vehicle recorder big data. 3426-3434 - Kulsawasd Jitkajornwanich
, Peerapon Vateekul
, Teerapong Panboonyuen
, Siam Lawawirojwong, Siwapon Srisonphan:
Road map extraction from satellite imagery using connected component analysis and landscape metrics. 3435-3442 - Sangchul Kim, Junhee Lee, Taehoon Kim, Bongki Moon:
Scalable parallel data loading in SciDB. 3443-3446 - Zhicheng Liu
, Jun Cao, Junyan Yang, Qiao Wang:
Discovering dynamic patterns of urban space via semi-nonnegative matrix factorization. 3447-3453 - Adway Mitra:
Identifying coherent anomalies in multi-scale spatio-temporal data using Markov random fields. 3454-3460 - Rene Richard
, Suprio Ray:
A tale of two cities: Analyzing road accidents with big spatial data. 3461-3470 - Victor Saquicela
, Luis Manuel Vilches Blázquez
, Andrés Tello
:
Challenges and trends about smart big geospatial data: A position paper. 3471-3475 - Purnima Shah
, Deepak B. Hiremath, Sanjay Chaudhary:
Towards development of spark based agricultural information system including geo-spatial data. 3476-3481 - Dongbo Zhou, Hao Li, Sannyuya Liu, Bo Song, Xiaohua Tony Hu:
A map-based visual analysis method for patterns discovery of mobile learning in education with big data. 3482-3491 - Mehdi Assefi, Ehsun Behravesh, Guangchi Liu, Ahmad Pahlavan Tafti:
Big data machine learning using apache spark MLlib. 3492-3498 - Christophe Cérin, Jean-Luc Gaudiot, Mustapha Lebbah, Foutse Yuehgoh
:
Return of experience on the mean-shift clustering for heterogeneous architecture use case. 3499-3507 - Alex Kaplunovich, Yelena Yesha:
Cloud big data decision support system for machine learning on AWS: Analytics of analytics. 3508-3516 - Hui Zhang, Yiwen Zhong, Juan Lin:
Divide-and-conquer strategies for large-scale simulations in R. 3517-3523 - Mihaela Malita, Gheorghe M. Stefan:
Map-scan node accelerator for big-data. 3524-3529 - Cuong Nguyen, Charles Lovering, Rodica Neamtu:
Ranked time series matching by interleaving similarity distances. 3530-3539 - Sergiy Peredriy, Deovrat Kakde, Arin Chaudhuri:
Kernel bandwidth selection for SVDD: The sampling peak criterion method for large data. 3540-3549 - Hong Yan, Zhongqiang Zhang
, Jian Zou:
An online spatio-temporal model for inference and predictions of taxi demand. 3550-3557 - Halim Abbas
, Ford Garberson
, Eric Glover, Dennis P. Wall:
Machine learning for early detection of autism (and other conditions) using a parental questionnaire and home video screening. 3558-3561 - Ravi Santosh Arvapally, Hasan Hicsasmaz, Wally Lo Faro:
Artificial intelligence applied to challenges in the fields of operations and customer support. 3562-3569 - Ricardo Baeza-Yates
:
Semantic search (invited talk). 3570 - Richard Boire:
Artificial intelligence(AI), automation, and its impact on data science. 3571-3574 - Yong Cai, Shaorong Liu, Jinlong Hu
, Guihong Bai, Shoubin Dong
:
A hybrid bipartite graph based recommendation algorithm for mobile games. 3575-3582 - Brian Johnston, Benjamin Zweig, Michael Peran, Charlie Wang, Rachel Rosenfeld:
Estimating skill fungibility and forecasting services labor demand. 3583-3585 - Eva K. Lee:
Innovation in big data analytics: Applications of mathematical programming in medicine and healthcare. 3586-3595 - Srishty Saha, Karuna P. Joshi
, Renee Frank, Michael Aebig, Jiayong Lin:
Automated knowledge extraction from the federal acquisition regulations system (FARS). 3596-3603 - Paul Squires, Harold G. Kaufman, Julian Togelius
, Catalina M. Jaramillo:
A comparative sequence analysis of career paths among knowledge workers in a multinational bank. 3604-3612 - Xin Xu Lei
, Tang Venkat Rangan:
Hitting your number or not? A robust & intelligent sales forecast system. 3613-3622 - Atsushi Yamada, Michael Peran:
Governance framework for enterprise analytics and data. 3623-3631 - Anja Evelyn Amundsen, Kenneth M. Ovens:
Forensics analysis of Wi-Fi communication traces in mobile devices. 3632-3637 - Sreyasee Das Bhattacharjee
, Bala Venkatram Balantrapu, William J. Tolone, Ashit Talukder
:
Identifying extremism in social media with multi-view context-aware subset optimization. 3638-3647 - Isuf Deliu, Carl Leichter, Katrin Franke:
Extracting cyber threat intelligence from hacker forums: Support vector machines versus convolutional neural networks. 3648-3656 - Asif Iqbal, Mathias Ekstedt, Hanan Alobaidli:
Exploratory studies into forensic logs for criminal investigation using case studies in industrial control systems in the power sector. 3657-3661 - Pierre Lison, Vasileios Mavroeidis:
Neural reputation models learned from passive DNS data. 3662-3671 - Andrii Shalaginov, Jan William Johnsen, Katrin Franke:
Cyber crime investigations in the era of big data. 3672-3676 - Shih-Chieh Su:
Topical behavior prediction from massive logs. 3677-3683 - Peter Xenopoulos:
Introducing DeepBalance: Random deep belief network ensembles to address class imbalance. 3684-3689 - Haohua Sun Yin, Ravi Vatrapu
:
A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learning. 3690-3699 - Joshua Sablatura, Bing Zhou:
Forensic database reconstruction. 3700-3704 - Conrad Bielski
, V. O'Brien, C. Whitmore, Kaisa Riikka Ylinen, I. Juga, Pertti Nurmi, Juha Pekka Kilpinen, I. Porras, J. M. Sole, P. Gamez, M. Navarro, Azra Alikadic
, Andrea Gobbi
, Cesare Furlanello, Gunter Zeug, M. Weirathe, J. Martinez, R. Yuste, S. Castro, V. Moreno, T. Velin, Claudio Rossi
:
Coupling early warning services, crowdsourcing, and modelling for improved decision support and wildfire emergency management. 3705-3712 - Luca Cagliero
:
Summarization of emergency news articles driven by relevance feedback. 3713-3721 - Evelina Di Corso, Francesco Ventura, Tania Cerquitelli:
All in a twitter: Self-tuning strategies for a deeper understanding of a crisis tweet collection. 3722-3726 - Antonella Frisiello
, Quynh Nhu Nguyen, Claudio Rossi
:
Gamified crowdsourcing for disaster risk management. 3727-3733 - Andrea Gobbi
, Azra Alikadic
, Kaisa Riikka Ylinen, Federico Angaramo, Cesare Furlanello:
A heat wave forecast system for Europe. 3734-3738 - Jacopo Longhini, Claudio Rossi
, Claudio Casetti, Federico Angaramo:
A language-agnostic approach to exact informative tweets during emergency situations. 3739-3475 - Laura Lopez-Fuentes
, Claudio Rossi
, Harald Skinnemoen:
River segmentation for flood monitoring. 3746-3749 - Timothy Nugent, Fabio Petroni, Natraj Raman, Lucas Carstens, Jochen L. Leidner
:
A comparison of classification models for natural disaster and critical event detection from news. 3750-3759 - Jasmin Pielorz
, Matthias Prandtstetter
, Markus Straub
, Christoph H. Lampert:
Optimal geospatial volunteer allocation needs realistic distances. 3760-3763 - Tomoichi Takahashi, Katsuki Ichinose:
Crowd control and evacuation guidance based on simulations. 3764-3768 - Francesco Tarasconi, Michela Farina, Antonio Mazzei, Alessio Bosca:
The role of unstructured data in real-time disaster-related social media monitoring. 3769-3778 - Luca Venturini
, Evelina Di Corso:
Analyzing spatial data from twitter during a disaster. 3779-3783 - Marco Brambilla
, Paolo Mascetti, Andrea Mauri
:
Comparison of different driving style analysis approaches based on trip segmentation over GPS information. 3784-3791 - Qian Fu
, John M. Easton
:
Understanding data quality: Ensuring data quality by design in the rail industry. 3792-3799 - Emmanuel Nii Martey
, Ahmed Lasisi, Nii O. Attoh-Okine:
Track geometry big data analysis: A machine learning approach. 3800-3809 - Federico Perrotta, Tony Parry
, Luís C. Neves
:
Application of machine learning for fuel consumption modelling of trucks. 3810-3815 - Gene P. K. Wu, Keith C. C. Chan:
Privacy-preserving trajectory classification of driving trip data based on pattern discovery techniques. 3816-3825 - Jerzy Bala, Michael Kellar, Fred Ramberg:
Predictive analytics for litigation case management. 3826-3830 - Han Qin, Kit Riehle, Haozhen Zhao:
Using google analytics to support cybersecurity forensics. 3831-3834 - Thanasis Schoinas, Ghulam Qadir:
A feasibility experiment on the application of predictive coding to instant messaging corpora. 3835-3840 - Alexander Acker, Florian Schmidt, Anton Gulenko, Reinhard Kietzmann, Odej Kao:
Patient-individual morphological anomaly detection in multi-lead electrocardiography data streams. 3841-3846 - Fahima Amin Bhuyan, Shiyong Lu, Ishtiaq Ahmed, Jia Zhang:
Predicting efficacy of therapeutic services for autism spectrum disorder using scientific workflows. 3847-3856 - Elham Hassanain:
A multimedia big data retrieval framework to detect dyslexia among children. 3857-3860 - Wei Hong Lee, En Tzu Wang, Arbee L. P. Chen:
Mining accompanying relationships between diseases from patient records. 3861-3868 - Ning Liu, Soundar R. T. Kumara, Eric Reich:
Explainable data-driven modeling of patient satisfaction survey data. 3869-3876 - Goutam Mylavarapu, Johnson P. Thomas:
A multi-task machine learning approach for comorbid patient prioritization. 3877-3881 - Xianjun Shen, Xianchao Zhu
, Xingpeng Jiang, Li Gao, Tingting He, Xiaohua Hu:
Visualization of non-metric relationships by adaptive learning multiple maps t-SNE regularization. 3882-3887 - Ahmad Pahlavan Tafti, Ehsun Behravesh, Mehdi Assefi, Eric LaRose, Jonathan C. Badger, John Mayer, AnHai Doan, David Page, Peggy L. Peissig:
bigNN: An open-source big data toolkit focused on biomedical sentence classification. 3888-3896 - Shahab Tayeb, Matin Pirouz, Johann Sun, Kaylee Hall, Andrew Chang, Jessica Li, Connor Song, Apoorva Chauhan, Michael Ferra, Theresa Sager, Justin Zhan, Shahram Latifi:
Toward predicting medical conditions using k-nearest neighbors. 3897-3903 - Anuja Tike, Sanket Tavarageri:
A medical price prediction system using hierarchical decision trees. 3904-3913 - Iulian Voicu, Denis Kouame:
High dimensional data processing for fetal activity evaluation. 3914-3915 - Lina Yu, Hengle Jiang, Hongfeng Yu, Chi Zhang
, Josiah Mcallister, Dandan Zheng:
iVAR: Interactive visual analytics of radiomics features from large-scale medical images. 3916-3923 - Xin Deng:
Big data technology and ethics considerations in customer behavior and customer feedback mining. 3924-3927 - Duyen Do, Phuc Huynh, Phuong Vo, Tu Vu:
Customer churn prediction in an internet service provider. 3928-3933 - Michael Kranzlein, Dan Chia-Tien Lo:
Training on the poles for review sentiment polarity classification. 3934-3937 - Pegah Nokhiz, Fengjun Li
:
Understanding rating behavior based on moral foundations: The case of Yelp reviews. 3938-3945 - Yixuan Qiu, Wutao Wei:
A scalable sequential principal component analysis algorithm (SeqPCA) with application to user access control analysis. 3946-3754 - Ross Smith:
Towards an ethical application of customer feedback data. 3955-3957 - Wutao Wei, Le Zhang, Qi Ding, Bingrou Zhou:
Dynamic Bayesian predictive model for box office forecasting. 3958-3964 - Donghui Wu:
A big data analytics framework for forecasting rare customer complaints: A use case of predicting MA members' complaints to CMS. 3965-3967 - Yizhou Zang, Xiaohua Hu:
Heterogeneous knowledge transfer via domain regularization for improving cross-domain collaborative filtering. 3968-3974 - Paulo S. C. Alencar, Donald D. Cowan, Douglas W. Mulholland, Bruce MacVicar
, Simon Courtenay, Stephen Murphy, Fred McGarry:
iEnvironment: A software platform for integrated environmental monitoring and modeling of surface water. 3975-3978 - Rumi Chunara:
New data paradigms: From the crowd and back. 3979-3980 - Holden Karau:
Unifying the open big data world: The possibilities∗ of apache BEAM. 3981 - Georgia D. Tourassi:
Deep learning enabled national cancer surveillance. 3982-3983 - Lee Wilson
, Adrienne Colborne, Michael Smit
:
Preparing data managers to support open ocean science: Required competencies, assessed gaps, and the role of experiential learning. 3984-3993 - Xuan Zhou, Wenjun Wu, Yong Han:
Modeling multiple subskills by extending knowledge tracing model using logistic regression. 3994-4003 - Tsumugi Tairaku, Akihiro Nakao, Saneyasu Yamaguchi, Masato Oguchi:
Application specific traffic control using network virtualization node in large-scale disasters. 4004-4009 - Martino Trevisan
, Idilio Drago
, Marco Mellia
, Maurizio M. Munafò:
Automatic detection of DNS manipulations. 4010-4015 - Luca Vassio
, Marco Mellia
, Flavio V. D. de Figueiredo, Ana Paula Couto da Silva
, Jussara M. Almeida:
Mining and modeling web trajectories from passive traces. 4016-4021 - Richard de Groof, Haiping Xu:
Automatic topic discovery of online hospital reviews using an improved LDA with Variational Gibbs Sampling. 4022-4029 - Noriaki Koide, Yu Ichifuji:
Fragrance to vector as scent technology. 4030-4034 - Deepak Kumar
, Chetan Kumar, Ming Shao:
Cross-database mammographic image analysis through unsupervised domain adaptation. 4035-4042 - Christine Bassem
, Azer Bestavros:
GuideMe: Routes coordination of participating agents in mobile crowd sensing platforms. 4043-4049 - Yimin Chen, Jin Wen
:
A whole building fault detection using weather based pattern matching and feature based PCA method. 4050-4057 - Donald D. Cowan, Paulo S. C. Alencar, Kyle Young, Bryan Smale, Ryan Erb, Fred McGarry:
A model for the socially smart city practical uses of city-level socio-economic indicators. 4058-4067 - Mickael Figueredo, Nélio Cacho, Antonio Thome, Andréa Cacho, Frederico Lopes
, Maria Valeria Araujo:
Using social media photos to identify tourism preferences in smart tourism destination. 4068-4073 - Paul G. Flikkema, Morgan Vigil-Hayes:
Self-adaptive and resilient urban networking infrastructure for disasters and smart city services. 4074-4079 - Kyoichi Ito, Masaki Ito, Kosuke Miyazaki, Keishi Tanimoto, Kaoru Sezaki:
Data analysis on train transportation data with nonnegative matrix factorization. 4080-4085 - Anderson Araujo, Rubem Kalebe, Gustavo Girão, Itamir Filho
, Kayo Goncalves, Bianor Neto:
Reliability analysis of an IoT-based smart parking application for smart cities. 4086-4091 - Makoto Kawano, Kazuhiro Mikami, Satoshi Yokoyama, Takuro Yonezawa
, Jin Nakazawa:
Road marking blur detection with drive recorder. 4092-4097 - Yasue Kishino, Koh Takeuchi, Yoshinari Shirai, Futoshi Naya, Naonori Ueda:
Datafying city: Detecting and accumulating spatio-temporal events by vehicle-mounted sensors. 4098-4104 - Takahiro Komamizu, Jin Nakazawa, Toshiyuki Amagasa
, Hiroyuki Kitagawa
, Hideyuki Tokuda:
Analytical toolbox for smart city applications: Garbage collection log use case. 4105-4110 - Shuhua Liu, Patrick Jansson:
City event detection from social media with neural embeddings and topic model visualization. 4111-4116 - Zohreh Pourzolfaghar
, Markus Helfert
, Viviana Angely Bastidas Melo, Ahmad Khalilijafarabad:
Proposing an access gate to facilitate knowledge exchange for smart city services. 4117-4122 - Naoya Shibahara, Ryoma Kondo, Masayuki Iwai:
MM360: A GPS-assisted 360-degree video sharing system for participatory events. 4123-4127 - Jonathan Creighton, Farhana H. Zulkernine:
Towards building a hybrid model for predicting stock indexes. 4128-4133 - Dongmei Guo, Jialong Zheng, Xiaolan Yang:
Agglomeration, network and urban development - - A study on newspaper connection network index of cities. 4134-4141 - Lin Huo, Xiaoli Sun:
An augmented fama and french three-factor model using social interaction. 4142-4147 - Quan Jin, Kun Guo
, Yi Sun:
Stock price forecasting using support vector regression: Based on network behavior data. 4148-4153 - Daniel Muller, Yiea-Funk Te:
Insurance premium optimization using motor insurance policies - A business growth classification approach. 4154-4158 - Daniel Muller, Yiea-Funk Te, Pratiksha Jain:
Predicting business performance through patent applications. 4159-4164 - Shaolong Sun
, Shouyang Wang
, Yunjie Wei, Xianduan Yang, Kwok-Leung Tsui:
Forecasting tourist arrivals with machine learning and internet search index. 4165-4169 - Minggang Wang
, André L. M. Vilela
, Lixin Tian, Hua Xu
, Ruijin Du:
A new time series prediction method based on complex network theory. 4170-4175 - Jinxin Wang, Wei Shang, Zhengyang Liu, Shouyang Wang
:
An enhanced LGSA-SVM for S&P 500 index forecast. 4176-4183 - Yunjie Wei, Xun Zhang, Shouyang Wang
:
Can search data help forecast inflation? Evidence from a 13-country panel. 4184-4188 - Qingqing Zhang, Darren Jian, Rui Xu, Wei Dai, Ying Liu:
Integrating heterogeneous data sources for traffic flow prediction through extreme learning machine. 4189-4194 - Guihuan Zheng, Qikun Yao, Xingfen Wang, Zhou Yang:
The construction and application of expectations index on monetary policy. 4199-4203 - Giuseppe Bruno, Demetrio Condello, Alberto Falzone, Andrea Luciani
:
Big data processing: Is there a framework suitable for economists and statisticians? 4204-4211 - Anne M. Denton, Arighna Roy:
Cluster-overlap algorithm for assessing preprocessing choices in environmental sustainability. 4212-4220 - Chu-hua Kuei, Christian N. Madu, Picheng Lee:
Critical enablers of sustainable water management (SWM): Text evidences from 10 countries. 4221-4227 - Aki-Hiro Sato
:
Characterization of cities based on world grid square statistics about specific properties. 4228-4237 - Aki-Hiro Sato
, Shoki Nishimura, Hiroe Tsubaki:
World grid square codes: Definition and an example of world grid square data. 4238-4247 - Hiroshi Tsuda, Masakazu Ando, Yu Ichifuji:
Statistical analysis of hotel plan popularity in regional tourist areas. 4248-4254 - Craig S. Wright, Antoaneta Serguieva:
Sustainable blockchain-enabled services: Smart contracts. 4255-4264 - Ailun Ye, Venkata L. Raju Chinthalapati, Antoaneta Serguieva, Edward P. K. Tsang:
Developing sustainable trading strategies using directional changes with high frequency data. 4265-4271 - Arunkumar Bagavathi
, Pranava Mummoju, Katarzyna A. Tarnowska
, Angelina A. Tzacheva, Zbigniew W. Ras:
SARGS method for distributed actionable pattern mining using spark. 4272-4281 - I-Cheng Chang, Yudi Pratama Halim, Chun-Man Lin:
Vehicle path estimation using dual-level clustering and multi-source prediction. 4282-4286 - Helena F. Deus
, Corey A. Harper, Darin McBeath, Ron Daniel Jr.:
Combining pattern matching with word embeddings for the extraction of experimental variables from scientific literature. 4287-4292 - Kulsawasd Jitkajornwanich
, Peerapon Vateekul
, Upa Gupta, Teeranai Kormongkolkul, Arnon Jirakittayakorn, Siam Lawawirojwong, Siwapon Srisonphan:
Ocean surface current prediction based on HF radar observations using trajectory-oriented association rule mining. 4293-4300 - Liling Li, Tyler Danner, Jesse Eickholt, Erin McCann, Kevin Pangle, Nicholas Johnson:
A distributed pipeline for DIDSON data processing. 4301-4306 - Tse-Yu Pan, Yi-Zhu Dai, Wan-Lun Tsai, Min-Chun Hu:
Deep model style: Cross-class style compatibility for 3D furniture within a scene. 4307-4313 - A. Aziz Altowayan, Ashraf Elnagar
:
Improving Arabic sentiment analysis with sentiment-specific embeddings. 4314-4320 - Jose Berengueres
, Dani Castro:
Differences in emoji sentiment perception between readers and writers. 4321-4328 - Patrick Jansson, Shuhua Liu:
Topic modelling enriched LSTM models for the detection of novel and emerging named entities from social media. 4329-4336 - Bingjing Jia, Bin Wu, Jinna Lv, Pengpeng Zhou, Yao Bu, Ying Xing:
An entity disambiguation method based on LeaderRank. 4337-4342 - Nicolai Pogrebnyakov, Edgar A. Maldonado:
Identifying emergency stages in facebook posts of police departments with convolutional and recurrent neural networks and support vector machines. 4343-4352 - Ian Stewart, Stevie Chancellor
, Munmun De Choudhury, Jacob Eisenstein:
#Anorexia, #anarexia, #anarexyia: Characterizing online community practices with orthographic variation. 4353-4361 - Joseph A. Cottam, Leslie M. Blaha, Dimitri Zarzhitsky, Mathew Thomas, Elliott Skomski:
Crossing the Streams: Fuzz testing with user input. 4362-4371 - Xiaoni Duan, Keishi Tajima:
Improving classification accuracy in crowdsourcing through hierarchical reorganization. 4372-4374 - Yuzuki Furuhashi, Masaki Matsubara, Atsuyuki Morishima:
Crowd-based best-effort number estimation. 4375-4377 - Austin Graham, Yan Liang, Le Gruenwald, Christan Grant
:
[Research paper] formalizing interruptible algorithms for human over-the-loop analytics. 4378-4383 - Munenari Inoguchi, Keiko Tamura, Kei Horie, Haruo Hayashi:
Clarifying the transition of workload for victims life reconstruction support programs in affected local governments using the victims master database - Comparison between the 2007 Chuetsu-oki earthquake and the 2016 Kumamoto Earthquake-. 4384-4388 - Masahiro Kazama, Viviane Takahashi:
Active preference learning for generative adversarial networks. 4389-4393 - Naoki Kobayashi, Masaki Matsubara, Keishi Tajima, Atsuyuki Morishima:
A crowd-in-the-loop approach for generating conference programs with microtasks. 4394-4396 - Koyo Kobayashi, Hidehiko Shishido, Yoshinari Kameda, Itaru Kitahara:
Method to generate disaster-damage map using 3D photometry and crowd sourcing. 4397-4399 - Takahiro Komamizu, Toshiyuki Amagasa
, Hiroyuki Kitagawa
:
Implicit order join: Joining log data with property data by discovering implicit order-oriented keys with human assistance. 4400-4406 - Mamiko Matsubayashi, Keiko Kurata
:
Conceptual design for comprehensive research support platform: Successful research data management generating big data from little data. 4407-4409 - Yoshitaka Matsuda, Yu Suzuki, Satoshi Nakamura:
A trade-off between estimation accuracy of worker quality and task complexity. 4410-4416 - Hiroki Morise, Satoshi Oyama, Masahito Kurihara:
Collaborative filtering and rating aggregation based on multicriteria rating. 4417-4422 - Michalis Papakostas, Konstantinos Tsiakas, Theodoros Giannakopoulos, Fillia Makedon:
Towards predicting task performance from EEG signals. 4423-4425 - Hidehiko Shishido, Yutaka Ito
, Youhei Kawamura, Toshiya Matsui, Atsuyuki Morishima, Itaru Kitahara:
Proactive preservation of world heritage by crowdsourcing and 3D reconstruction technology. 4426-4428 - Panote Siriaraya, Yuriko Yamaguchi, Mimpei Morishita, Yoichi Inagaki, Reyn Y. Nakamoto, Jianwei Zhang, Junichi Aoi, Shinsuke Nakajima:
Using categorized web browsing history to estimate the user's latent interests for web advertisement recommendation. 4429-4434 - Keiko Tamura, Naoshi Hirata:
"DEKATSU" activity of data and service collaboration among private companies and academic institutions for Tokyo metropolitan resilience project. 4435-4437 - Agniva Banerjee, Karuna Pande Joshi
:
Link before you share: Managing privacy policies through blockchain. 4438-4447 - Ruth Bearden, Dan Chia-Tien Lo:
Automated microsoft office macro malware detection using machine learning. 4448-4452 - Alina Campan, Alfredo Cuzzocrea, Traian Marius Truta:
Fighting fake news spread in online social networks: Actual trends and future research directions. 4453-4457 - Anthony Carella, Murat Kotsoev, Traian Marius Truta:
Impact of security awareness training on phishing click-through rates. 4458-4466 - Alfredo Cuzzocrea, Hossain Shahriar
:
Data masking techniques for NoSQL database security: A systematic review. 4467-4473 - Alfredo Cuzzocrea, Fabio Martinelli, Francesco Mercaldo, Gianni Viardo Vercelli
:
Tor traffic analysis and detection via machine learning techniques. 4474-4480 - Anirban Das, Min-Yi Shen, Jisheng Wang:
Modeling user communities for identifying security risks in an organization. 4481-4486 - Philip Derbeko, Shlomi Dolev
, Ehud Gudes, Jeffrey D. Ullman:
Efficient and private approximations of distributed databases calculations. 4487-4496 - Kangsoo Jung, Seog Park:
Collaborative caching techniques for privacy-preserving location-based services in peer-to-peer environments. 4497-4506 - Haya Shajaiah, Ahmed Abdelhadi, Charles Clancy:
Secure power scheduling auction for smart grids using homomorphic encryption. 4507-4512 - Ugur Sopaoglu, Osman Abul
:
A top-down k-anonymization implementation for apache spark. 4513-4521 - Shahab Tayeb, Matin Pirouz, Gabriel Esguerra, Kimiya Ghobadi, Jimson Huang, Robin Hill, Derwin Lawson, Stone Li, Tiffany Zhan, Justin Zhan, Shahram Latifi:
Securing the positioning signals of autonomous vehicles. 4522-4528 - Trishita Tiwari, Ata Turk, Alina Oprea, Katzalin Olcoz
, Ayse K. Coskun:
User-profile-based analytics for detecting cloud security breaches. 4529-4535 - Conrad M. Albrecht
, Marcus Freitag, Theodore G. van Kessel, Siyuan Lu, Hendrik F. Hamann:
Event clustering & event series characterization on expected frequency. 4536-4541 - Roger N. Anderson:
'Petroleum Analytics Learning Machine' for optimizing the Internet of Things of today's digital oil field-to-refinery petroleum system. 4542-4545 - Hung Cao
, Monica Wachowicz, Sangwhan Cha:
Developing an edge computing platform for real-time descriptive analytics. 4546-4554 - Domitille Couloumb, Charbel El Kaed, Ayush Garg, Chris Healey, Jonathan Healey, Stuart Sheehan:
Energy efficiency driven by a storage model and analytics on a multi-system semantic integration. 4555-4561 - Aurora González-Vidal
, Alfonso P. Ramallo-González, Fernando Terroso-Saenz
, Antonio F. Skarmeta
:
Data driven modeling for energy consumption prediction in smart buildings. 4562-4569 - Christoph A. Keller, Mathew J. Evans
, J. Nathan Kutz, Steven Pawson
:
Machine learning and air quality modeling. 4570-4576 - Theodore G. van Kessel, Ramachandran Muralidhar, Josephine B. Chang, Jun-Song Wang, Michael A. Schappert, Hendrik F. Hamann:
A low maintenance particle pollution sensing system using the Minimum Airflow Particle Counter (MAPC). 4577-4582 - Levente J. Klein, Theodore G. van Kessel, Dhruv Nair, Ramachandran Muralidhar, Nigel Hinds, Hendrik F. Hamann, Norma E. Sosa:
Distributed wireless sensing for fugitive methane leak detection. 4583-4591 - Joshua Lieberman
, Alan Leidner, George Percivall, Carsten Rönsdorf:
Using big data analytics and IoT principles to keep an eye on underground infrastructure. 4592-4601 - Aekyeung Moon, Jaeyoung Kim, Jialing Zhang, Hang Liu, Seung Woo Son:
Understanding the impact of lossy compressions on IoT smart farm analytics. 4602-4611 - Dinesh C. Verma, Geeth de Mel:
Measures of network centricity for edge deployment of IoT applications. 4612-4620 - Xiaochi Zhou, Vinícius Amaral, John D. Albertson:
Source characterization of airborne emissions using a sensor network: Examining the impact of sensor quality, quantity, and wind climatology. 4621-4629 - Dabiah Ahmed Alboaneen
, Huaglory Tianfield
, Yan Zhang:
Sentiment analysis via multi-layer perceptron trained by meta-heuristic optimisation. 4630-4635 - Olga Babko-Malaya, Rebecca Cathey, Steve Hinton, David Maimon
, Taissa Gladkova:
Detection of hacking behaviors and communication patterns on social media. 4636-4641 - Adam Dalton, Bonnie J. Dorr
, Leon Liang, Kristy Hollingshead:
Improving cyber-attack predictions through information foraging. 4642-4647 - Jordan DeLoach, Doina Caragea
:
Twitter-enhanced Android malware detection. 4648-4657 - Mohammed Eslami, George Zheng, Hamed Eramian, Georgiy Levchuk:
Deriving cyber use cases from graph projections of cyber data represented as bipartite graphs. 4658-4663 - Jhu-Sin Luo, Dan Chia-Tien Lo:
Binary malware image classification using machine learning with local binary pattern. 4664-4667 - David Maimon
, Andrew Fukuda, Steve Hinton, Olga Babko-Malaya, Rebecca Cathey:
On the relevance of social media platforms in predicting the volume and patterns of web defacement attacks. 4668-4673 - Fernando Maymi, Robert Bixler, Randolph M. Jones, Scott D. Lathrop:
Towards a definition of cyberspace tactics, techniques and procedures. 4674-4679 - Hau Tran, An Nguyen, Phuong Vo, Tu Vu:
DNS graph mining for malicious domain detection. 4680-4685 - Xiaoyan Zhuo, Jialing Zhang, Seung Woo Son:
Network intrusion detection using word embeddings. 4686-4695 - Sung Whan Jeon, Hye Jin Lee, Sungzoon Cho:
Building industry network based on business text: Corporate disclosures and news. 4696-4704 - Yang Jiao, Jérémie Jakubowicz:
Predicting stock movement direction with machine learning: An extensive study on S&P 500 stocks. 4705-4713 - Naomi Simumba, Suguru Okami
, Naohiko Kohtake:
Credit decision tool using mobile application data for microfinance in agriculture. 4714-4721 - Masanori Ajito, Yasuko Kawahata, Akira Ishii:
Analysis of national election using mathematical model of hit phenomenon. 4722-4724 - Darlan Arruda
, Nazim H. Madhavji:
Towards a big data requirements engineering artefact model in the context of big data software development projects: Poster extended abstract. 4725-4726 - Shilpa Balan, Nishant Shristiraj, Vrunda Shah, Anusha Manjappa:
Big data analysis of youth tobacco smoking trends in the United States. 4727-4729 - Shaunak D. Bopardikar, George S. Eskander Ekladious:
Towards scalable kernel machines for streaming data analytics. 4730-4732 - Chaochao Chen
, Xinxing Yang, Li Wang, Jun Zhou, Xiaolong Li
:
Large scale app recommendation in Ant Financial. 4733-4735 - Ranjeet Devarakonda
, Michael Giansiracusa, Jitendra Kumar
, Harold Shanafield:
Social media based NPL system to find and retrieve ARM data: Concept paper. 4736-4737 - Mohammed Elshambakey, Mohamed Khalefa, William J. Tolone, Sreyasee Das Bhattacharjee, Huikyo Lee, Luca Cinquini, Shannon Schlueter, Isaac Cho, Wenwen Dou, Daniel J. Crichton:
Towards a distributed infrastructure for data-driven discoveries & analysis. 4738-4740 - Mohammed Eslami, George Zheng, Hamed Eramian, Georgiy Levchuk:
Anomaly detection on bipartite graphs for cyber situational awareness and threat detection. 4741-4743 - Iwao Fujino, Christophe Claramunt, Abdel-Ouahab Boudraa
:
Extracting route patterns of vessels from AIS data by using topic model. 4744-4746 - Michel Généreux, Bryor Snefjella, Marta Maslej:
Big data in psychology: Using word embeddings to study theory-of-mind. 4747-4749 - Frank R. Greguska, Thomas Huang, Brian Wilson, Nga Quach, Joe Jacob:
Analyzing big ocean science data with NEXUS. 4750 - Abdeltawab M. Hendawi, Aqeel Rustum, Mohamed H. Ali, John A. Stankovic:
Turning big spatial data into smart routing. 4751-4753 - Mauri Kaipainen, Olli Pitkänen
, Perspicamus Ab:
Human-controlled iterative subclustering analysis. 4754-4756 - Kasumi Kato, Atsuko Takefusa, Hidemoto Nakada
, Masato Oguchi:
Consideration of parallel data processing over an apache spark cluster. 4757-4759 - Yasuko Kawahata, Yukari Moriyama, Shinichirou Yamada, Mingyi Sun, Taketo Kawamura:
Analytical the large-scale collection of data on the results of the guides for foreigners visiting Japan. 4760-4764 - Saleena Khanna, Yuvraj S. Sethi, Akash R. Nambiar:
iSkin specialist - A big data based expert system for dermatology. 4765-4767 - Thomas Kitson, Paula Olaya, Elizabeth Racca, Michael R. Wyatt II, Mario Guevara
, Rodrigo Vargas
, Michela Taufer
:
Data analytics for modeling soil moisture patterns across united states ecoclimatic domains. 4768-4770 - Anusha Kola, Harshal More, Sean Soderman, Michael N. Gubanov:
Generating Unified Famous Objects (UFOs) from the classified object tables. 4771-4773 - Tai-Yeon Ku, Wan-Ki Park, Hoon Choi
:
Energy information collection mechanism using big data correlation map. 4774-4776 - Hyun-Chul Lee, Tong-Il Jang, Kwangsu Moon:
Anticipating human errors from periodic big survey data in nuclear power plants. 4777-4778 - Chen Li
, Annisa
, Asif Zaman, Yasuhiko Morimoto:
MapReduce-based computation of area skyline query for selecting good locations in a map. 4779-4782 - PrathyushaRani Merla, Yiheng Liang:
Data analysis using hadoop MapReduce environment. 4783-4785 - Kwan Hui Lim, Shanika Karunasekera, Aaron Harwood, Lucia Falzon:
Spatial-based topic modelling using wikidata knowledge base. 4786-4788 - Lixin Liu, Jun Chen:
The influences of deep-sea vision data quality on observational analysis. 4789-4791 - Amin Majd, Elena Troubitsyna:
Data-driven approach to ensuring fault tolerance and efficiency of swarm systems. 4792-4794 - Javier Mata
, Ignacio de Miguel
, Ramón J. Durán
, Juan Carlos Aguado, Noemí Merayo
, Lidia Ruiz-Perez
, Patricia Fernández
, Rubén M. Lorenzo, Evaristo J. Abril:
A SVM approach for lightpath QoT estimation in optical transport networks. 4795-4797 - Kenji Nakashima, Joichiro Kon, Saneyasu Yamaguchi, Gil Jae Lee, José A. B. Fortes:
1A study on big data I/O performance with modern storage systems. 4798-4799 - Monika Nawrocka, Marcin Lukowski:
Biofeedback EEG data integration and visualization analytics for endurance exercise practices: Data integration and visualization analytics of biofeedback EEG. 4800-4802 - Paul Le Noac'h, Alexandru Costan
, Luc Bougé:
A performance evaluation of Apache Kafka in support of big data streaming applications. 4803-4806 - Steven Ortiz, Caner Enbatan, Maksim Podkorytov
, Dylan Soderman, Michael N. Gubanov:
Hybrid.JSON: High-velocity parallel in-memory polystore JSON ingest. 4807-4809 - Kaine Black, Monica Wachowicz, Alec Parise:
Using Bi-partite graphs to cluster complex networks. 4810-4812 - Nat Pavasant, Hiroshi Furutani, Masayuki Numao, Ken-ichi Fukui:
ART-2b: Adapted ART-2a for large scale data clustering on PM2.5 mass spectra. 4813-4815 - Tayfun Pay
, Stephen Lucci:
Automatic keyword extraction: An ensemble method. 4816-4818 - Iulia Popescu, Kurt Portelli, Christos Anagnostopoulos
, Nikos Ntarmos
:
The case for graph-based recommendations. 4819-4821 - Jason Radford, Luke Horgan, David Lazer:
Baselines for demographic inference on a new gold standard twitter corpus. 4822-4823 - Jason Radford:
Piloting a theory-based approach to inferring gender in big data. 4824-4826 - Bharath K. Samanthula:
Privacy-preserving outsourced collaborative frequent itemset mining in the cloud. 4827-4829 - Shohei Shirataki, Saneyasu Yamaguchi:
A study on interpretability of decision of machine learning. 4830-4831 - Mark Simmons, Daniel Armstrong, Dylan Soderman, Michael N. Gubanov:
Hybrid.media: High velocity video ingestion in an in-memory scalable analytical polystore. 4832-4834 - Lisa Singh, Raghu Pemmaraju:
EOS: A multilingual text archive of international newspaper & blog articles. 4835-4837 - Tsumugi Tairaku, Akihiro Nakao, Saneyasu Yamaguchi, Masato Oguchi:
Application specific traffic control in large-scale disasters. 4838-4840 - Masashi Toyoda, Daisaku Yokoyama, Junpei Komiyama, Masahiko Itoh:
Road safety estimation utilizing big and heterogeneous vehicle recorder data. 4841-4842 - Sebastian Trinks, Carsten Felden:
Real time analytics - State of the art: Potentials and limitations in the smart factory. 4843-4845 - Akira Umayabara, Hayato Yamana
:
MCMalloc: A scalable memory allocator for multithreaded applications on a many-core shared-memory machine. 4846-4848 - Santiago Villasenor, Tom Nguyen, Anusha Kola, Sean Soderman, Michael N. Gubanov:
Scalable spam classifier for web tables. 4849-4851 - Jonathan Wang, Kesheng Wu
, Alex Sim
, Seongwook Hwangbo:
Accurate signal timing from high frequency streaming data. 4852-4854 - Yifang Wei, Lisa Singh:
Understanding the impact of sampling and noise on detecting events using twitter. 4855-4857 - Yoshiko Yasumura, Hiroki Imabayashi, Hayato Yamana
:
Attribute-based proxy re-encryption method for revocation in cloud data storage. 4858-4860 - Daisaku Yokoyama, Masashi Toyoda:
Towards constructing a driver management system based on large-scale driving operation records. 4861-4862 - Takuya Yonezawa, Ismail Arai
, Toyokazu Akiyama, Kazutoshi Fujikawa:
Proposal of classification method of bus operation states using sensor data. 4863-4865 - Haiyan Yu, Kun Xiang, Jiang Yu:
Understanding a moderating effect of physicians' endorsement to online workload: An empirical study in online health-care communities. 4866-4868 - Philipp Zehnder, Dominik Riemer:
Towards automatic infrastructure provisioning for highly dynamic streaming applications. 4869-4871 - Binyam A. Zemede, Byron J. Gao:
Personalized search with editable profiles. 4872-4874 - Yin Zhang, Jiming Hu:
Discovering the interdisciplinary nature of big data research. 4875-4877 - Ziwei Zhu, Weijia Xu, Wei He:
Big data system for information aggregation and model comparison for precison medicine. 4878-4880

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.