Palm Springs, California
May 6, 2016 - May 8, 2016
Early-bird Deadline: Sun, March 6th, 2016
Wed, April 6th, 2016
Extended Submission Deadline: Fri, April 29th, 2016
While intentionally interdisciplinary, this conference likely is of most interest to those with interests in:
Big Data | Statistics | Algorithms | Resampling Methods | Regression Trees | Computer Science | Programming | Open-source
Prior to the ready availability of powerful computers, insights from data analysis were limited by reliance on small but manageable data samples and by techniques derived from mathematical theory that relied upon stringent but often unrealistic assumptions. Neither constraint applies today. Today the focus is on “Big Data”, computer-intensive strategies that substitute raw computing power for abstract theory, and new algorithms to explore hidden and often unforeseen patterns in data. We encourage presentations that introduce and demonstrate novel approaches to data analysis, that glean fresh insights from data, or that compare methods.
As with all events offered by The Sisu Advantage, this event is run in parallel with another conference. The intent is to break down traditional discipline silos and foster interdisciplinary collaboration. Registration to this event allows attendance to all sessions and events of both The Pacific Conference on Statistical Computing and Data Mining
and The Pacific Conference on The Science and Art of Business
, at no additional cost.
A California Connection:
There has been a recent explosion of interest in “Big Data” and “Data Mining”, largely driven by advances in computing power that allow for implementation of innovative statistical algorithms, developments which in turn have inspired many organizations to view their data in new ways. As data has become a potential competitive advantage for organizations, rather than mere industrial waste to be discarded after completing transactions, demand for those with skills in working with massive data sets and applying new analytic techniques has risen dramatically. By any measure, demand for such skills has outstripped supply, with one recent estimate that demand for those with such skills has “increased 89.9% in the last twelve months” (http://www.forbes.com/sites/louiscolumbus/2014/12/29/where-big-data-jobs-will-be-in-2015/ ).
The map above shows the 2015 distribution of such jobs by MSA (metropolitan statistical area). The two leading MSAs both are in Northern California:
- San Jose – Sunnyvale – Santa Clara MSA
- San Francisco – Oakland – Fremont MSA.
Nationally, only four MSAs have even half as many such jobs as do each of these two.
Each conference weekend, a specialty workshop is offered that exposes attendees to open-source productivity tools that are freely available, such as "R Packages", that are specifically designed for teaching and research in the discipline(s) and topics that are the focus of that weekend.
The R Statistical Computing Environment is a free, downloadable software system for both basic and state-of-the-art data visualizations and analyses.
It is the leading programming language for newly introduced algorithms, and more than 7,000 “R Packages” are available for download to save you from needing to “reinvent the wheel” in your own work. We solicit requests from attending delegates and select one or two to present at each workshop. No prior experience with R or programming is expected.
Following are a sample of “R Packages” representative of those we cover:
- Pre-K through K-12 Education // Language Learning
→ “shiny Package”: for building interactive web data visualizations
- Community College and University Ed// Distance and Lifelong Learning
→ “knitr Package”: for presentation and publication of reproducible work
- Social/Mass Media and Communication // Fine Arts and Humanities
→ “tm Package”: a comprehensive framework for text mining and web-mining
- Psychology and the Behavioral Sciences // Economics and Social Sciences
→ “vegan Package”: for ordination methods like multidimensional scaling
- Statistical Computing and Data Mining // The Science and Art of Business
→ “rattle Package”: A GUI for classical and tree models and data mining
- Energy, Climate Change and the Environment // Technology and Public Policy
→ “EnvStats Package”: for environmental statistics and EPA guidance