Monday, November 14, 2016

UW Data Science Seminar: Matthew Salganik

November 16, 2016 3:30 in Johnson 075

Matthew Salganik, Professor of Sociology at Princeton University, will be presenting “Social Research in the Age of Big Data” at this week’s Data Science Seminar. The Data Science Seminar is free and open to the public.

The digital age has transformed how researchers are able to study social behavior. These new opportunities mean that the future of social research will involve blending together insights from two communities: social scientists and data scientists. In this talk, I'll begin by describing what I think each community has to contribute and what each community has to learn. Then, I'll focus on this social science/data science hybrid in one particular domain where I see a lot of opportunities: survey research. The talk will conclude with some predictions about the future of social research.

Tuesday, November 1, 2016

UW Data Science Seminar: Rob Axtell

November 2, 2016 3:30 in Johnson 075

The UW Data Science Seminar, organized by the eScience Institute, iSchool DataLab, and CSE Interactive Data Lab, is a “university-wide effort bringing together thought-leading speakers and researchers across campus to discuss topics related to data analysis, visualization and applications to domain sciences.” Rob Axtell, Department Chair of the Krasnow Institute for Advanced Study at George Mason University, presents this week’s seminar entitled, “Computationally-Enabled Public Policy Using Comprehensive Data.”
The social sciences are being revolutionized today by two distinct forces, data and computing. The ability to perform controlled experiments, both in laboratory (small scale) and web-facilitated (large scale) settings, combine with natural experiments and digital exhaust type click-stream data to provide an unprecedented window into human behavior in a wide variety of social contexts. But just as significant is the increasing availability of administratively-complete micro-data that offer nearly comprehensive portraits of important social phenomena. Computational techniques and tools are essential for managing such data, and for creating models capable of explaining the data. Specifically, agent-based computing is an emerging technology for representing individuals engaged in social behavior and grounding them in micro-data. In this talk I will start with some background material on agent computing, discussing how the approach has been utilized for abstract models of social processes. I will then go on to describe two large-scale agent models that utilize individual-level data. A model of the U.S. housing market bubble that burst c 2006-7 will be described for the Washington, D.C. area. It involves some 2 million housing units overall with more than a million homeowners and some 500K mortgages. The model combines data on the housing stock (county sources), borrowers (Census), and mortgages (from mortgage service providers), and the model output is compared to MLS transactional data. We have investigated alternative policies for attenuating the size of the bubble. Then a model of the U.S. private sector, 120 million employees organized into 6 million firms, will be presented. This model uses data on the entire population of tax-paying firms in the U.S. and closely reproduces firm sizes, ages, growth rates, job tenure, wage distributions, and so on. In these models, aggregate phenomena emerge from the interactions of the agents without any pre-specification of what might happen. That is, social phenomena grow from the bottom up.

Thursday, October 13, 2016

Hacking the Academy: Open in Action

Come celebrate Open Access Week by learning how UW faculty and staff are working to keep their work open. The Hacking the Academy: Open in Action program will begin with four short talks, followed by time for discussion around the theme "Open in Action." Speakers include Rachel Arteaga (public scholarship), Steven Roberts (open science/open data), Dan Berger (public scholarship), and Justin Marlowe (open textbooks). Please join us October 26, 4-5pm in the Research Commons Green A! 

Wednesday, October 12, 2016

Big Data to Knowledge Webinars and Discussion Groups

The UW Health Sciences Library and Research Data Services are collaborating with the National Network of Libraries of Medicine/Pacific Northwest Region, to provide a monthly discussion group focused on issues around Data Science, with a focus on biomedical science. The discussion group will provide a venue for those interested in the National Institutes of Health’s Big Data to Knowledge “Guide to the Fundamentals of Data Science,” a series of online lectures given by experts from across the country covering a range of diverse topics in data science.
The online lecture series is an introductory overview that assumes no prior knowledge or understanding of data science, and will run all year, once per week, from 9-10am Pacific Time. The list of speakers through the beginning of 2017 is available online. Upcoming topics include Ontologies, Metadata, Provenance, Databases, Social Networking Data, Exploratory Data Analysis, and lots more.
Academic librarians and others interested in biomedical big data from around the Puget Sound are invited to join a monthly Friday discussion group on October 14, November 18 and December 16. The group will meet at 8:45am to watch the week’s BD2K online lecture, and then from 10-11am share insights or questions about that week’s topic, and previous lectures in the series. All discussion groups will be held in The Health Sciences Pacific Room.
Questions? Email Emily Patridge at ep001 (at) uw.edu.

Tuesday, October 11, 2016

UW Hosting Trial of Data-Planet Statistical Datasets


UW librarians, faculty, and researchers are invited to learn more about the power of Data-Planet Statistical Datasets, the largest repository of standardized and structured data. We have trial access to this database along with the others listed at http://guides.lib.uw.edu/research/db-trial. 

Data-Planet founder Richard Landry will highlight subjects and sources covered, along with functionality, features, and visualization tools. You’ll leave with tips for searching, manipulating, and exporting data from over 70+ government and private sources, covering 35 billion data points in 4.9 billion datasets. 

Please register for the UW Libraries Data-Planet Statistical Datasets Webinar on Thursday, Oct 13, 2016 12:00 PM PDT at:  https://attendee.gotowebinar.com/register/1191767628182916610 
Participate remotely or join a group viewing of the webinar: Suzzallo Library, RAD. Thunderbird Conference Rm. (if you don’t work in the UW Libraries, contact cass@uw.edu for info about this location). 

Please visit  online for additional information: 

Please contact Cass Hartnett at cass@uw.edu or Marcy Rothman at mrothman@data-planet.com with any questions or special requests! 

After registering, you will receive a confirmation email containing information about joining the webinar. We look forward to hearing your feedback!

Thursday, September 15, 2016

Software Carpentry Workshop: Oct 10-13 @UWescience



Software Carpentry is a non-profit volunteer organization whose members teach researchers computing skills. 

On October 10th-13th, we will hold a four-day (mornings-only) Software Carpentry workshop at the UW eScience Data Science Studio. The workshop is focused on software tools to make researchers more effective, allowing them to automate research tasks, automatically track their research over time, and use programming to accelerate their research, and make it more reproducible.

In the workshop, we will have two parallel tracks: one in which we will focus on the programming language R, and the other in which we will focus on Python. 

For details, and to register for the upcoming workshop, please refer to the following web-page: https://uwescience.github.io/2016-10-10-uw/

Wednesday, August 3, 2016

Upcoming Data Management Planning Workshop

Do you create or use data in your research? Looking for tips and tools to better help you manage your research data, and preserve it for long-term use?

On August 22, the UW Libraries is offering Data Management Planning, an asynchronous online workshop for UW community members engaged in research with data. Topics will include getting started with data management planning, funder requirements for data sharing, metadata, tips to help keep you organized, sharing, archiving and preservation, and an introduction to tools and on-campus support to aid researchers.

Full course information and link to registration is below. Contact us with any questions.

Data Management Planning Workshop
A free, tutor-supported online workshop
August 22 - 25, 2016
Duration: Monday, August 22, 2016 - Thursday, August 25, 2016 (4 days)
Time Commitment: Approximately 30 minutes to 1 hour per day, for 4 straight days
Target audience: UW community members engaged in research with data.
Prerequisites: Access to the internet for each of the 4 days identified. A valid UW NetID is also required.

Description:
  • This module-based workshop consists of activities and peer discussion forums that will provide tips on how to effectively plan for data management over the lifecycle of your research project.
  • By asking students to share experiences with one another, this workshop gives you the opportunity to reflect on your research workflow and to see how various techniques and tools can be employed to most effectively manage, share and preserve your data.

Participation Process:
  • This workshop will take place in Canvas over 4 days, with no fixed participation times (asynchronous).
  • Each day corresponds to one online module, which includes a topic overview, resources, activity, and peer discussion forum.
  • Discussion forums are the workshop's primary means of 'assessment,' so expect to post to forums daily.
  • You will be guided through the course by a team of friendly librarian tutors, who will answer questions and provide feedback.

How to Join:
  • If interested, please register via this Catalyst link no later than Friday, August 19, 2016.
  • Space in the workshop is limited, and participants will be accepted on a first-come-first-served basis. Students who register after capacity is reached may be placed on a wait list.

If you have any questions, please feel free to contact the Data Services Team.

Tuesday, July 5, 2016

Society of American Archivists to discuss research data management

In the next month, the Society of American Archivists' Records Management Roundtable has planned a series of blog posts to foster discussion on research data management. The roundtable's blog The Schedule will "feature posts describing collaborative efforts to address research data management, resources and outreach initiatives, incorporating research records into a retention schedule, and the question of faculty research as a public record."

Comments are encouraged, so make sure to follow the blog, watch the discussion, and participate!

Tuesday, April 5, 2016

Data Science Studio Office Hours for Spring Quarter

As a reminder, the WRF Data Science Studio offers several types of drop-in office hours to meet the needs of those working in data-intensive science. The program brings together expertise from the eScience Data Scientists, UW libraries, UW-IT, and the Center for Statistics and the Social Sciences (CSSS) to help triage challenges in data-intensive science – including cloud computing – and steer people towards appropriate solutions. Assistance may be in the form of immediate help, a longer meeting with our team to understand the problem more deeply, or a referral  you to faculty on campus with relevant expertise.

Tuesday, March 29, 2016

Upcoming classes: Community Data Science Workshop, R + Stata

Several upcoming workshops and classes will be held Spring Quarter at the University of Washington, focusing on students needing R or Stata introductions, as well as another round of the popular Community Data Science Workshops. Details are below.

Classes

The Center for Social Science Computation and Research has posted their Spring Quarter classes, which includes Introduction to Stata, Introduction to R with R Studio, and Introduction to R with Commander. Students will learn basics software organization, where to find help, and how to get started with basic analyses. No previous experience in statistical programming is necessary, but basic understanding of statistics will be helpful. 

Workshops

The Spring 2016 round of the Community Data Science Workshops are for anyone interested in learning how to use programming and data science tools to ask and answer questions about online communities like Wikipedia, free and open source software, Twitter, civic media, etc. The Spring 2016 series consists of one Friday evening and three Saturday sessions in April and May. The workshops are for people with no previous programming experience and, thanks to sponsorship from eScience and the Department of Communication, are free of charge and open to anyone.

Our goal is that, after the three workshops, participants will be able to use data to produce numbers, hypothesis tests, tables, and graphical visualizations to answer questions like:

- Are new contributors to an article in Wikipedia sticking around   longer or contributing more than people who joined last year?

- Who are the most active or influential users of a particular Twitter hashtag?

- Are people who participated in a Wikipedia outreach event staying involved? How do they compare to people that joined the project outside of the event?

Details and dates are online here:

If you are interested in participating, please fill out our registration at the link above before Saturday April 2. Register soon!

If you already know how to program in Python, it would be really awesome if you would volunteer as a mentor! Being a mentor involves working with participants and talking them through the challenges they encounter in programming. No special preparation is required. If you’re interested, there’s a link on the page above, or you can send me an email. If you mentored before, it’s still easier if you fill our form again. Thanks!

Regards,
Mako (On behalf of Jonathan, Tommy, Dharma, Ben, Mika, and all the CDSW
mentors.)