Monday, May 9, 2016

Carl Miller (@carljackmiller), Centre for the Analysis of Social Media, Demos

The rise of social media has been important; that is no great revelation. It has wrought profound social change, buffeted our institutions and altered, for many of us, our way of life. New identities, dialects, cultures, affiliations and movements have all bloomed and spread across the digital world, and spilled out of it into mainstream public life.

Back in 2012, we at Demos could see that social media was changing research too. The transfer of social activity onto digital spaces was ‘datafying’ social life. Huge new datasets were being routinely created that we saw as treasure troves of behavioural evidence: often very large, in real-time, rich, linked and unmediated. It was a massive new opportunity to learn about how people and society worked.
Unlocking these datasets presented an enormous challenge. The sheer scale of social media data also meant that conventional social research methods couldn’t cope. Powerful new analytical techniques - modelling, entity extraction, machine learning, algorithmic clustering - were needed to make sense of what was happening. However, the true challenge wasn’t a technological one alone. It was how to deploy the new tools of data science in the service of social science. Getting better at counting people is not the same as getting better at understanding them.

We established the Centre for the Analysis of Social Media that brought together social and policy researchers at Demos, and technologists from the University of Sussex with the explicit aim of confronting this challenge. The first layer of the challenge has been the technology itself. The tools of big data analysis needed to be put into the hands of non-technical researchers: the subject matter experts who have long understood social science, and now needed to be able to do it in a new way. We built a technology platform, Method52, which allowed non-technical users to use a graphical user interface, and drag-and-drop components to flexibly conduct big data analytics, rather than be faced with a screen full of code. Especially important was to make accessible a vitally important technique called natural language processing. Coupled with machine learning, it is one of the crucial ways of understanding bodies of primarily text-based data (like Tweets or Facebook posts) that are too large to manually read.

However, any technology - even one that learns - is just a tool and the second layer has been to learn how to slot all the technology into a broader social scientific methodology. We’ve just concluded a major study with the pollsters Ipsos MORI, on how to use tools like natural language processing within a broader framework that stands up to social scientific scrutiny. Much of this has been to develop a process of big data analysis that cares about the same things that social science cares about: the introduction of possible biases in how the data is sampled and collected; the non-representative skews in who uses social media; the danger of analyst pre-conceptions and bias in how the data is measured and handled; the difficulty of measuring at great scale the textured complex utterances of people in specific social contexts and the importance of interpreting the results in the light of the norms, cultures, languages and practices of social media itself.


But even beyond this, the third layer has been get social science to govern the whole endeavour: the questions that are asked, the implications that are drawn, how the research is used, and, of course, the ethical frameworks that control its use.


The big data revolution will not slow down, it will only gather pace. The scales of data will only increase, and the technologies and techniques to harness data are becoming more capable and powerful at a bewildering rate. To my mind, this means that social science - qualitative as well as quantitative - has never been more important. It has never been more crucial to point out the inherent difficulties in studying people in all their messy and chaotic complexity, all the pitfalls of reducing human behaviour into something that can be counted and aggregated, and of how understanding society doesn’t stop with a series of raw metrics, however large they are.


This article was originally published in the National Centre for Research Methods’ Newsletter 2016:2 - http://www.ncrm.ac.uk/news/methodsnews.php

Notes

1 More information on its work is available at: http://www.demos.co.uk/research-area/centre-for-analysis-of-social-media/
2 For more information on Method52, see Jamie Bartlett, Carl Miller, Jeremy Reffin, David Weir, Simon Wibberly, ‘Vox Digitas’ (Demos: 2014): http://www.demos.co.uk/files/ Vox_Digitas_-_web.pdf?1408832211
3 For a further description of natural language processing, see Jeremy Reffin, ‘Why Natural Language Processing is the most important technology you’ve never heard of’, Demos Quarterly 8, Sprint 2016, http://quarterly. demos.co.uk/article/issue-8/natural-language-processing-the-most-important-technology-youve-never-heard-of/
4 See ‘the wisdom of the crowd’, Ipsos MORI, https://www.ipsos-mori.com/ ourexpertise/digitalresearch/sociallistening/wisdomofthecrowd.aspx
5 For more information on this work, see http://www.demos.co.uk/files/Road_to_ representivity_final.pdf?1441811336

Further Reading
On the current work of the Centre for the Analysis of Social Media at Demos, http://www.demos.co.uk/research-area/centre-for-analysis-of-social-media/
A technology edition of Demos Quarterly, Issue 8, Spring 2016, http://quarterly.demos.co.uk

0 komentar:

Post a Comment

LightBlog

BTemplates.com

Categories

#BigData (1) #bookofblogs (6) #einterview (5) #nsmnss (21) #SoMeEthics (2) AHRC (1) Amy Aisha Brown (2) analysis (2) analytics (1) API (1) auxiliary data source (1) Big Data (8) big data analytics (1) blog (14) blogging (7) blogs (8) Book of blogs (3) book review (8) case studies (1) Christian Fuchs (1) coders (1) cognition (1) community (2) community of practice (1) computer mediated (1) conference (3) content analysis (1) crowdsourcing (3) data (1) data access (1) Data Base Management System (1) data linkage (1) data protection (1) definitions (4) demographics (1) Dhiraj Murthy (1) digital (3) digital convergence (1) Digital debate (7) digital humanities (1) dissemination (1) Dr Chareen Snelson (2) Dr Sarah-Louise Quinnell (1) Dr Steve Jones (1) e interviews (2) e-privacy (1) ECR (1) einterview (2) empathy (1) Eran Fisher. (1) ESRC (2) ethics (13) event (3) facebook (3) fanfiction (1) funding (2) Geert Lovink (1) graduate (3) guidelines (5) hootsuite (1) HR (1) identity (3) impact (1) imputation (1) international research (2) janet salmons (7) Japanese (1) Jenna Condie (1) jobs (1) Katheleen McNiff (2) Language (1) learning (1) linguistic anthropology (1) Make Money (2) Mark Carrigan (1) market research (2) media (2) methods (1) mixed methods (1) natcen (1) NCapture (1) netnography (2) network (3) Networked Researcher (1) networked spaces (2) new media (2) NVivo (2) Online (2) online communities (1) online footprint (2) online interview research (2) online personas (2) online research (2) organisational management (1) ownership (1) Paolo Gerbaudo (1) phd (2) PhDBlogger (2) politics (1) power (1) privacy (4) QSR International (1) Qualitative (4) qualitative research methods (6) Quantitative (4) Recruitment (1) research (8) research methods (8) researcher (2) RSS (1) RTI International (3) rumours (1) SAGE (1) Sampling (3) semantic analysis (1) semantics (1) sentiment (1) sentiment accuracy (1) Sherry Turkle (1) small data (1) small datasets (1) social media (36) Social Media MA (10) Social Media Managment System (1) social media monitoring tools (2) social media research (12) social science (4) Social Science Space (2) social scientists (6) social tensionn (1) sociolinguistics (1) sociology (3) software (2) statistics (1) Stories (1) storify (1) surveillance (2) survey (4) teaching (2) technologies (4) tools (2) trust (1) tweet chat (11) Twitter (20) University of Westminster (13) user views (1) video interview (7) vlogging (9) web team (4) webinar (2) weighting (1) YouTube (10)
Responsive Ads Here

Recent

Recent Posts

Navigation List

Popular Posts

Blog Archive