We had lots to talk about this afternoon! Catchup with new ATAP staff Debugging openNLP package installation in R An attendee from last week’s Network Analysis and Topic Modelling workshop showed us how she is applying topic modelling to her own research data! We discussed using topic models in conjunction with manual coding and…
Federal Election 2019 Dataset are now publicly available
During the 2019 federal election campaign, the QUT Digital Observatory along with Professor Axel Bruns, Professor Daniel Angus, and Adam Smith collected the tweets from Twitter accounts officially associated with candidates in the election. Now, with the next federal election coming, we have published this dataset for those who are…
Office Hours Recap 2022-03-29
At today’s Office Hours, we discussed topics including: We’re looking forward to the upcoming UQ Digital Cultures and Societies lightning talks Looked at some of the LADAL notebooks and a pull request contributed by a community member Mentioned DO’s new newsletter Discussed potential workflows for comparative topic modelling (relevant…
Announcing the Digital Observatory Newsletter
We are starting up a newsletter! Once every month or two, we’ll be sending out an issue with upcoming events, and some highlights of projects, datasets, and software we’ve been working on. This is a QUT Digital Observatory newsletter, but we will have events and news from the Australian Digital Observatory as well. Much of the content is…
Investigating likes and retweets on Twitter with Twarc
Changes to Twitter’s API now allow researchers to look at the full history of likes on tweets, and likes by specific users. We’ve added support for this to the twarc toolkit to make this newly expanded functionality accessible to more people. As a quick example, you can retrieve the profiles of all the Twitter users that have liked Scott…
Office Hours Recap 2022-02-01
Thanks everyone who came along today for office hours. Topics of discussion included: When speech recognition might be helpful in transcribing interviews (and cases where transcription of any kind isn’t going to happen). Approaches to speeding up data collection from the web, using parallelisation or other means. Why you probably don’t…
Office Hours Recap 2021-11-23
Thanks to everyone who attended last Tuesday! Next weeks office hours on December 7th will be our final session for 2021. Topics of discussion from November 23 included: Writing SQL and working with large datasets in databases. Discussion of datasette for hosting smaller datasets quickly. Barriers to making data usable. ATAP’s plans for…
Office Hours Recap 3
Thanks to everyone who attended today! Topics of discussion included: The complexities of parsing language data from tweets, including platform affordances like #hashtags and @mentions, and also emoji 😺 Pitfalls of importing data into Excel How to make meaningful entity level identifiers for projects where data should be readily…
Even Easier Twitter API Searching with Twarc
Have you ever found yourself using the Twitter API for your research, and needed to run many different searches to answer your question? Maybe you have a set of politics related accounts and you want to see who’s interacting with them via @mentions, or you have a set of #hashtags used for different aspects of a discussion and want to…
Summary of Office Hours I
Thank you to everyone who came along to our first joint office hours today. Topics of discussion included: How we’re going to organise sessions and how they’ll normally run. Supporting NVivo – how to make the most of the data model, and limitations for larger collections of short text. When do you know you need to start building a…