Friday October 18, 2024 12:00pm - 1:00pm CEST
Without looking it up, would you know if the "Clean Resource Innovation Network" is an oil and gas lobbying organisation? Lobbyists often hide behind unintuitive or misleading affiliations that obscure their origins. The work of uncovering their identities can be time-consuming and challenging.

This workshop will outline an approach to using web scraping and Large Language Models (LLMs), like those powering ChatGPT, to systematically identify organisations that are affiliated with the fossil fuel industry. These techniques could also be adapted to other climate projects, such as identifying climate misinformation.

The workshop will:

- Describe some of the challenges of web scraping at scale and the technical tools that can be used to address those challenges
- Describe how to design and test a successful LLM prompt
- Explain how to identify the right LLM for the project you want to do
- Show how to test your approach and validate the results

P.S. You should have a Google account to open the link that will be shared during the session.
Moderators
Léopold Salzenstein

Data coordinator, Arena for Journalism in Europe
Léopold Salzenstein is a freelance investigative data journalist and trainer based in the south of France. At Arena, he coordinates the handling of data for publications and trainings. He is also a member of the Environmental Investigative Forum (EIF), a collective of journalists.
Speakers
Nicu Calcea

Senior Data Investigator, Global Witness
I’m a journalist with 14 years of experience in media, specialising in data reporting. Currently based in London.
Auditorium
