BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//pretalx//pretalx.devconf.info//devconf-cz-2026//talk//9XRAV8
BEGIN:VTIMEZONE
TZID:CET
BEGIN:STANDARD
DTSTART:20001029T040000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=10
TZNAME:CET
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
END:STANDARD
BEGIN:DAYLIGHT
DTSTART:20000326T030000
RRULE:FREQ=YEARLY;BYDAY=-1SU;BYMONTH=3
TZNAME:CEST
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
END:DAYLIGHT
END:VTIMEZONE
BEGIN:VEVENT
UID:pretalx-devconf-cz-2026-9XRAV8@pretalx.devconf.info
DTSTART;TZID=CET:20260618T123000
DTEND;TZID=CET:20260618T130500
DESCRIPTION:GitHub provides public API for obtaining detailed information a
 bout various events performed by users across public repositories: git pus
 hes\, pull requests and reviews\, github issues and comments\, github star
 s\, etc.  The information about these events is available at https://gharc
 hive.org in the form of per-hour compressed files with JSON lines represen
 ting all the events. The number of events recorded per year is ~1.5 billio
 ns. The total size of events per year is ~7 terabytes. This sounds like a 
 big data. The talk shows how to explore this data at high speed and minima
 l costs and how to obtain interesting insights from this data.
DTSTAMP:20260430T125108Z
LOCATION:E105 (capacity 70)
SUMMARY:How to Analyze Terabytes of Data from GitHub Archive at High Speed 
 - Aliaksandr Valialkin
URL:https://pretalx.devconf.info/devconf-cz-2026/talk/9XRAV8/
END:VEVENT
END:VCALENDAR
