Categories
Web

PHP-Based Website Scraper and .ics vCalendar Generator

PHP scraper that creates an .ics calendar from Finnish audiovisual archive showtimes, for easy subscription.

The National Audiovisual Archive of Finland (KAVA) has its own cinema but does not offer standard .ics vCalendar files for movie showtimes. However, their ticket store page is well-structured, making it suitable for web scraping.

I used this as an opportunity to practice scraping and created an ICS calendar file from the showtime data. The calendar is primarily for my personal use but is publicly available for download. Note: if KAVA changes their site layout, the scraper will break, so the calendar may stop updating until fixed.

Showtimes rarely change, so updating the calendar once a week is sufficient. The subscription URL is:

https://www.sanaracreations.fi/apps/kavacal/kavacal.php

Currently, KAVA has updated their website layout, rendering the scraper non-functional. The scraper code is open for anyone to inspect and improve.