Data Collection and Description
We focused on the top 100 french channels. Some of these channels were not relevant to our reseach question (for example they were in arabic or were news channel or radio stations that repost on youtube or lacked text alltogether like some kids channels). All in all we kept X channels (annex ?), which are the channels at the top 100 with human scale (independent youtuber or small studios).
|channel name and id||strings||kaggle|
|transcript||string||YouTube Transcript/Subtitle API (python package)|