In a quest to elevate student engagement in virtual classrooms, I undertook a project that leveraged the power of data analytics to provide insights into participation levels during Zoom classes. With the shift to online education, it became paramount to understand and encourage student interaction. My goal was to transform auto-generated Zoom transcripts into a structured dataset that could be analyzed to inform engagement strategies.
Transcript Parsing:
Developed a Python program to parse the ‘captured_dialogue.vtt’ file, extracting structured data from the raw transcript text. Employed regular expressions to accurately identify and separate different components of the transcript, such as serial numbers, timestamps, speaker names, and utterances.
Database Table Creation:
Using the extracted data, I created a MySQL table named ‘vtt’ with columns corresponding to the transcript’s structure: SNo, TimeFrom, TimeTo, RegName, and Utterance. The program ensured data integrity and correct formatting for direct database importation.
Establishing Connection to SQL and Creating Dataframe
Time Duration Calculation:
Executed SQL operations to construct a new table, ‘vttclean,’ which included all columns from ‘vtt’ plus a new column for ‘milliseconds.’ The new column represented the duration of each utterance, providing a quantitative basis for subsequent analysis.
The details steps as below:
vttclean
table with a new column called milliseconds
(with INT)vttclean
with the milliseconds
column calculated (TimeTo
- TimeFrom
in milliseconds)The New SQL Table vttclean
Visualization of Class Participation:
Designed an R program to visualize the total airtime each student contributed during class, represented in milliseconds. The bar chart created succinctly captured the level of participation, allowing for quick comparison and assessment of engagement.
Total Airtime per Person
Through overcoming these challenges, I honed my data manipulation and visualization skills, reinforced the importance of precise data cleaning, and learned to present data-driven insights in a compelling visual format.
The project was a resounding success, providing clear metrics on student engagement that could guide the university in adapting teaching methods to the virtual environment. By transforming raw transcripts into a structured MySQL database and then into R-generated visualizations, I crafted a narrative of class participation that was both informative and accessible. This work stands as a testament to the potential of data analytics in enriching the educational experience and enhancing student interaction in online platforms.
R:
Economic Growth & Population Dynamics: A Visual Story of Southeast Asia
Integrating Python with R for Data Scraping and Analysis: A Study on GDP and InnovationPython/R:
The Singapore HDB Resale Market: 1990-2024SQL/R:
Visual Analysis of University Graduate Survey DataPython/SQL/R:
Insights from Zoom Transcripts: Analyzing Student Engagement in the Digital ClassroomPython/SQL/Web API/SAP BO/Web GUI/Laravel/PHP:
“Sample Management System” - Infineon Technologies