Ray Wang Lei

Logo

Return to My Main Page

View the Project on GitHub airfire6518/portfolio

Project 5: “Insights from Zoom Transcripts: Analyzing Student Engagement in the Digital Classroom”

Introduction:

In a quest to elevate student engagement in virtual classrooms, I undertook a project that leveraged the power of data analytics to provide insights into participation levels during Zoom classes. With the shift to online education, it became paramount to understand and encourage student interaction. My goal was to transform auto-generated Zoom transcripts into a structured dataset that could be analyzed to inform engagement strategies.

Data Processing and Database Integration:

Transcript Parsing:

Developed a Python program to parse the ‘captured_dialogue.vtt’ file, extracting structured data from the raw transcript text. Employed regular expressions to accurately identify and separate different components of the transcript, such as serial numbers, timestamps, speaker names, and utterances.

Database Table Creation:

Using the extracted data, I created a MySQL table named ‘vtt’ with columns corresponding to the transcript’s structure: SNo, TimeFrom, TimeTo, RegName, and Utterance. The program ensured data integrity and correct formatting for direct database importation.

Example Image

Establishing Connection to SQL and Creating Dataframe

Data Enhancement and Analysis:

Time Duration Calculation:

Executed SQL operations to construct a new table, ‘vttclean,’ which included all columns from ‘vtt’ plus a new column for ‘milliseconds.’ The new column represented the duration of each utterance, providing a quantitative basis for subsequent analysis.

The details steps as below:

Example Image

The New SQL Table vttclean

Visualization of Class Participation:

Designed an R program to visualize the total airtime each student contributed during class, represented in milliseconds. The bar chart created succinctly captured the level of participation, allowing for quick comparison and assessment of engagement.

Example Image

Total Airtime per Person

Skills and Tools Used:

Challenges Overcome:

Through overcoming these challenges, I honed my data manipulation and visualization skills, reinforced the importance of precise data cleaning, and learned to present data-driven insights in a compelling visual format.

Conclusion:

The project was a resounding success, providing clear metrics on student engagement that could guide the university in adapting teaching methods to the virtual environment. By transforming raw transcripts into a structured MySQL database and then into R-generated visualizations, I crafted a narrative of class participation that was both informative and accessible. This work stands as a testament to the potential of data analytics in enriching the educational experience and enhancing student interaction in online platforms.

R:
Economic Growth & Population Dynamics: A Visual Story of Southeast Asia
Integrating Python with R for Data Scraping and Analysis: A Study on GDP and Innovation

Python/R:
The Singapore HDB Resale Market: 1990-2024

SQL/R:
Visual Analysis of University Graduate Survey Data

Python/SQL/R:
Insights from Zoom Transcripts: Analyzing Student Engagement in the Digital Classroom

Python/SQL/Web API/SAP BO/Web GUI/Laravel/PHP:
“Sample Management System” - Infineon Technologies