OpenAI Embeddings and Clustering for Survey Analysis — A How-To Guide | by Alejandra Vlerick | Oct, 2024


How to gain insights from survey data and extract topics using embeddings and Large Language Models

Photo by Olav Ahrens Røtne on Unsplash

It has been exactly 4 months since I changed jobs and have had the time to settle in and continue my little side projects.

The latest one was a survey analyser tool, requested by one of the product owners at my company. He had to go through thousands of company wide survey responses on a quarterly basis, trying to extract actionable improvements for the business

Now that the tool is being used and (hopefully) saving many hours of work to product owners and analysts, I have crafted this how-to guide to help you create a similar tool.

We use as input a dataframe with the survey responses (using pandas). The key columns are the comments left by each user. Other fields like department, job title and date submitted could also be analysed but for the minimum viable product, I decided to keep the variables few and simple.

The data for this study was collected through an online survey, and it will remain confidential. For the purpose of this article, the actual data is not shown; the images depict the analysis of a smaller…

Read Also:  Is Data Preprocessing As Important As They Say?

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top