Skip to content

Vilin97/Urait

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

53 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Urait course matching

Finding New Textbooks

  • What disciplines are taught? (50% done)
  • Which of them are not covered by Urait courses? (50% done)
  • How to fill these gaps? (0% done)

Pipeline description

To answer the question "what courses does Urait have for a given university specialization?" we:

  • search for the study plans (учебные планы) for the specialization, via google search
  • parse the disciplines from the study plan page, using Gemini 2.5 Flash
  • search for work programs (рабочие программы дисциплин) for each discipline
  • parse the topics from the work program page, using Gemini 2.5 Flash
  • embed the topics using Gemini embeddings
  • embed the Urait courses using Gemini embeddings
  • match the topics to the courses using cosine similarity
  • for top 5 course matches, use Gemini 2.5 Flash to decide whether the course can be used to teach the discipline

Plans

  • match the study plans to the universities they came from, using the urls
  • get ~10-100 study plans per specialization, to approximate the popularity of the disciplines
  • use the popularity to sort the holes -- disciplines without matching courses

How to run

  1. run pip install -r requirements.txt
  2. download project_subjects.csv from drive.
  3. make a .env file with GOOGLE_API_KEY=your_key and SERPER_API_KEY=your_key.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages