Synthetic Dataset Workshop

Join us for an exclusive hands-on workshop on the CanPath Synthetic Dataset, designed for faculty, researchers, and trainees looking to explore Canada’s largest population health dataset. This workshop will introduce participants to Lifebit’s cloud-based platform and provide practical training on analyzing synthetic health data.
What is the CanPath Synthetic Dataset?
The CanPath Synthetic Dataset is a versatile resource designed for research, education, and practical applications. It was manipulated to mimic CanPath’s nationally harmonized data but does not include or reveal actual data of any CanPath participants.
Interested in exploring the data? Check out the CanPath Data Dictionary.
What is the Trusted Research Environment?
The Trusted Research Environment (TRE) is a secure, cloud-based platform where researchers can access and analyze CanPath data without downloading it to their local machines. The TRE ensures data privacy and security, providing a controlled environment for conducting research while offering powerful tools for analysis. Participants in this workshop will learn how to navigate and use the TRE efficiently. Lifebit, CanPath and AWS have collaborated to bring this platform to life.
Who should attend?
This workshop is open to:
- Faculty/staff using the dataset for education (e.g., incorporating it into a course assignment).
- Faculty/staff using the dataset for training purposes (e.g., teaching trainees how to analyze population health data or preparing to apply for real CanPath data access).
- Trainees (e.g., postdoctoral fellows, graduate students, undergraduate students) interested in learning how to work with CanPath data.
Learning objectives
By the end of the workshop, participants will:
- Navigate and analyze the CanPath Synthetic Dataset using the Lifebit cloud-based platform.
- Work with pre-created analytical pipelines in R and Python (basic familiarity required).
- Understand how to bring in additional data and use platform tools such as the Cohort Browser, Airlock, and Data Factory.
- Gain insights from real-world case studies and demonstrations from CanPath experts.
- Connect with fellow researchers, instructors, and clinician scientists to foster future collaborations.
Agenda
The final agenda will be sent to attendees prior to the event.
Time | Session |
---|---|
9:00 AM | Introduction to CanPath, Lifebit, staff and attendees |
9:30 AM | Training on the Lifebit platform and guided exercise |
11:00 AM | Coffee Break |
11:15 AM | Demonstration and case studies with CanPath data |
12:00 PM | Lunch Break |
1:00 PM | Hands-on analyses in the platform |
2:30 PM | Discussion: key learnings and future data strategy |
3:30 PM | Snacks and networking reception |
Additional support
Need extra help? Attendees can attend Office Hours to get one-on-one support from Lifebit and CanPath experts. Participants can book sessions for personalized assistance. The location, date, and time will be announced soon.
How to apply
- Participants must fill out an application to be considered.
- Applications are accepted on a rolling basis.
- Initial deadline: March 28
- Space is limited! Apply early for the best chance of securing a spot.
Frequently asked questions
Do I need experience with cloud-based environments?
No, this workshop is a great opportunity to learn how to use a cloud-based research platform.
Do I need programming experience?
Some familiarity with R and Python is helpful. While we provide pre-created analytical pipelines, you’ll need to understand how to move code around.
Will the workshop be offered in French?
CanPath is a bilingual organization, but this workshop will be delivered in English. Future training sessions may be available in French if there is interest.
Will I receive a certificate of completion?
Yes!