Kwang Moo Yi

Faculty of Science

Research Classification

Computer vision in artificial intelligence

Pattern recognition and artificial vision

Research Interests

3D Computer Vision

Computer Vision

Machine Learning

Astronomy Applications of Computer VIsion

Relevant Thesis-Based Degree Programs

View all programs

Affiliations to Research Centres, Institutes & Clusters

CAIDA: UBC ICICS Centre for Artificial Intelligence Decision-making and Action

Research Options

I am available and interested in collaborations (e.g. clusters, grants).

I am interested in and conduct interdisciplinary research.

I am interested in working with undergraduate students on research projects.

Biography

Recruitment

Looking to recruit:

Master's students, Doctoral students, Postdoctoral Fellows

Desired start dates:

2025

Potential research project areas:

Application of Machine Learning methods and Generative Models to 3D Computer Vision

Ideal applicant profile:

Typically, successful applicants with MSc degrees have prior exposure to 3D Computer Vision and/or Deep Learning, evident from publications at Computer Vision / Graphics conferences (CVPR,ECCV,ICCV,NeurIPS,SIGGRAPH,WACV,BMVC,ICIP). For students directly applying to graduate school with BSc degrees, having a publication record is a plus, and prior exposure to research environments or evidence of research projects is suggested.

Note: For graduate student positions, it is essential that you meet the department deadline, which is December 15th. You will only then be considered as a potential candidate. Also, contacting me in advance will not likely make any difference, as long as you list me as a potential supervisor. Please see the department website before anything if you intend to apply for graduate school.

Other options:

I support public scholarship, e.g. through the Public Scholars Initiative, and am available to supervise students and Postdocs interested in collaborating with external partners as part of their research., I support experiential learning experiences, such as internships and work placements, for my graduate students and Postdocs., I am open to hosting Visiting International Research Students (non-degree, up to 12 months).

Complete these steps before you reach out to a faculty member!

Focus your search

Make a good impression

ADVICE AND INSIGHTS FROM UBC FACULTY ON REACHING OUT TO SUPERVISORS

These videos contain some general advice from faculty across UBC on finding and reaching out to a potential thesis supervisor.

Graduate Student Supervision

Doctoral Student Supervision

Dissertations completed in 2010 or later are listed below. Please note that there is a 6-12 month delay to add the latest dissertations.

Monte-Carlo neural rendering (2026)

Repurposing large pretrained diffusion models for unsupervised visual understanding and efficient adaptation (2025)

Large pretrained text-conditioned image generation models learn a compositional and structured latent representation of visual concepts, showcasing their rich understanding of the world through their ability to generate diverse, coherent images. These models link text descriptions to visual concepts, unifying concepts across a range of conditions such as understanding the relationships between the text input and objects in a scene. This thesis explores how this link between text and visual concepts enables identifying consistent semantic regularities across images, where similar regions are mapped through the same text embedding. We show that this can be leveraged for tasks like semantic correspondence and estimating consistent keypoints, simply by optimizing the text embedding to activate highly in a specific region in the image for a given token. We also take advantage of the capacity of the model for one-shot personalization given only a single image. We leverage this by training hypernetworks to quickly estimate network weights for subject personalized generation, whose convergence is only possible due to the smooth underlying representation of concepts learned by these models. This PhD thesis leverages large pretrained diffusion models to address three key areas: semantic correspondence, unsupervised keypoint detection, and eﬀicient hypernetwork-based adaptation for personalized model fine tuning. For semantic correspondence, we optimize text tokens to focus attention on specific regions in an image, leveraging the latent knowledge of large pretrained models to identify correspondences from a single image without additional supervision. For unsupervised keypoint detection, we localize text tokens across a collection of images to identify common keypoints, using a collection of images to focus the model on a specific concept, leveraging the knowledge within the pretrained model to generalize without ground truth keypoints. We also investigate hypernetwork-based methods for generating weights for large model personalization conditioned on a single image, providing an eﬀicient alternative to compute intense optimization without requiring ground truth weights. This work highlights the versatility of diffusion models, extending their utility beyond image generation while proposing scalable, eﬀicient solutions for downstream tasks of semantic correspondence, unsupervised keypoint estimation, and hypernetwork-based personalized model fine tuning.

View record

Exploring explicit models for geometric point cloud learning (2024)

We are interested in processing point clouds -- a set of unordered points -- specifically in Euclidean space, such as 3D point cloud acquired from a range sensor (LiDAR) or 4D correspondence cloud in stereo matching task. Point clouds play an increasingly essential role in many tasks due to prevalence they hold. However, it is notoriously challenging to process point clouds with deep neural networks because of their irregular data structure, the difficulty in encoding contextual information from nearby points, and the large compute requirement that is typically required. This thesis addresses these challenges by enforcing intermediate features or model parameters to carry specific meanings such as attention and poses, leading to explicit representation. The meanings of explicit representation allow for traditional ways of manipulating features in order to solve target tasks. We refer to these architectures with explicit representations as explicit models. Explicit models largely improve performances without massively scaling up training data or model size because the explicit representation directly injects the prior knowledge needed by target tasks into neural networks without any learning. We explore explicit models for point cloud learning to perform robust estimation, stereo matching, segmentation, reconstruction and neural rendering. The thesis is organized into four chapters: 1, ACNe: An optimization-inspired network architecture that allows learning with point clouds contaminated with an abundance of outliers. 2, Canonical Capsules: An equivariant latent representation that consists of pose and pose-invariant features, enabling point cloud auto-encoding in unaligned datasets. 3, NeuralBF: A novel 3D instance proposal generation inspired by traditional bilateral filtering for top-down instance segmentation for 3D point clouds. 4, PointNeRF++: A multi-scale, point-based NeRF architecture, allowing seamless integration of point-based representation with Neural Radiance Fields.Across these four chapters, we show that explicit models largely improve point cloud learning, inspiring more future research in this domain. We conclude with a discussion about future works, practical tips on how to form an explicit model, and its role in the era of large foundation models.

View record

Modularizing deep learning for geometry-aware registration and reconstruction (2023)

Master's Student Supervision

Theses completed in 2010 or later are listed below. Please note that there is a 6-12 month delay to add the latest theses.

Kwang Moo Yi

Research Classification

Research Interests

Relevant Thesis-Based Degree Programs

Affiliations to Research Centres, Institutes & Clusters

Research Options

Biography

Recruitment

Complete these steps before you reach out to a faculty member!

Check requirements

Focus your search

Make a good impression

Attend an information session

ADVICE AND INSIGHTS FROM UBC FACULTY ON REACHING OUT TO SUPERVISORS

Graduate Student Supervision

Doctoral Student Supervision

Monte-Carlo neural rendering (2026)

Repurposing large pretrained diffusion models for unsupervised visual understanding and efficient adaptation (2025)

Exploring explicit models for geometric point cloud learning (2024)

Modularizing deep learning for geometry-aware registration and reconstruction (2023)

Master's Student Supervision

Deblurring neural radiance fields by modeling camera imperfections and using RGB-event stereo (2024)

Generative spectra modelling for galaxy redshift estimation (2024)

Weakly-supervised geometry-aware novel view synthesis (2024)

Neural fourier filter bank (2023)

Bootstrapping human optical flow and pose (2022)

Human pose and stride length estimation (2021)

Membership Status

Program Affiliations

Academic Unit(s)

Planning to do a research degree? Use our expert search to find a potential supervisor!