Niranjan Thanikachalam – Portfolio

I am a Computer Vision engineer with a PhD from EPFL in Computational Imaging. My interests are in 3D reconstruction, segmentation, inverse rendering, inverse problems and numerical optimization. Below is a selection of some of my recent work.


Semantic Segmentation for Surface Anomalies and Defects Detection

2024 – Present . Artmyn

Semantic segmentation for surface anomalies and defects detection. This video shows anomaly detection across four labels. First, it shows cracks detected across the entire painting. Next, it shows a region where someone repaired or retouched the paint, which is sometimes visible in the UV range. Then it shows detections in regions where material has been lost from the surface of the painting. Finally, it shows detections of regions that have suffered surface abrasion.

We aim to detect surface defects, anomalies and events on Artmyn’s multimodal gigapixel digital twins. We worked with external experts to develop a small annotated dataset. For training, we leveraged distribution-aware resampling, heavy augmentations, a class-frequency-weighted loss function, selective masking, and elements of weakly supervised and semi-supervised learning to develop a zoo of semantic segmentation models. At gigapixel scales, getting consistent predictions without compromising precision or recall is not trivial, so during inference, in addition to heavy test-time augmentation, we also ensemble the models via majority voting.
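As a rough illustration of the ensembling step, here is a minimal sketch of per-pixel majority voting over the label maps predicted by several models. It is not Artmyn's implementation: the function name, the tiny 2x2 maps, the label ids and the tie-breaking rule are all assumptions made for the example.

```python
from collections import Counter


def majority_vote(label_maps):
    """Fuse per-pixel class predictions from several models by majority vote.

    label_maps: list of 2D lists (one per model) of integer class ids,
    all with the same shape. Ties go to the smallest class id, which is
    an arbitrary but deterministic choice made for this sketch.
    """
    height, width = len(label_maps[0]), len(label_maps[0][0])
    fused = [[0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            votes = Counter(m[r][c] for m in label_maps)
            top = max(votes.values())
            fused[r][c] = min(k for k, v in votes.items() if v == top)
    return fused


# Hypothetical label ids: 0 = background, 1 = crack, 2 = retouching.
model_a = [[0, 1], [2, 0]]
model_b = [[0, 1], [0, 0]]
model_c = [[1, 1], [2, 2]]
fused = majority_vote([model_a, model_b, model_c])
print(fused)
```

A pixel keeps a defect label only when most models agree on it, which trades a little recall on isolated detections for more consistent predictions across tiles.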


Differentiable Rendering for Spatially-Varying Reflectance Estimation

2022 – 2024 . Artmyn

A digital twin rendering of a 14th century artwork that used gold in combination with paint on wood. The differentiable renderer effectively captures both the metallic shine of gold and the gloss of paint on wood.

An aspect of real-world objects that is often ignored in 3D reconstruction is reflectance capture. The visual richness of material textures in the real world arises from the way light interacts with the surface roughness of the object, resulting in cues we perceive as matte, glossy or shiny. To effectively estimate this reflectance, I built a differentiable renderer for inverse rendering, partially inspired by the idea behind NeRF. The resulting reflectance model brought drastic improvements in realism, with PSNR gains of up to 15 dB on objects with regions of specular highlights.
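To put the PSNR figure in context, the sketch below computes PSNR from the mean squared error and converts a gain in dB into the corresponding reduction in error. It uses the standard definition, PSNR = 10 log10(peak^2 / MSE); the function names and the toy pixel values are assumptions for illustration, not data from the project.

```python
import math


def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)


def mse_reduction_for_gain(gain_db):
    """Factor by which the MSE shrinks for a given PSNR gain in dB."""
    return 10 ** (gain_db / 10)


# Toy example: each pixel is off by 0.1 on a [0, 1] intensity scale.
toy_psnr = psnr([0.0, 1.0], [0.1, 0.9])
reduction = mse_reduction_for_gain(15)
print(round(toy_psnr, 2), round(reduction, 1))
```

Under this definition, a 15 dB gain corresponds to a mean squared error roughly 32 times smaller.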


Gigapixel-Scale Multi-Modal Capture and Processing Pipeline

2021 – 2023 . Artmyn

In this video we see renderings of digital twins of two artworks captured with Artmyn’s gigapixel multi-modal capture pipeline.

Artmyn’s multimodal imaging pipeline operates at 2000 ppi, producing gigapixel assets. This comes with several challenges. Consistent image registration is a big challenge, particularly across regions with repeating patterns, which occur surprisingly often in paintings. To overcome this, I built CRAFT, an in-house adaptation of the RAFT optical flow CNN, to which we added a real-geometry encoder based on estimated depth and applied transfer learning on synthetic data, reducing failure rates from 20% to under 1%. Accurate depth estimation at these scales is also non-trivial. I cast it as a large-scale inverse problem with priors from both photometric cues and stereo displacement, which improved depth estimation drastically. I also contributed heavily to the camera sensor and color calibration models, resulting in extremely high-fidelity digital twins.


3D Reconstruction in the Wild

2024 . CS231N

3D reconstruction result
Comparison of point clouds reconstructed from unstructured photographs using LoFTR and two variations of DFSFM.

This was a research project for the course CS231N - Deep Learning for Computer Vision at Stanford Online. The project considers the problem of phototourism, i.e., recreating a 3D model of the real world from an unstructured set of photographs, from a deep learning perspective. The work explores the use of deep learning components in the classical structure-from-motion pipeline. It also explores replacing the bundle adjustment optimization step with the recently proposed DBARF, a generalizable NeRF-inspired neural rendering method that simultaneously optimizes camera pose and image rendering. We find that while deep learning components generally improve the feature description and matching stage, even neural rendering methods that optimize rather than learn fail to achieve the accuracy of bundle adjustment.


Exploring the capability of Tiny Language Models for storytelling in resource-constrained languages

2025 . CS224N

Evaluation of TLMs
Evolution of GPTEval scores across validation and test datasets.

This was a research project for the course CS224N - Natural Language Processing at Stanford Online. In this study, we are interested in Tamil language models that can tell stories of the same complexity as those told to toddlers and young children. To build such a model, we created a machine-translated version of the TinyStories dataset with 1M stories in the train split. We then explored GPTNeo and Llama models of differing sizes, all under 150M parameters, to learn storytelling. We took a three-stage approach: the model is first pretrained on internet-quality Tamil data, then the machine-translated dataset is used for continual training, and finally we run a fine-tuning pass on a very small expert-curated dataset with 2000 stories in the train split. We also attempted LoRA fine-tuning of an English-language GPTNeo model. We see that while the models are able to tell stories, the stories are not of high quality, mainly because of the low quality of the machine translations. Some of the resulting models are smaller than 100MB and can easily run in your browser. Head over to the project site to give it a try.
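For readers unfamiliar with LoRA, the sketch below shows the core idea in plain Python: the pretrained weight W stays frozen and only a low-rank update B A is trained, so the effective weight is W + (alpha / r) B A. The matrix sizes, rank, and alpha here are illustrative assumptions and not the settings used in the project.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where r is the adapter rank.

    w has shape (d_out, d_in), a has shape (r, d_in), b has shape (d_out, r).
    """
    r = len(a)
    scale = alpha / r
    delta = matmul(b, a)
    return [[wij + scale * dij for wij, dij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def lora_param_count(d_out, d_in, r):
    """Number of trainable adapter parameters for one weight matrix."""
    return r * (d_in + d_out)


# Toy frozen weight (d_out=2, d_in=3) and a rank-1 adapter.
w = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
a = [[1.0, 2.0, 3.0]]
b_init = [[0.0], [0.0]]      # B starts at zero, so the adapter is a no-op
b_trained = [[0.5], [0.0]]   # hypothetical values after some training

unchanged = lora_effective_weight(w, a, b_init, alpha=2.0)
adapted = lora_effective_weight(w, a, b_trained, alpha=2.0)
print(adapted, lora_param_count(768, 768, 8))
```

For a hypothetical 768 x 768 weight matrix with rank 8, the adapter trains 12,288 parameters instead of 589,824, which is what makes this kind of fine-tuning cheap for small models.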


Computational Methods for freeform optics design

2017 – 2020 . Rayform

Freeform optics design © Rayform

At Rayform, I contributed heavily to the production algorithm for the design of freeform optics. My particular focus was on the computational method behind the lens surface design, resulting in a parameter-free algorithm that is robust to more general light configurations. This has since been licensed to LVMH’s Fred brand.


Computational Imaging for Digitization

2012 – 2016 . EPFL

PhD thesis work

During my PhD at EPFL, I worked on a Google-funded project called “eFacsimile”, focused on modern computational methods for high-fidelity digitization of cultural heritage artifacts. The work resulted in collaborations and exhibitions at Vitromusée, and in IP that was later licensed to Artmyn through Invaluable. A selection of resulting publications and media coverage is linked below.


Mountainscapes

Forever ago . Personal

Ghosts of Summer Past - V a.k.a Might
Comet Neowise setting behind a rock near Lacs de Fenêtre, as the Milky Way rises over part of the Mont Blanc massif in the background. © Niranjan

I used to be obsessed with mountains and the night sky. Most of my photos still reside on a hard drive somewhere, but I have a few on my 500px.