{"id":23,"date":"2022-12-17T21:50:00","date_gmt":"2022-12-18T02:50:00","guid":{"rendered":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/?page_id=23"},"modified":"2023-05-09T04:10:06","modified_gmt":"2023-05-09T09:10:06","slug":"approaches","status":"publish","type":"page","link":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/","title":{"rendered":"Our Approach"},"content":{"rendered":"\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"840\" height=\"904\" src=\"http:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png\" alt=\"\" class=\"wp-image-113\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png 840w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35-279x300.png 279w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35-768x827.png 768w\" sizes=\"auto, (max-width: 706px) 89vw, (max-width: 767px) 82vw, 740px\" \/><figcaption class=\"wp-element-caption\">Overall architecture of our approach<\/figcaption><\/figure>\n\n\n\n<p>Here we describe each component of our pipeline for estimating depth from focal stack. <\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Multiplane Image (MPI)<\/h2>\n\n\n\n<p>We represent the scene with Multiplane Image, which is a set of fronto-parallel planes at different predefined depths of the scene. Each plane contains RGB value and alpha or transparency value at a specific pixel location and depth layer. RGB images could be rendered with alpha compositing, and disparity map can be generated by taking expected value of the disparity map.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"952\" height=\"387\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/mpis.png\" alt=\"\" class=\"wp-image-66\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/mpis.png 952w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/mpis-300x122.png 300w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/mpis-768x312.png 768w\" sizes=\"auto, (max-width: 706px) 89vw, (max-width: 767px) 82vw, 740px\" \/><figcaption class=\"wp-element-caption\">Architecture and rendering process of a Multiplane Image<\/figcaption><\/figure>\n\n\n\n<p>We chose the MPI representation due to its fast-rendering speed, known depth value of each layer and ability to work with patch-based losses such as GAN loss, Perceptual or SSIM losses.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Apply Defocus effect to MPI<\/h2>\n\n\n\n<p>To simulate defocus effect with MPI representation and reconstruct each image of a focal stack, we blur different parts of the scene with different amount of blur based on the difference between focal distance and depth at each layer of MPI. Specifically, we apply disk kernel with different diameter to each layer of the MPI.  <\/p>\n\n\n\n<p>Based on the Thin Lens Model, given camera parameter aperture and focal length, and focal distance which vary among images from the focal stack, diameter of defocus is at a particular scene depth:<\/p>\n\n\n\n<p class=\"has-text-align-center\"> aperture * |focal distance &#8211; depth| * focal length \/ (focal distance-focal length) \/ depth<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/function.png\" alt=\"\" class=\"wp-image-67\" width=\"536\" height=\"379\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/function.png 543w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/function-300x212.png 300w\" sizes=\"auto, (max-width: 536px) 100vw, 536px\" \/><figcaption class=\"wp-element-caption\">Relation between blur radius and scene depth with projective camera<\/figcaption><\/figure>\n\n\n\n<p>The above figure describes the relationship between blur amount and depth under a projective camera model. In our experiment, we are using an orthographic camera to capture the focal stack. With such a camera, the relationship between blur radius and scene depth is linear<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"456\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_16.29.51-1024x456.png\" alt=\"\" class=\"wp-image-116\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_16.29.51-1024x456.png 1024w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_16.29.51-300x134.png 300w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_16.29.51-768x342.png 768w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_16.29.51.png 1406w\" sizes=\"auto, (max-width: 706px) 89vw, (max-width: 767px) 82vw, 740px\" \/><\/figure>\n\n\n\n<p>After convolving disk kernel of appropriate radius with each MPI layer, we perform alpha compositing of the layers to get the reconstruction. The reconstruction process is illustrated below:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/forward.png\" alt=\"\" class=\"wp-image-68\" width=\"674\" height=\"385\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/forward.png 1017w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/forward-300x172.png 300w, https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2022\/12\/forward-768x440.png 768w\" sizes=\"auto, (max-width: 674px) 100vw, 674px\" \/><figcaption class=\"wp-element-caption\">Reconstruction of focal stack <\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Optimization of MPIs<\/h2>\n\n\n\n<p>We compare the reconstructed image from the focal stack and the ground truth and optimize for the RGB and alpha values stored in each layer of the MPI through gradient descent by applying gradient of the loss with respect to the RGB and alpha values.<\/p>\n\n\n\n<p>The loss functions we use include the following:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>RGB L1 reconstruction loss: enforces reconstructed image to have the same RGB value as the ground truth, L1 is used to increase sharpness<\/li>\n\n\n\n<li>Edge-aware smoothness loss: Defocus cues are inherently limited to surfaces with little or no texture. The edge-aware smoothness loss encourages that depth remains smooth with a textureless surface.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\"><\/h2>\n","protected":false},"excerpt":{"rendered":"<p>Here we describe each component of our pipeline for estimating depth from focal stack. Multiplane Image (MPI) We represent the scene with Multiplane Image, which is a set of fronto-parallel planes at different predefined depths of the scene. Each plane contains RGB value and alpha or transparency value at a specific pixel location and depth &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Our Approach&#8221;<\/span><\/a><\/p>\n","protected":false},"author":151,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-23","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Our Approach - Neural Radiance Imaging<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Our Approach - Neural Radiance Imaging\" \/>\n<meta property=\"og:description\" content=\"Here we describe each component of our pipeline for estimating depth from focal stack. Multiplane Image (MPI) We represent the scene with Multiplane Image, which is a set of fronto-parallel planes at different predefined depths of the scene. Each plane contains RGB value and alpha or transparency value at a specific pixel location and depth &hellip; Continue reading &quot;Our Approach&quot;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/\" \/>\n<meta property=\"og:site_name\" content=\"Neural Radiance Imaging\" \/>\n<meta property=\"article:modified_time\" content=\"2023-05-09T09:10:06+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/\",\"url\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/\",\"name\":\"Our Approach - Neural Radiance Imaging\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/#primaryimage\"},\"thumbnailUrl\":\"http:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/wp-content\\\/uploads\\\/sites\\\/75\\\/2023\\\/05\\\/iShot_2023-05-09_15.46.35.png\",\"datePublished\":\"2022-12-18T02:50:00+00:00\",\"dateModified\":\"2023-05-09T09:10:06+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/#primaryimage\",\"url\":\"http:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/wp-content\\\/uploads\\\/sites\\\/75\\\/2023\\\/05\\\/iShot_2023-05-09_15.46.35.png\",\"contentUrl\":\"http:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/wp-content\\\/uploads\\\/sites\\\/75\\\/2023\\\/05\\\/iShot_2023-05-09_15.46.35.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/approaches\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Our Approach\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/#website\",\"url\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/\",\"name\":\"Neural Radiance Imaging\",\"description\":\"Monocular Depth Estimation with Focal Stack\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/2023team4\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Our Approach - Neural Radiance Imaging","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/","og_locale":"en_US","og_type":"article","og_title":"Our Approach - Neural Radiance Imaging","og_description":"Here we describe each component of our pipeline for estimating depth from focal stack. Multiplane Image (MPI) We represent the scene with Multiplane Image, which is a set of fronto-parallel planes at different predefined depths of the scene. Each plane contains RGB value and alpha or transparency value at a specific pixel location and depth &hellip; Continue reading \"Our Approach\"","og_url":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/","og_site_name":"Neural Radiance Imaging","article_modified_time":"2023-05-09T09:10:06+00:00","og_image":[{"url":"http:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png","type":"","width":"","height":""}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/","url":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/","name":"Our Approach - Neural Radiance Imaging","isPartOf":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/#website"},"primaryImageOfPage":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/#primaryimage"},"image":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/#primaryimage"},"thumbnailUrl":"http:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png","datePublished":"2022-12-18T02:50:00+00:00","dateModified":"2023-05-09T09:10:06+00:00","breadcrumb":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/#primaryimage","url":"http:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png","contentUrl":"http:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-content\/uploads\/sites\/75\/2023\/05\/iShot_2023-05-09_15.46.35.png"},{"@type":"BreadcrumbList","@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/approaches\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/"},{"@type":"ListItem","position":2,"name":"Our Approach"}]},{"@type":"WebSite","@id":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/#website","url":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/","name":"Neural Radiance Imaging","description":"Monocular Depth Estimation with Focal Stack","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/pages\/23","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/users\/151"}],"replies":[{"embeddable":true,"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/comments?post=23"}],"version-history":[{"count":19,"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/pages\/23\/revisions"}],"predecessor-version":[{"id":130,"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/pages\/23\/revisions\/130"}],"wp:attachment":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/2023team4\/wp-json\/wp\/v2\/media?parent=23"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}