{"id":2,"date":"2023-05-01T17:47:50","date_gmt":"2023-05-01T17:47:50","guid":{"rendered":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/?page_id=2"},"modified":"2023-05-10T02:07:09","modified_gmt":"2023-05-10T02:07:09","slug":"sample-page","status":"publish","type":"page","link":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/","title":{"rendered":"OUR CONTRIBUTIONS"},"content":{"rendered":"\n<p><strong>Visual Odometry <\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png\" alt=\"\" class=\"wp-image-29\" width=\"797\" height=\"405\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png 1024w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-300x153.png 300w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-768x391.png 768w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1536x781.png 1536w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-2048x1042.png 2048w\" sizes=\"auto, (max-width: 706px) 89vw, (max-width: 767px) 82vw, 740px\" \/><\/figure>\n\n\n\n<p>We use <a href=\"https:\/\/arxiv.org\/pdf\/1409.4842.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><mark class=\"has-inline-color has-vivid-cyan-blue-color\">InceptionNet<\/mark><\/a> backbone with appropriate changes in the final fully-connected layers such that we can regress over the 7D pose i.e. position (x, y, z) and the quaternion rotation. <\/p>\n\n\n\n<p>We train this model on a dataset collected in the basement of Smith Hall using Aria Glasses. For establishing our baseline, we have also tested our model on the King&#8217;s College Dataset in order to compare it against <a href=\"https:\/\/arxiv.org\/pdf\/1505.07427v4.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><mark class=\"has-inline-color has-vivid-cyan-blue-color\">PoseNet<\/mark><\/a> implementation. <\/p>\n\n\n\n<p>Using the RGB camera ensures that the predicted pose is quite accurate with the only downside of a higher power consumption due to the continuous camera frame input. <\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><\/p>\n<cite><strong>Inertial Odometry <\/strong><\/cite><\/blockquote>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-3-1024x593.png\" alt=\"\" class=\"wp-image-61\" width=\"757\" height=\"438\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-3-1024x593.png 1024w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-3-300x174.png 300w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-3-768x444.png 768w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-3-1536x889.png 1536w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-3-2048x1185.png 2048w\" sizes=\"auto, (max-width: 706px) 89vw, (max-width: 767px) 82vw, 740px\" \/><\/figure>\n\n\n\n<p>We use IDOL: <a href=\"https:\/\/arxiv.org\/pdf\/2102.04024.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><mark class=\"has-inline-color has-vivid-cyan-blue-color\">Inertial Deep Orientation-Estimation and Localization<\/mark><\/a> architecture as our baseline. We have trained this model on the author&#8217;s dataset and also on our dataset collected using Aria glasses. <\/p>\n\n\n\n<p>This model aims to estimate the pose using the sensor readings from the gyroscope, magnetometer, and accelerometer.<\/p>\n\n\n\n<p>These sensors can operate using very low power consumption but they accumulate drift quickly over time. Hence, they are accurate only for a short duration. <\/p>\n\n\n\n<p><strong>Combined low-power Visual-Inertial Odometry<\/strong><\/p>\n\n\n\n<p>In order to find a balance between the pros and cons of the visual odometry system and the inertial odometry system, we combine the inertial odometry model with the visual odometry model. We use the visual odometry prediction after every k<sup>th <\/sup>timestep such that we can reset the inertial odometry system which has been collecting drift till this timestep. <\/p>\n\n\n\n<p>We try to find the best possible &#8216;k&#8217; value which strikes the balance between power consumption i.e. frequency of RGB frames used v\/s the accuracy of the overall predictions. <\/p>\n\n\n\n<p><strong>Overview of the above methods<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-4-1024x372.png\" alt=\"\" class=\"wp-image-74\" width=\"890\" height=\"323\" srcset=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-4-1024x372.png 1024w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-4-300x109.png 300w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-4-768x279.png 768w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-4-1536x558.png 1536w, https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/image-4.png 1916w\" sizes=\"auto, (max-width: 706px) 89vw, (max-width: 767px) 82vw, 740px\" \/><\/figure>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><\/p>\n<\/blockquote>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Visual Odometry We use InceptionNet backbone with appropriate changes in the final fully-connected layers such that we can regress over the 7D pose i.e. position (x, y, z) and the quaternion rotation. We train this model on a dataset collected in the basement of Smith Hall using Aria Glasses. For establishing our baseline, we have &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;OUR CONTRIBUTIONS&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-2","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>OUR CONTRIBUTIONS - Sparse Image and dual IMU localization for AR glasses<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"OUR CONTRIBUTIONS - Sparse Image and dual IMU localization for AR glasses\" \/>\n<meta property=\"og:description\" content=\"Visual Odometry We use InceptionNet backbone with appropriate changes in the final fully-connected layers such that we can regress over the 7D pose i.e. position (x, y, z) and the quaternion rotation. We train this model on a dataset collected in the basement of Smith Hall using Aria Glasses. For establishing our baseline, we have &hellip; Continue reading &quot;OUR CONTRIBUTIONS&quot;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/\" \/>\n<meta property=\"og:site_name\" content=\"Sparse Image and dual IMU localization for AR glasses\" \/>\n<meta property=\"article:modified_time\" content=\"2023-05-10T02:07:09+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/\",\"url\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/\",\"name\":\"OUR CONTRIBUTIONS - Sparse Image and dual IMU localization for AR glasses\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/wp-content\\\/uploads\\\/sites\\\/91\\\/2023\\\/05\\\/Picture1-1024x521.png\",\"datePublished\":\"2023-05-01T17:47:50+00:00\",\"dateModified\":\"2023-05-10T02:07:09+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#primaryimage\",\"url\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/wp-content\\\/uploads\\\/sites\\\/91\\\/2023\\\/05\\\/Picture1-1024x521.png\",\"contentUrl\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/wp-content\\\/uploads\\\/sites\\\/91\\\/2023\\\/05\\\/Picture1-1024x521.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"OUR CONTRIBUTIONS\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/#website\",\"url\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/\",\"name\":\"Sparse Image and dual IMU localization for AR glasses\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/mscvprojects.ri.cmu.edu\\\/f23team14\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"OUR CONTRIBUTIONS - Sparse Image and dual IMU localization for AR glasses","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/","og_locale":"en_US","og_type":"article","og_title":"OUR CONTRIBUTIONS - Sparse Image and dual IMU localization for AR glasses","og_description":"Visual Odometry We use InceptionNet backbone with appropriate changes in the final fully-connected layers such that we can regress over the 7D pose i.e. position (x, y, z) and the quaternion rotation. We train this model on a dataset collected in the basement of Smith Hall using Aria Glasses. For establishing our baseline, we have &hellip; Continue reading \"OUR CONTRIBUTIONS\"","og_url":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/","og_site_name":"Sparse Image and dual IMU localization for AR glasses","article_modified_time":"2023-05-10T02:07:09+00:00","og_image":[{"url":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png","type":"","width":"","height":""}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/","url":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/","name":"OUR CONTRIBUTIONS - Sparse Image and dual IMU localization for AR glasses","isPartOf":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#website"},"primaryImageOfPage":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#primaryimage"},"image":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#primaryimage"},"thumbnailUrl":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png","datePublished":"2023-05-01T17:47:50+00:00","dateModified":"2023-05-10T02:07:09+00:00","breadcrumb":{"@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#primaryimage","url":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png","contentUrl":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-content\/uploads\/sites\/91\/2023\/05\/Picture1-1024x521.png"},{"@type":"BreadcrumbList","@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/"},{"@type":"ListItem","position":2,"name":"OUR CONTRIBUTIONS"}]},{"@type":"WebSite","@id":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/#website","url":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/","name":"Sparse Image and dual IMU localization for AR glasses","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/pages\/2","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/comments?post=2"}],"version-history":[{"count":2,"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/pages\/2\/revisions"}],"predecessor-version":[{"id":195,"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/pages\/2\/revisions\/195"}],"wp:attachment":[{"href":"https:\/\/mscvprojects.ri.cmu.edu\/f23team14\/wp-json\/wp\/v2\/media?parent=2"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}