{"id":1550,"date":"2025-08-18T11:15:02","date_gmt":"2025-08-18T09:15:02","guid":{"rendered":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/?p=1550"},"modified":"2025-12-22T14:01:02","modified_gmt":"2025-12-22T13:01:02","slug":"part-b","status":"publish","type":"post","link":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/part-b\/","title":{"rendered":"Part B"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Generalization of skills across different tasks and environments using meta-learning<br>techniques<\/h2>\n\n\n\n<h2 class=\"wp-block-heading\"><br>WP3: Federated learning in diverse environments<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">In WP3, we teach fleets of robots to share skills without sharing secrets. When robots work in private spaces like homes, they must learn from experience without exposing personal data. We solve this by stripping camera footage down to essential, anonymized features \u2014 relevant object poses and keypoints rather than faces and furniture in the background. This method protects privacy while helping robots learn faster and recognize objects in new environments. To handle complex jobs, we break long tasks into short, reusable skills that the robots can shuffle and recombine like building blocks. Finally, because every home is different, we give the robots tools to tweak their own behavior the moment they encounter a new obstacle.<\/p>\n\n\n\n<div class=\"wp-block-cover alignfull is-light has-parallax\"><div class=\"wp-block-cover__image-background wp-image-3183 size-full has-parallax\" style=\"background-position:50% 50%;background-image:url(https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/JG_1709-1-scaled.jpg)\"><\/div><span aria-hidden=\"true\" class=\"wp-block-cover__background has-pure-white-100-background-color has-background-dim\"><\/span><div class=\"wp-block-cover__inner-container has-global-padding is-layout-constrained wp-block-cover-is-layout-constrained\">\n<p class=\"has-text-align-center wp-block-paragraph\"><\/p>\n<\/div><\/div>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Recent Highlights from WP3<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\"><strong><a href=\"https:\/\/tapas-gmm.cs.uni-freiburg.de\/\" data-type=\"link\" data-id=\"https:\/\/tapas-gmm.cs.uni-freiburg.de\/\">Task Parametrized Gaussian Mixture Models (TP-GMM)<\/a><\/strong> offer a promising way for robots to learn by imitation. But taking this math from the simulator to the real world remains difficult. We solve three specific problems to make it work. <br><br>First, robot hands move in arcs, not straight lines. We capture this by modeling velocity on its natural curved surface rather than a flat grid. Second, we use these motion patterns to slice long tasks\u2014like cooking a meal\u2014into distinct skills, such as chopping or stirring. This allows the robot to shuffle and recombine these actions to solve entirely new problems. Third, we teach the robot to see what matters. When stirring, for instance, it automatically tracks the pot and ladle while ignoring the cutting board.<br><br>The results are compelling. Our robots learn complex tasks after seeing them performed only five times\u2014one-twentieth the data usually required. But beyond speed, they achieve a flexibility that baselines cannot match: adapting these learned skills to completely new objects and changing environments.<\/p>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n<div>\n\t<div class=\"bl-relative bl-w-full bl-pb-[56.25%]\">\n\t\t\t\t<iframe\n\t\t\t\tclass=\"bl-absolute bl-top-0 bl-left-0 bl-w-full bl-h-full\"\n\t\t\t\taria-label=\"Video from the VIMP media portal of the University of Freiburg\"\n\t\t\t\taria-description=\"\"\n\t\t\t\tsrc=\"https:\/\/videoportal.uni-freiburg.de\/media\/embed?key=d8b21589e267dc54fdcd9e62022a39ce&#038;autoplay=false&#038;autolightsoff=false&#038;loop=false&#038;chapters=false&#038;responsive=true&#038;loadonclick=true&#038;thumb=true\"\n\t\t\t\tdata-src=\"https:\/\/videoportal.uni-freiburg.de\/media\/embed?key=d8b21589e267dc54fdcd9e62022a39ce&#038;autoplay=false&#038;autolightsoff=false&#038;loop=false&#038;chapters=false&#038;responsive=true&#038;loadonclick=true\"\n\t\t\t\tframeborder=\"0\"\n\t\t\t\tallowfullscreen=\"allowfullscreen\"\n\t\t\t\tallowtransparency=\"true\"\n\t\t\t\tscrolling=\"no\"\n\t\t\t><\/iframe>\n\t\t\t\t<\/div>\n<\/div>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1474\" height=\"1034\" src=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/image-6.png\" alt=\"\" class=\"wp-image-3270\" srcset=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/image-6.png 1474w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/image-6-300x210.png 300w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/image-6-1024x718.png 1024w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/image-6-768x539.png 768w\" sizes=\"auto, (max-width: 1474px) 100vw, 1474px\" \/><figcaption class=\"wp-element-caption\">Figure 1:<em> TAPAS-GMM: Task Auto-Parameterized And Skill Segmented GMM learns task-parameterized manipulation policies from only a handful of complex task demonstrations. First, we segment the full task demonstrations into the involved skills. For each segment, we then automatically select the relevant task parameters and learn a Riemannian Task-Parameterized Hidden Markov Model (TP-HMM). The skill models can be cascaded and reused f lexibly. To enable modeling of the robot\u2019s end-effector velocity, we further leverage a novel action factorization and Riemannian geometry.<\/em><\/figcaption><\/figure>\n\n\n\n<div class=\"is-layout-flex wp-block-ufr-buttons-is-layout-flex\">\n\t\n\n<div class=\"wp-block-ufr-button\"> \n\t<a href=\"https:\/\/tapas-gmm.cs.uni-freiburg.de\/\"\n\t\t\t\taria-label=\"\"\n\t\tclass=\"\n\t\tbl-text-[1.125rem] bl-leading-[1.5625rem] bl-px-[26px] bl-py-[7px] bl-border-[2px] bl-gap-[8px] !bl-decoration-2 !bl-underline-offset-[5px] focus-visible:bl-outline-offset-[6px]\t\tbl-no-underline hover:bl-underline bl-font-medium\n\t\t\n\t\tfocus-visible:bl-outline-dotted focus-visible:bl-outline-2 bl-border-identity-black bl-text-identity-black\n\t   hover:bl-bg-identity-blue-80 hover:bl-border-identity-blue-80 hover:bl-text-pure-white\n\t   active:bl-bg-identity-blue active:bl-text-pure-white active:bl-border-identity-blue\n\t   !bl-outline-identity-black dark:bl-text-pure-white dark:!bl-border-pure-white\n\t\tdark:hover:bl-text-identity-black dark:hover:bl-bg-identity-yellow-40 dark:hover:!bl-border-identity-yellow-40\n\t\tdark:active:bl-text-identity-black dark:active:bl-bg-identity-yellow dark:active:!bl-border-identity-yellow\n\t\tdark:!bl-outline-pure-white\t\tbl-inline-flex bl-flex-row bl-items-center motion-safe:bl-transition-colors\n\t\tmotion-safe:bl-duration-[400ms] justify-center bl-hyphens-auto\n\t\t\">\n\t\t\t\t<span class=\"bl-inline-block\">\n\t\t\t<strong>Find out more<\/strong>\t\t<\/span>\n\t<\/a>\n<\/div>\n\n<\/div>\n\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h4 class=\"wp-block-heading\"><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Robots learn slowly because raw video data is overwhelming. To speed them up, we must boil down camera footage into simple, compact summaries. The problem is that current algorithms assume they can see the whole scene. But in the real world, objects are hidden by clutter or slip out of the camera\u2019s frame. When a robot cannot see an object, or when the object moves close enough to look different, the robot usually loses track of it.<br><br>We introduce our approach, <strong><a href=\"https:\/\/bask.cs.uni-freiburg.de\/\" data-type=\"link\" data-id=\"https:\/\/bask.cs.uni-freiburg.de\/\">Bayesian Scene Keypoints (BASK)<\/a> <\/strong>to solve this puzzle. It is a probabilistic method that tracks specific points on an object \u2014like the handle of cup\u2014 regardless of how far away they are. BASK resolves the confusion caused by missing information; it knows where a tool is even when it is hidden, and it can track a symmetrical cup that looks the same from different angles. We tested this using a camera mounted on the robot\u2019s moving wrist. The system mastered difficult tasks involving multiple objects, outperforming standard techniques. It proved stubborn in the face of messy desks, blocked views, and limited field-of-view, even handling objects it had never seen before.<\/p>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n<div>\n\t<div class=\"bl-relative bl-w-full bl-pb-[56.25%]\">\n\t\t\t\t<iframe\n\t\t\t\tclass=\"bl-absolute bl-top-0 bl-left-0 bl-w-full bl-h-full\"\n\t\t\t\taria-label=\"Video from the VIMP media portal of the University of Freiburg\"\n\t\t\t\taria-description=\"\"\n\t\t\t\tsrc=\"https:\/\/videoportal.uni-freiburg.de\/media\/embed?key=f0b6bff5bb5470c6631ce4e69fa3cded&#038;autoplay=false&#038;autolightsoff=false&#038;loop=false&#038;chapters=false&#038;responsive=true&#038;loadonclick=true&#038;thumb=true\"\n\t\t\t\tdata-src=\"https:\/\/videoportal.uni-freiburg.de\/media\/embed?key=f0b6bff5bb5470c6631ce4e69fa3cded&#038;autoplay=false&#038;autolightsoff=false&#038;loop=false&#038;chapters=false&#038;responsive=true&#038;loadonclick=true\"\n\t\t\t\tframeborder=\"0\"\n\t\t\t\tallowfullscreen=\"allowfullscreen\"\n\t\t\t\tallowtransparency=\"true\"\n\t\t\t\tscrolling=\"no\"\n\t\t\t><\/iframe>\n\t\t\t\t<\/div>\n<\/div>\n\n\n\n<div style=\"height:50px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2560\" height=\"479\" src=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-scaled.png\" alt=\"\" class=\"wp-image-3267\" srcset=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-scaled.png 2560w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-300x56.png 300w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-1024x192.png 1024w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-768x144.png 768w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-1536x287.png 1536w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/Figure-BASK-2048x383.png 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><figcaption class=\"wp-element-caption\">Figure 2: <em>Individual camera observations are often ambiguous. For example, from the observation on the left, the rotation of the saucepan cannot be uniquely inferred. When tracking object keypoints, this leads to multimodal localization hypotheses. We overcome this problem by considering the image in context. We find likely correspondences across image scales and then use spatial or temporal context to resolve the ambiguities. Our model further detects when a keypoint is likely not observed, enabling our approach to track occluded objects and objects outside the current field of view as shown on the right.<\/em><\/figcaption><\/figure>\n\n\n\n<div class=\"is-layout-flex wp-block-ufr-buttons-is-layout-flex\">\n\t\n\n<div class=\"wp-block-ufr-button\"> \n\t<a href=\"https:\/\/bask.cs.uni-freiburg.de\/\"\n\t\t\t\taria-label=\"\"\n\t\tclass=\"\n\t\tbl-text-[1.125rem] bl-leading-[1.5625rem] bl-px-[26px] bl-py-[7px] bl-border-[2px] bl-gap-[8px] !bl-decoration-2 !bl-underline-offset-[5px] focus-visible:bl-outline-offset-[6px]\t\tbl-no-underline hover:bl-underline bl-font-medium\n\t\t\n\t\tfocus-visible:bl-outline-dotted focus-visible:bl-outline-2 bl-border-identity-black bl-text-identity-black\n\t   hover:bl-bg-identity-blue-80 hover:bl-border-identity-blue-80 hover:bl-text-pure-white\n\t   active:bl-bg-identity-blue active:bl-text-pure-white active:bl-border-identity-blue\n\t   !bl-outline-identity-black dark:bl-text-pure-white dark:!bl-border-pure-white\n\t\tdark:hover:bl-text-identity-black dark:hover:bl-bg-identity-yellow-40 dark:hover:!bl-border-identity-yellow-40\n\t\tdark:active:bl-text-identity-black dark:active:bl-bg-identity-yellow dark:active:!bl-border-identity-yellow\n\t\tdark:!bl-outline-pure-white\t\tbl-inline-flex bl-flex-row bl-items-center motion-safe:bl-transition-colors\n\t\tmotion-safe:bl-duration-[400ms] justify-center bl-hyphens-auto\n\t\t\">\n\t\t\t\t<span class=\"bl-inline-block\">\n\t\t\t<strong>Find out more<\/strong>\t\t<\/span>\n\t<\/a>\n<\/div>\n\n<\/div>\n\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">WP4: Meta-learning with meta-features and augmentation<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">WP4 focuses on developing methods to enable accurate transfer of robotic policies in real-world household environments through advanced meta-learning techniques. The goal is to make learned behaviors adapt quickly and safely to new homes, object arrangements, and interaction dynamics without requiring extensive retraining or perfectly curated demonstrations. To achieve this, WP4 investigates meta-learning across diverse simulated environments. A key innovation is the development of novel end-to-end methods to compute environment meta-features that enhance policy transfer efficiency. Additionally, WP4 explores the generation of synthetic, varied simulated environments to facilitate faster and more reliable transfer from simulation to real-world settings. This work is crucial for enabling scalable, adaptable, and safe assistive robots that can operate effectively in dynamic household environments.<\/p>\n\n\n\n<div class=\"wp-block-cover alignfull is-light has-parallax\"><div class=\"wp-block-cover__image-background wp-image-3181 size-full has-parallax\" style=\"background-position:50% 50%;background-image:url(https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/JG_1920-scaled.jpg)\"><\/div><span aria-hidden=\"true\" class=\"wp-block-cover__background has-pure-white-100-background-color has-background-dim\"><\/span><div class=\"wp-block-cover__inner-container has-global-padding is-layout-constrained wp-block-cover-is-layout-constrained\">\n<p class=\"has-text-align-center wp-block-paragraph\"><\/p>\n<\/div><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Recent Highlights from WP4<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">WP4 worked on meta-learning for task sequencing by creating synthetic graph-based sequencing environments, defining utility scores for (partial) sequences, and successfully meta-learning a Transformer-based sequencing policy that can choose good next tasks even from suboptimal context. In parallel, they improved cross-episode Meta-RL efficiency by shifting the outer loop from on-policy (PPO) to off-policy (SAC), achieving ~2.5\u00d7 faster learning on ML1 Reach and SOTA performance on ML1 Push from the MetaWorld benchmark, while noting memory limits as task diversity grows (replay buffers become a bottleneck). Next steps include extending the utility function to enable mid-sequence recovery and exploring how these sequencing strategies can support WP1 with less reliance on optimal trajectories.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2560\" height=\"919\" src=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-scaled.jpg\" alt=\"\" class=\"wp-image-3199\" srcset=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-scaled.jpg 2560w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-300x108.jpg 300w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-1024x368.jpg 1024w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-768x276.jpg 768w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-1536x552.jpg 1536w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/transformer-architecture-compressed-2048x736.jpg 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><figcaption class=\"wp-element-caption\">Figure 3: <em>Task Sequencing Transformer Architecture. We train a Transformer policy to select the next task. The model uses past experience to predict which task will most improve future performance, enabling more efficient learning than fixed curricula.<\/em><\/figcaption><\/figure>\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2217\" height=\"508\" src=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark.jpg\" alt=\"\" class=\"wp-image-3198\" srcset=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark.jpg 2217w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark-300x69.jpg 300w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark-1024x235.jpg 1024w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark-768x176.jpg 768w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark-1536x352.jpg 1536w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/synthetic-sequencing-benchmark-2048x469.jpg 2048w\" sizes=\"auto, (max-width: 2217px) 100vw, 2217px\" \/><figcaption class=\"wp-element-caption\">Figure 4: <em>Synthetic Task Sequencing Benchmark. A lightweight benchmark where tasks form a graph and each task has a measurable \u201cutility.\u201d It lets us test and compare sequencing strategies quickly before transferring insights to other settings.<\/em><\/figcaption><\/figure>\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">WP5: Meta-learning with dynamic algorithm configuration for reinforcement learning<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The WP5 focuses on enhancing learning efficiency by enabling dynamic configuration of learning agents, allowing their learning parameters to be adapted on-the-fly to the task at hand. This is essential because deep reinforcement learning (RL) algorithms, which often underpin robot learning, are highly sensitive to their configurations. Unlike many other machine learning settings, the data from which we learn changes continuously during the learning process. These changes arise both from learning to solve new tasks and from exploring different behaviors within the same task. Consequently, different stages of learning require different algorithm configurations to achieve optimal results. If this adaptation is not carefully managed, it can severely hinder or even prevent successful learning. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">To enable efficient, scalable, and robust learning, we first focus on identifying suitable meta-features that facilitate transfer of configuration policies across varying problems and environments. Building on this, we aim to develop dynamic configuration policies that optimize reinforcement learning efficiency. The ultimate goal of this work package is to create dynamic configuration policies that are transferable even across vastly different problem environments.<\/p>\n\n\n\n<div class=\"wp-block-cover alignfull is-light has-parallax\"><div class=\"wp-block-cover__image-background wp-image-3184 size-full has-parallax\" style=\"background-position:50% 50%;background-image:url(https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/JG_1743-1-scaled.jpg)\"><\/div><span aria-hidden=\"true\" class=\"wp-block-cover__background has-pure-white-100-background-color has-background-dim\"><\/span><div class=\"wp-block-cover__inner-container has-global-padding is-layout-constrained wp-block-cover-is-layout-constrained\">\n<p class=\"has-text-align-center wp-block-paragraph\"><\/p>\n<\/div><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">Recent Highlights from WP5<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">WP5 (Dynamic Algorithm Configuration for RL) explored the effects of hyperparameter optimization on RL. Showing work on both hand-designed and inferred meta-features of environments, Dynamic Algorithm Configuration boosts adaptability and meta-learning in such parameterized environment settings with similar dynamics. Their current focus is on improving zero-shot generalization to novel environments with differing dynamics.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2560\" height=\"1359\" src=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-scaled.jpg\" alt=\"\" class=\"wp-image-2827\" srcset=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-scaled.jpg 2560w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-300x159.jpg 300w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-1024x543.jpg 1024w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-768x408.jpg 768w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-1536x815.jpg 1536w, https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-content\/uploads\/sites\/171\/WP5_Andre_Slide-13-1-2048x1087.jpg 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><figcaption class=\"wp-element-caption\">Figure 5: <em>CARL allows to configure and modify existing environments through the use of context. This context can be made visible to agents through context features to inform them directly about the current instantiation of the context. Learning with and across different contexts, enables learning of more general behaviors. Reinforcement learning agents that observe this context during training can learn how to adapt their behavior accordingly.<\/em><\/figcaption><\/figure>\n\n\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"is-layout-flex wp-block-ufr-buttons-is-layout-flex\">\n\t\n\n<div class=\"wp-block-ufr-button\"> \n\t<a href=\"https:\/\/uni-freiburg.de\/brainlinks-braintools\/part-c\/\"\n\t\t\t\taria-label=\"\"\n\t\tclass=\"\n\t\tbl-text-[1.125rem] bl-leading-[1.5625rem] bl-px-[26px] bl-py-[7px] bl-border-[2px] bl-gap-[8px] !bl-decoration-2 !bl-underline-offset-[5px] focus-visible:bl-outline-offset-[6px]\t\tbl-no-underline hover:bl-underline bl-font-medium\n\t\t\n\t\tfocus-visible:bl-outline-dotted focus-visible:bl-outline-2 bl-border-identity-black bl-text-identity-black\n\t   hover:bl-bg-identity-blue-80 hover:bl-border-identity-blue-80 hover:bl-text-pure-white\n\t   active:bl-bg-identity-blue active:bl-text-pure-white active:bl-border-identity-blue\n\t   !bl-outline-identity-black dark:bl-text-pure-white dark:!bl-border-pure-white\n\t\tdark:hover:bl-text-identity-black dark:hover:bl-bg-identity-yellow-40 dark:hover:!bl-border-identity-yellow-40\n\t\tdark:active:bl-text-identity-black dark:active:bl-bg-identity-yellow dark:active:!bl-border-identity-yellow\n\t\tdark:!bl-outline-pure-white\t\tbl-inline-flex bl-flex-row bl-items-center motion-safe:bl-transition-colors\n\t\tmotion-safe:bl-duration-[400ms] justify-center bl-hyphens-auto\n\t\t\">\n\t\t\t\t<span class=\"bl-inline-block\">\n\t\t\tContinue to Part C\t\t<\/span>\n\t<\/a>\n<\/div>\n\n<\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>Generalization of skills across different tasks and environments using meta-learningtechniques WP3: Federated learning in diverse environments In WP3, we teach fleets of robots to share skills without sharing secrets. When robots work in private spaces like homes, they must learn from experience without exposing personal data. We solve this by stripping camera footage down to<\/p>\n","protected":false},"author":803,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_trash_the_other_posts":false,"editor_notices":[],"_show_in_pageentry_slider":false,"_pageentry_slider_title":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1550","post","type-post","status-publish","format-standard","hentry","category-allgemein"],"featured_image_url":null,"featured_image_alt":"","_links":{"self":[{"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/posts\/1550","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/users\/803"}],"replies":[{"embeddable":true,"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/comments?post=1550"}],"version-history":[{"count":30,"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/posts\/1550\/revisions"}],"predecessor-version":[{"id":3397,"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/posts\/1550\/revisions\/3397"}],"wp:attachment":[{"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/media?parent=1550"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/categories?post=1550"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/uni-freiburg.de\/brainlinks-braintools\/wp-json\/wp\/v2\/tags?post=1550"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}