Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
RESPONDING TO REPRESENTATIONS OF PHYSICAL ELEMENTS
Document Type and Number:
WIPO Patent Application WO/2020/219381
Kind Code:
A1
Abstract:
In some implementations, a method includes obtaining, by a virtual intelligent agent (VIA), a perceptual property vector (PPV) for an enhanced reality (ER) representation of a physical element. In some implementations, the PPV includes one or more perceptual characteristic values characterizing the ER representation of the physical element. In some implementations, the method includes instantiating an ER representation of the VIA in an ER setting that includes the ER representation of the physical element and an affordance that is associated with the ER representation of the physical element. In some implementations, the method includes generating, by the VIA, an action for the ER representation of the VIA based on the PPV. In some implementations, the method includes displaying a manipulation of the affordance by the ER representation of the VIA in order to effectuate the action generated by the VIA.

Application Number:
PCT/US2020/028966
Publication Date:
October 29, 2020
Filing Date:
April 20, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
RAITONSA DYNAMICS LLC (US)
International Classes:
G06F3/01; G06F3/03; G06F3/0481; G06K9/00
Foreign References:
US20120179651A12012-07-12
US20170187852A12017-06-29
US20180365898A12018-12-20
US20050143138A12005-06-30
CN104142995B2017-09-26
Attorney, Agent or Firm:
BHATNAGAR, Prateek (US)
Download PDF:
Claims:
What is claimed is:

1. A method comprising:

at a device including a non-transitory memory and one or more processors coupled with the non-transitory memory:

obtaining, by an intelligent agent (IA), one or more perceptual characteristic values characterizing an ER representation of a physical element;

instantiating an ER representation of the IA in an ER setting that includes the ER representation of the physical element and an affordance that is associated with the ER representation of the physical element;

generating, by the IA, an action for the ER representation of the IA based on the one or more perceptual characteristic values; and

displaying a manipulation of the affordance by the ER representation of the IA in order to effectuate the action generated by the IA.

2. The method of claim 1, further comprising:

populating a potentially detectable set of the IA based on the one or more perceptual characteristic values.

3. The method of claim 2, wherein populating the potentially detectable set of the IA allows the ER representation of the IA to detect or interact with the ER representation of the physical element.

4. The method of any of claims 2 and 3, wherein populating the potentially detectable set of the IA allows the ER representation of the IA to detect a texture of the ER representation of the physical element.

5. The method of any of claims 2-4, wherein populating the potentially detectable set of the IA allows the ER representation of the IA to detect a hardness of the ER representation of the physical element.

6. The method of any of claims 2-5, wherein populating the potentially detectable set of the IA allows the ER representation of the IA to detect a smell associated with the ER representation of the physical element.

7. The method of claim 6, wherein the ER representation of the IA detects a degree of the smell based on a distance between the ER representation of the IA and the ER representation of the physical element.

8. The method of any of claims 2-7, wherein populating the potentially detectable set comprises:

populating a potentially visible subset of the IA based on the one or more perceptual characteristic values.

9. The method of any of claims 2-8, wherein populating the potentially detectable set comprises:

populating a potentially audible subset of the IA based on the one or more perceptual characteristic values.

10. The method of any of claims 2-9, wherein populating the potentially detectable set comprises:

populating a potentially smellable subset of the IA based on the one or more perceptual characteristic values.

11. The method of any of claims 1-10, wherein the action includes the ER representation of the IA detecting the ER representation of the physical element.

12. The method of any of claims 1-11, wherein the action includes the ER representation of the IA touching the ER representation of the physical element.

13. The method of any of claims 1-12, wherein the action includes the ER representation of the IA picking-up the ER representation of the physical element.

14. The method of any of claims 1-13, wherein the action includes the ER representation of the IA modifying the ER representation of the physical element.

15. The method of any of claims 1-14, wherein the action includes the ER representation of the IA breaking the ER representation of the physical element.

16. The method of any of claims 1-15, wherein the action includes the ER representation of the IA changing a state of the ER representation of the physical element.

17. The method of any of claims 1-16, wherein the action includes the ER representation of the IA smelling an odor associated with the ER representation of the physical element.

18. The method of any of claims 1-17, wherein the action includes the ER representation of the IA hearing a sound generated by the ER representation of the physical element.

19. The method of any of claims 1-18, wherein obtaining the one or more perceptual characteristic values comprises:

receiving the one or more perceptual characteristic values from another device that generated the one or more perceptual characteristic values.

20. The method of any of claims 1-19, wherein obtaining the one or more perceptual characteristic values comprises:

retrieving the one or more perceptual characteristic values from the non-transitory memory or a remote data source.

21. The method of any of claims 1-20, wherein obtaining the one or more perceptual characteristic values comprises obtaining a perceptual property vector (PPV) that includes the one or more perceptual characteristic values.

22. The method of any of claims 1-21, further comprising:

obtaining, by the IA, one or more perceptual characteristic values characterizing an ER representation of a second physical element;

displaying a second affordance that is associated with the ER representation of the second physical element;

generating, by the IA, a second action for the ER representation of the VIA based on the one or more perceptual characteristic values characterizing the ER representation of the second physical element; and

displaying a manipulation of the second affordance by the ER representation of the IA in order to effectuate the second action generated by the IA.

23. A device comprising:

one or more processors;

a non- transitory memory;

one or more displays; and

one or more programs stored in the non-transitory memory, which, when executed by the one or more processors, cause the device to perform any of the methods of claims 1-22.

24. A non-transitory memory storing one or more programs, which, when executed by one or more processors of a device with a display, cause the device to perform any of the methods of claims 1-22.

25. A device comprising:

one or more processors;

a non-transitory memory;

a display; and

means for causing the device to perform any of the methods of claims 1-22.

Description:
RESPONDING TO REPRESENTATIONS OF PHYSICAL

ELEMENTS

CROSS-REFERENCE TO RELATED APPLICATION

[0001] This application claims the benefit of U.S. Provisional Patent App. No.

62/837,287, filed on April 23, 2019, which is incorporated by reference in its entirety.

TECHNICAL FIELD

[0002] The present disclosure generally relates to responding to representations of physical elements.

BACKGROUND

[0003] Some devices are capable of generating and presenting enhanced reality (ER) settings. Some ER settings include virtual settings that are simulated replacements of physical settings. Some ER settings include augmented settings that are modified versions of physical settings. Some devices that present ER settings include mobile communication devices such as smartphones, head-mountable displays (HMDs), eyeglasses, heads-up displays (HUDs), and optical projection systems. Most previously available devices that present ER settings are ineffective at providing the same level of interaction as physical settings.

BRIEF DESCRIPTION OF THE DRAWINGS

[0004] So that the present disclosure can be understood by those of ordinary skill in the art, a more detailed description may be had by reference to aspects of some illustrative implementations, some of which are shown in the accompanying drawings.

[0005] Figures 1A-1D are diagrams illustrating a virtual intelligent agent (VIA) interacting with enhanced reality representations of physical elements in accordance with some implementations .

[0006] Figure 2 is a block diagram of an example device in accordance with some implementations .

[0007] Figures 3A-3B are flowchart representations of a method of detecting and interacting with ER representations of physical elements in accordance with some implementations . [0008] Figure 4 is a block diagram of a device enabled with various components that enable a VIA to detect and interact with ER representations of physical elements in accordance with some implementations.

[0009] In accordance with common practice the various features illustrated in the drawings may not be drawn to scale. Accordingly, the dimensions of the various features may be arbitrarily expanded or reduced for clarity. In addition, some of the drawings may not depict all of the components of a given system, method or device. Finally, like reference numerals may be used to denote like features throughout the specification and figures.

SUMMARY

[0010] Various implementations disclosed herein include devices, systems, and methods that enable a virtual intelligent agent (VIA) (e.g., an intelligent agent (IA)) to detect and interact with enhanced reality (ER) representations of physical elements. In various implementations, a device includes a non-transitory memory and one or more processors coupled with the non-transitory memory. In some implementations, a method includes obtaining, by a virtual intelligent agent (VIA), a perceptual property vector (PPV) for an ER representation of a physical element. In some implementations, the PPV includes one or more perceptual characteristic values characterizing the ER representation of the physical element. In some implementations, the method includes instantiating an ER representation of the VIA in an ER setting that includes the ER representation of the physical element and an affordance that is associated with the ER representation of the physical element. In some implementations, the method includes generating, by the VIA, an action for the ER representation of the VIA based on the PPV. In some implementations, the method includes displaying a manipulation of the affordance by the ER representation of the VIA in order to effectuate the action generated by the VIA.

[0011] In accordance with some implementations, a device includes one or more processors, a non-transitory memory, and one or more programs. In some implementations, the one or more programs are stored in the non-transitory memory and are executed by the one or more processors. In some implementations, the one or more programs include instructions for performing or causing performance of any of the methods described herein. In accordance with some implementations, a non-transitory computer readable storage medium has stored therein instructions that, when executed by one or more processors of a device, cause the device to perform or cause performance of any of the methods described herein. In accordance with some implementations, a device includes one or more processors, a non-transitory memory, and means for performing or causing performance of any of the methods described herein.

DESCRIPTION

[0012] Numerous details are described in order to provide a thorough understanding of the example implementations shown in the drawings. However, the drawings merely show some example aspects of the present disclosure and are therefore not to be considered limiting. Those of ordinary skill in the art will appreciate that other effective aspects and/or variants do not include all of the specific details described herein. Moreover, well-known systems, methods, components, devices and circuits have not been described in exhaustive detail so as not to obscure more pertinent aspects of the example implementations described herein.

[0013] Various examples of electronic systems and techniques for using such systems in relation to various enhanced reality technologies are described.

[0014] A physical setting refers to a world with which various persons can sense and/or interact without use of electronic systems. Physical settings, such as a physical park, include physical elements, such as, for example, physical wildlife, physical trees, and physical plants. Persons can directly sense and/or otherwise interact with the physical setting, for example, using one or more senses including sight, smell, touch, taste, and hearing.

[0015] An enhanced reality (ER) setting, in contrast to a physical setting, refers to an entirely (or partly) computer-produced setting that various persons, using an electronic system, can sense and/or otherwise interact with. In ER, a person’ s movements are in part monitored, and, responsive thereto, at least one attribute corresponding to at least one virtual object in the ER setting is changed in a manner that is consistent with one or more physical laws. For example, in response to an ER system detecting a person looking upward, the ER system may adjust various audio and graphics presented to the person in a manner consistent with how such sounds and appearances would change in a physical setting. Adjustments to attribute(s) of virtual object(s) in an ER setting also may be made, for example, in response to representations of movement (e.g., voice commands).

[0016] A person may sense and/or interact with an ER object using one or more senses, such as sight, smell, taste, touch, and sound. For example, a person may sense and/or interact with objects that create a multi-dimensional or spatial acoustic setting. Multi-dimensional or spatial acoustic settings provide a person with a perception of discrete acoustic sources in multi dimensional space. Such objects may also enable acoustic transparency, which may selectively incorporate audio from a physical setting, either with or without computer-produced audio. In some ER settings, a person may sense and/or interact with only acoustic objects.

[0017] Virtual reality (VR) is one example of ER. A VR setting refers to an enhanced setting that is configured to only include computer-produced sensory inputs for one or more senses. A VR setting includes a plurality of virtual objects that a person may sense and/or interact with. A person may sense and/or interact with virtual objects in the VR setting through a simulation of at least some of the person’s actions within the computer-produced setting, and/or through a simulation of the person or her presence within the computer-produced setting.

[0018] Mixed reality (MR) is another example of ER. An MR setting refers to an enhanced setting that is configured to integrate computer-produced sensory inputs (e.g., virtual objects) with sensory inputs from the physical setting, or a representation of sensory inputs from the physical setting. On a reality spectrum, an MR setting is between, but does not include, a completely physical setting at one end and a VR setting at the other end.

[0019] In some MR settings, computer-produced sensory inputs may be adjusted based on changes to sensory inputs from the physical setting. Moreover, some electronic systems for presenting MR settings may detect location and/or orientation with respect to the physical setting to enable interaction between real objects (i.e., physical elements from the physical setting or representations thereof) and virtual objects. For example, a system may detect movements and adjust computer-produced sensory inputs accordingly, so that, for example, a virtual tree appears fixed with respect to a physical structure.

[0020] Augmented reality (AR) is an example of MR. An AR setting refers to an enhanced setting where one or more virtual objects are superimposed over a physical setting (or representation thereof). As an example, an electronic system may include an opaque display and one or more imaging sensors for capturing video and/or images of a physical setting. Such video and/or images may be representations of the physical setting, for example. The video and/or images are combined with virtual objects, wherein the combination is then displayed on the opaque display. The physical setting may be viewed by a person, indirectly, via the images and/or video of the physical setting. The person may thus observe the virtual objects superimposed over the physical setting. When a system captures images of a physical setting, and displays an AR setting on an opaque display using the captured images, the displayed images are called a video pass-through. Alternatively, a transparent or semi-transparent display may be included in an electronic system for displaying an AR setting, such that an individual may view the physical setting directly through the transparent or semi-transparent displays. Virtual objects may be displayed on the semi-transparent or transparent display, such that an individual observes virtual objects superimposed over a physical setting. In yet another example, a projection system may be utilized in order to project virtual objects onto a physical setting. For example, virtual objects may be projected on a physical surface, or as a holograph, such that an individual observes the virtual objects superimposed over the physical setting.

[0021] An AR setting also may refer to an enhanced setting in which a representation of a physical setting is modified by computer-produced sensory data. As an example, at least a portion of a representation of a physical setting may be graphically modified (e.g., enlarged), so that the modified portion is still representative of (although not a fully-reproduced version of) the originally captured image(s). Alternatively, in providing video pass-through, one or more sensor images may be modified in order to impose a specific viewpoint different than a viewpoint captured by the image sensor(s). As another example, portions of a representation of a physical setting may be altered by graphically obscuring or excluding the portions.

[0022] Augmented virtuality (AV) is another example of MR. An AV setting refers to an enhanced setting in which a virtual or computer-produced setting integrates one or more sensory inputs from a physical setting. Such sensory input(s) may include representations of one or more characteristics of a physical setting. A virtual object may, for example, incorporate a color associated with a physical element captured by imaging sensor(s). Alternatively, a virtual object may adopt characteristics consistent with, for example, current weather conditions corresponding to a physical setting, such as weather conditions identified via imaging, online weather information, and/or weather-related sensors. As another example, an AR park may include virtual structures, plants, and trees, although animals within the AR park setting may include features accurately reproduced from images of physical animals.

[0023] Various systems allow persons to sense and/or interact with ER settings. For example, a head mounted system may include one or more speakers and an opaque display. As another example, an external display (e.g., a smartphone) may be incorporated within a head mounted system. The head mounted system may include microphones for capturing audio of a physical setting, and/or image sensors for capturing images/video of the physical setting. A transparent or semi-transparent display may also be included in the head mounted system. The semi-transparent or transparent display may, for example, include a substrate through which light (representative of images) is directed to a person’s eyes. The display may also incorporate LEDs, OLEDs, liquid crystal on silicon, a laser scanning light source, a digital light projector, or any combination thereof. The substrate through which light is transmitted may be an optical reflector, holographic substrate, light waveguide, optical combiner, or any combination thereof. The transparent or semi-transparent display may, for example, transition selectively between a transparent/semi-transparent state and an opaque state. As another example, the electronic system may be a projection-based system. In a projection-based system, retinal projection may be used to project images onto a person’s retina. Alternatively, a projection-based system also may project virtual objects into a physical setting, for example, such as projecting virtual objects as a holograph or onto a physical surface. Other examples of ER systems include windows configured to display graphics, headphones, earphones, speaker arrangements, lenses configured to display graphics, heads up displays, automotive windshields configured to display graphics, input mechanisms (e.g., controllers with or without haptic functionality), desktop or laptop computers, tablets, or smartphones.

[0024] The present disclosure provides methods, systems, and/or devices that enable an enhanced reality (ER) representation of a virtual intelligent agent (VIA) to detect and interact with ER representations of physical elements. Many physical elements do not have electronic transceivers that emit data which identifies the physical elements. Such physical elements are sometimes referred to as passive physical elements. A perceptual property vector (PPV) for an ER representation of a physical element includes perceptual characteristic values that characterize the ER representation of the physical element. Populating a potentially detectable set of the VIA with the perceptual characteristic values allows the ER representation of the VIA to detect and interact with the ER representation of the physical element. The VIA utilizes the perceptual characteristic values included in the PPV to generate an action which involves an interaction between the ER representation of the VIA and the ER representation of the physical element. The ER representation of the physical element is associated with an affordance. The ER representation of the VIA manipulates the affordance associated with the ER representation of the physical element in order to effectuate the action generated by the VIA.

[0025] Figure 1A is a block diagram of an example operating environment 10 in accordance with some implementations. While pertinent features are shown, those of ordinary skill in the art will appreciate from the present disclosure that various other features have not been illustrated for the sake of brevity and so as not to obscure more pertinent aspects of the example implementations disclosed herein. To that end, as a non-limiting example, the operating environment 10 includes an electronic device 100.

[0026] In the example of Figure 1A, the electronic device 100 is held by a user (not shown). In some implementations, the electronic device 100 includes a smartphone, a tablet, a laptop, or the like. In some implementations, the electronic device 100 includes a wearable computing device that is worn by the user. For example, in some implementations, the electronic device 100 includes a head-mountable device (HMD). In some implementations, the HMD is shaped to form a receptacle that receives a device with a display (e.g., the device with the display can be slid into the HMD to serve as a display for the HMD). Alternatively, in some implementations, the HMD includes an integrated display.

[0027] In various implementations, the electronic device 100 includes a virtual intelligent agent (VIA) 110. In various implementations, the VIA 110 performs an action in order to satisfy (e.g., complete or achieve) an objective of the VIA 110. In various implementations, the VIA 110 obtains the objective from a human operator (e.g., the user of the electronic device 100). For example, in some implementations, the VIA 110 generates responses to queries that the user of the electronic device 100 inputs into the electronic device 100. In some implementations, the VIA 110 synthesizes vocal responses to voice queries that the electronic device 100 detects. In various implementations, the VIA 110 performs electronic operations on the electronic device 100. For example, the VIA 110 composes messages in response to receiving an instruction from the user of the electronic device 100. In some implementations, the VIA 110 schedules calendar events, sets timers/alarms, provides navigation directions, reads incoming messages, and/or assists the user in operating the electronic device 100. In various implementations, the VIA 110 is referred to as an intelligent agent (IA).

[0028] In various implementations, the electronic device 100 presents an enhanced reality (ER) setting 120. In some implementations, the electronic device 100 receives the ER setting 120 from another device. In some implementations, the electronic device 100 retrieves the ER setting 120 from a non-transitory memory (e.g., from a remote data source). In some implementations, the electronic device 100 generates the ER setting 120. For example, in some implementations, the electronic device 100 synthesizes the ER setting 120 based on a semantic construction of a physical setting. In various implementations, the ER setting 120 corresponds to a physical setting. In some implementations, the ER setting 120 is within a degree of similarity to a corresponding physical setting. [0029] In the example of Figure 1A, the ER setting 120 includes ER representations of physical elements 122. In some implementations, the ER representations of physical elements 122 correspond to respective physical elements in a physical setting. In such implementations, the ER representations of physical elements 122 are within a degree of similarity to their corresponding physical elements. In the example of Figure 1A, the ER setting 120 includes ER representations of bounding surfaces 124. In some implementations, the ER representations of bounding surfaces 124 correspond to respective physical surfaces in the physical setting. In such implementations, the ER representations of bounding surfaces 124 are within a degree of similarity to their corresponding physical surfaces.

[0030] In the example of Figure 1A, the ER setting 120 includes an ER television 122a, an ER couch 122b, an ER coffee table 122c, an ER television remote 122d, an ER door 122e and an ER door handle 122f. In some implementations, the ER television 122a, the ER couch 122b, the ER coffee table 122c, the ER television remote 122d, the ER door 122e and the ER door handle 122f represent a real television, a real couch, a real coffee table, a real television remote, a real door and a real door handle, respectively, located in a physical setting represented by the ER setting 120.

[0031] In various implementations, ER properties of the ER representations of physical elements 122 are within a degree of similarity to physical properties of corresponding physical elements located in the physical setting. In some implementations, visual properties of the ER representations of physical elements 122 are selected to match visual properties of the corresponding physical elements located in the physical setting. For example, a color of the ER couch 122b is within a degree of similarity to a color of the corresponding real couch. Similarly, a texture of the ER coffee table 122d is within a degree of similarity to a texture of the corresponding real coffee table.

[0032] In the example of Figure 1A, the ER representations of bounding surfaces 124 include an ER floor 124a, an ER front wall 124b and an ER side wall 124c. In some implementations, the ER floor 124a, the ER front wall 124b and the ER side wall 124c represent a real floor, a real front wall and a real side wall, respectively, of a physical setting represented by the ER setting 120. As such, the ER representations of bounding surfaces 124 are within a degree of similarity to physical surfaces in the physical setting.

[0033] In various implementations, the electronic device 100 obtains respective perceptual property vectors (PPVs) 130 for the ER representations of physical elements 122 and the ER representations of bounding surfaces 124. In various implementations, each PPV 130 includes one or more perceptual characteristic values 132 characterizing a corresponding ER representation of a physical element. For example, the PPVs 130 include a first PPV which includes a first set of perceptual characteristic values that characterize the ER television 122a (e.g., the first set of perceptual characteristic values indicate a size of the ER television 122a, a resolution of the ER television 122a, a refresh rate of the ER television 122a, etc.). Similarly, the PPVs 130 include a second PPV which includes a second set of perceptual characteristic values that characterize the ER couch 122b (e.g., the second set of perceptual characteristic values indicate a size, a color, a texture and/or a material of the ER couch 122b).

[0034] In various implementations, the perceptual characteristic values 132 characterize one or more physical properties of the ER representations of the physical elements 122. In some implementations, the perceptual characteristic values 132 characterize a texture of the ER representation. For example, the perceptual characteristic values 132 for an ER representation indicate whether the ER representation appears smooth or rough when touched by an ER object such as an ER representation of the VIA 110.

[0035] In some implementations, the perceptual characteristic values 132 characterize a hardness of the ER representation of the physical element. For example, the perceptual characteristic values 132 for the ER couch 122b characterize a hardness of an arm rest and/or a hardness of a cushion of the ER couch 122b. As another example, the perceptual characteristic values 132 for the ER floor 124a characterize a hardness of the ER floor 124a, which determines the result of dropping an ER object on the ER floor 124a. For example, if the perceptual characteristic values 132 for the ER floor 124a indicate that the ER floor 124a is as hard as concrete then dropping a delicate ER object such as a glass may result in the ER object breaking. However, if the perceptual characteristic values 132 for the ER floor 124a indicate that the ER floor 124a is as soft as carpet then dropping the delicate ER object may result in the ER object staying intact.

[0036] In various implementations, the perceptual characteristic values 132 characterize a smell of the ER representation of the physical element. For example, in some implementations, the perceptual characteristic values 132 define an odor function for the ER representation of the physical element. As an example, the perceptual characteristic values 132 for the ER couch 122b characterize how the ER couch 122b smells to an ER object such as an ER dog or an ER human. [0037] Referring to Figure IB, in some implementations, the ER setting 120 includes an ER representation 126 of the VIA 110. In the example of Figure IB, the ER representation 126 of the VIA 110 includes an ER human. In some implementations, a user of the electronic device 100 selects the ER representation 126 for the VIA 110 from a set of available ER representations. In various implementations, the ER representation 126 of the VIA 110 is customizable. For example, in some implementations, the ER representation 126 of the VIA 110 includes an ER dog, an ER robot, etc.

[0038] In various implementations, the ER representation 126 of the VIA 110 performs an action within the ER setting 120 in order to satisfy (e.g., complete or achieve) an objective of the VIA 110. In some implementations, the VIA 110 obtains the objective from a human operator (e.g., a user of the electronic device 100). In some implementations, the ER representation 126 of the VIA 110 obtains the objective from an ER representation of the human operator. For example, the ER representation of the human operator instructs the ER representation 126 of the VIA 110 to perform an action in the ER setting 120.

[0039] In various implementations, the VIA 110 performs an action or causes performance of the action by manipulating the ER representation 126 of the VIA 110 in the ER setting 120. In some implementations, the ER representation 126 of the VIA 110 is able to perform ER actions that the ER representation of the human operator is incapable of performing. In some implementations, the ER representation 126 of the VIA 110 performs ER actions based on information that the VIA 110 obtains from a physical setting. For example, the ER representation 126 of the VIA 110 nudges the ER representation of the human operator when the VIA 110 detects ringing of a doorbell in the physical setting.

[0040] Referring to Figure 1C, in various implementations, the ER representation 126 of the VIA 110 is associated with a potentially detectable set 112. In some implementations, the potentially detectable set 112 includes ER representations of physical elements that the ER representation 126 of the VIA 110 can detect (e.g., see, hear and/or smell). For example, the potentially detectable set 112 includes at least some of the ER representations of physical elements 122 in the ER setting 120. In some implementations, the potentially detectable set 112 includes perceptual characteristics values for various ER representations of physical elements that the ER representation 126 of the VIA 110 can detect.

[0041] In some implementations, the potentially detectable set 112 includes a potentially visible subset, a potentially audible subset and a potentially smellable subset. The potentially visible subset includes visual properties of the ER representations of physical elements 122 that the ER representation 126 of the VIA 110 can see (e.g., a display screen of the ER television 122a, a surface of the ER couch 122b, etc.)· The potentially audible subset includes audible properties of the ER representations of physical elements 122 that the ER representation 126 of the VIA 110 can hear (e.g., sounds emitted by the ER television 122a, and sounds made by the ER door 122e when the ER door 122e opens/closes). The potentially smellable subset includes smell properties (e.g., olfaction properties) of the ER representations of physical elements 122 that the ER representation 126 of the VIA 110 can smell (e.g., an odor of the ER couch 122b).

[0042] In some implementations, the VIA 110 populates the potentially detectable set

112 based on the perceptual characteristic values 132 included in the PPVs 130. For example, the VIA 110 populates the potentially detectable set 112 with an odor function of the ER couch 122b in order to allow the ER representation 126 of the VIA 110 to smell an odor of the ER couch 122b. Populating the potentially detectable set 112 based on the PPVs 130 of the ER representations of physical elements 122 allows the ER representation 126 of the VIA 110 to detect and interact with the ER representations of physical elements 122.

[0043] In various implementations, the VIA 110 generates an action 114 for the ER representation 126 of the VIA 110 based on the PPV(s) 130. In some implementations, the action 114 includes detecting and/or interacting with one or more of the ER representations of physical elements 122. For example, in some implementations, the action 114 includes turning ON the ER television 122a, jumping on the ER couch 122b, opening the ER door 122e, etc.

[0044] Referring to Figure ID, in various implementations, the electronic device 100 displays respective affordances 140 in association with the ER representations of physical elements 122. For example, the electronic device 100 composites a television affordance 140a in association with the ER television 122a, a couch affordance 140b in association with the ER couch 122b, a coffee table affordance 140a in association with the ER coffee table 122c, a television remote affordance 140d in association with the ER television remote 122d, a door affordance 140e in association with the ER door 122e, and a door handle affordance 140f in association with the ER door handle 122f.

[0045] In various implementations, the affordances 140 allow interaction with the corresponding ER representation of physical elements 122 in accordance with the perceptual characteristic values 132 included in their corresponding PPVs 130. For example, the television affordance 140a allows interaction with the ER television 122a in accordance with the perceptual property values 132 included in the PPV 130 for the ER television 122a (e.g., the ER representation 126 of the VIA 110 can activate the television affordance 140a to turn the ER television 122a ON or OFF). Similarly, the door handle affordance 140f allows interaction with the ER door handle 122f in accordance with the perceptual property values 132 included in the PPV 130 for the ER door handle 122f (e.g., the ER representation 126 of the VIA 110 can invoke the door handle affordance 140f to turn the ER door handle 122 f).

[0046] In some implementations, the action 114 includes activating one or more of the affordances 140 to interact with the corresponding ER representation 122. For example, in some implementations, the action 114 includes causing the ER representation 126 of the VIA 110 to move closer to the door handle affordance 140f and manipulating (e.g., activating) the door handle affordance 140f in order to turn the ER door handle 122f which can result in opening/closing of the ER door 122e. Similarly, in some implementations, the action 114 includes causing the ER representation 126 of the VIA 110 to move closer to the television remote affordance 140d and manipulating the television remote affordance 140d in order to pick-up the ER television remote 122d. After picking-up the television remote affordance 122d, the action 114 can cause the ER representation 126 of the VIA 110 to manipulate the television remote affordance 140d again in order to operate the ER television 122a via the ER television remote 122d. In the example of Figure ID, the ER representation 126 of the VIA 110 is manipulating the television affordance 140a, for example, because the action 114 is to turn the ER television 122a ON or OFF. More generally, in various implementations, the electronic device 100 displays a manipulation of one of the affordances 140 by the ER representation 126 of the VIA 110 in order to effectuate the action 114 generated by the VIA 110.

[0047] In some implementations, a head-mountable device (HMD) (not shown), being worn by a user, presents (e.g., displays) the ER setting 120 according to various implementations. In some implementations, the HMD includes an integrated display (e.g., a built-in display) that displays the ER setting 120. In some implementations, the HMD includes a head-mountable enclosure. In various implementations, the head-mountable enclosure includes an attachment region to which another device with a display can be attached. For example, in some implementations, the electronic device 100 can be attached to the head- mountable enclosure. In various implementations, the head-mountable enclosure is shaped to form a receptacle for receiving another device that includes a display (e.g., the electronic device 100). For example, in some implementations, the electronic device 100 slides/snaps into or otherwise attaches to the head-mountable enclosure. In some implementations, the display of the device attached to the head-mountable enclosure presents (e.g., displays) the ER setting 120.

[0048] Figure 2 illustrates a block diagram of an electronic device 200. In some implementations, the electronic device 200 implements the electronic device 100 shown in Figures 1A-1D. As illustrated in Figure 2, in some implementations, the electronic device 200 includes a data obtainer 210, an action generator 220, and an ER setting generator 230.

[0049] In various implementations, the data obtainer 210 obtains the PPVs 130 for the

ER representations of physical elements 122. As described herein, the PPVs 130 include respective perceptual characteristic values 132 characterizing the ER representations of physical elements 122. In some implementations, the data obtainer 210 retrieves the PPVs 130 from a non-transitory memory of the electronic device 200, or from a remote data source. In some implementations, the data obtainer 210 receives the PPVs 130 from another device that generated by the PPVs 130. In some implementations, the data obtainer 210 generates the PPVs 130 based on information encoded in a semantic construction of a physical setting. In some implementations, the data obtainer 210 provides the PPVs 130 to the action generator 220 and/or the ER setting generator 230.

[0050] In various implementations, the action generator 220 generates the action 114 based on the PPV(s) 130. In some implementations, the action 114 is for the ER representation 126 of the VIA 110 shown in Figures 1C and ID. In some implementations, the action generator 220 includes a neural network system that accepts the PPV(s) 130 and/or the perceptual characteristic values 132 as input(s) and outputs the action 114. In some implementations, the action 114 includes detecting one of the ER representations of physical elements 122 based on their corresponding perceptual characteristic values 132. In some implementations, the action 114 includes interacting with one of the ER representations of physical elements 122 based on their corresponding perceptual characteristic values 132. In some implementations, the action generator 220 provides the action 114 to the ER setting generator 230.

[0051] In various implementations, the ER setting generator 230 presents the ER setting 120. The ER setting generator 230 also displays the ER representations of physical elements 122, the ER representation 126 of the VIA 110, and the affordances 140 in association with the ER representations of physical elements 122. In some implementations, the ER setting generator 230 displays a manipulation of one of the affordances 140 associated with one of the ER representations of physical elements 122 in order to effectuate the action 114 generated by the VIA 110.

[0052] In some implementations, the ER setting generator 230 causes the ER representation 126 of the VIA 110 to move closer to the affordance 140 that is to be manipulated. After the ER representation 126 of the VIA 110 is adjacent to the affordance 140 that is to be manipulated, the ER setting generator 230 causes the ER representation 126 of the VIA 110 to manipulate (e.g., activate) the affordance 140.

[0053] Figure 3A is a flowchart representation of a method 300 of detecting and interacting with ER representations of physical elements in accordance with some implementations. In various implementations, the method 300 is performed by a device with a non-transitory memory and one or more processors coupled with the non-transitory memory (e.g., the electronic device 100 shown in Figures 1 A- ID and/or the electronic device 200 shown in Figure 2). In some implementations, the method 300 is performed by processing logic, including hardware, firmware, software, or a combination thereof. In some implementations, the method 300 is performed by a processor executing code stored in a non-transitory computer-readable medium (e.g., a memory).

[0054] As represented by block 310, in some implementations, the method 300 includes obtaining, by a virtual intelligent agent (VIA), a perceptual property vector (PPV) for an ER representation of a physical element. For example, as shown in Figure 1 A, the VIA 110 obtains the PPVs 130 for the ER representations of physical elements 122. In some implementations, the PPV includes one or more perceptual characteristic values characterizing the ER representation of the physical element. For example, as shown in Figure 1A, each PPV 130 includes one or more perceptual characteristic values 132 characterizing a corresponding one of the ER representations of the physical elements 122.

[0055] As represented by block 320, in some implementations, the method 300 includes instantiating an ER representation of the VIA in an ER setting that includes the ER representation of the physical element and an affordance that is associated with the ER representation of the physical element. For example, as shown in Figure ID, the ER setting 120 includes the ER representation 126 of the VIA 110, the ER representations of physical elements 122, and the affordances 140 that are associated with the ER representations of physical elements 122. [0056] As represented by block 330, in some implementations, the method 300 includes generating, by the VIA, an action for the ER representation of the VIA based on the PPV. For example, as shown in Figures 1C and ID, the VIA 110 generates the action 114 for the ER representation 126 of the VIA 110 based on the PPV(s) 130.

[0057] As represented by block 340, in some implementations, the method 300 includes displaying a manipulation of the affordance by the ER representation of the VIA in order to effectuate the action generated by the VIA. For example, as shown in Figure ID, the ER representation 126 of the VIA 110 is manipulating the television affordance 140a in order to effectuate the action 114 with respect to the ER television 122a (e.g., in order to control the ER television 122a, for example, in order to turn the ER television 122a ON or OFF).

[0058] Referring to Figure 3B, as represented by block 310a, in some implementations, the method 300 includes populating a potentially detectable set of the VIA based on the PPV. For example, as shown in Figure 1C, the VIA 110 populates the potentially detectable set 112 of the ER representation 126 of the VIA 110 based on the PPV(s) 130. In some implementations, populating the potentially detectable set includes populating a potentially visible subset of the VIA based on the PPV. In some implementations, populating the potentially detectable set includes populating a potentially audible subset of the VIA based on the PPV. In some implementations, populating the potentially detectable set includes populating a potentially smellable subset of the VIA based on the PPV.

[0059] As represented by block 310b, in some implementations, populating the potentially detectable set of the VIA allows the ER representation of the VIA to detect and/or interact with the ER representation of the physical element. For example, populating the potentially detectable set 112, shown in Figures 1C and ID, allows the ER representation 126 of the VIA 110 to detect and/or interact with the ER representation of physical elements 122.

[0060] As represented by block 310b, in some implementations, populating the potentially detectable set of the VIA allows the ER representation of the VIA to detect a texture of the ER representation of the physical element. For example, populating the potentially detectable set 112, shown in Figures 1C and ID, with texture characteristics of the ER couch 122b allows the ER representation 126 of the VIA 110 to detect (e.g., sense or feel) the texture of the ER couch 122b.

[0061] As represented by block 310b, in some implementations, populating the potentially detectable set of the VIA allows the ER representation of the VIA to detect a hardness of the ER representation of the physical element. For example, populating the potentially detectable set 112, shown in Figures 1C and ID, with hardness characteristics of the ER coffee table 122c allows the ER representation 126 of the VIA 110 to detect (e.g., sense or feel) the hardness of the ER coffee table 122c.

[0062] As represented by block 310b, in some implementations, populating the potentially detectable set of the VIA allows the ER representation of the VIA to detect a smell associated with the ER representation of the physical element. For example, populating the potentially detectable set 112, shown in Figures 1C and ID, with smell characteristics (e.g., an odor function) of the ER couch 122b allows the ER representation 126 of the VIA 110 to detect (e.g., smell) the odor of the ER couch 122b.

[0063] As represented by block 310b, in some implementations, the ER representation of the VIA detects a degree of the smell based on a distance between the ER representation of the VIA and the ER representation of the physical element. For example, populating the potentially detectable set 112, shown in Figures 1C and ID, with an odor function of the ER couch 122b allows the ER representation 126 of the VIA 110 to detect (e.g., smell) the odor of the ER couch 122b with varying degrees based on a distance between the ER representation 126 of the VIA 110 and the ER couch 122b.

[0064] As represented by block 310c, in some implementations, the method 300 includes receiving the PPV from another device that generated the PPV. As represented by block 310d, in some implementations, the method 300 includes retrieving the PPV from the non-transitory memory or a remote data source.

[0065] As represented by block 330a, in some implementations, the action includes the

ER representation of the VIA touching the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 touching the ER couch 122b by manipulating the couch affordance 140b.

[0066] As represented by block 330a, in some implementations, the action includes the

ER representation of the VIA picking-up the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 picking-up the ER television remote 122d by manipulating the television remote affordance 140d. [0067] As represented by block 330a, in some implementations, the action includes the

ER representation of the VIA modifying the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 modifying the ER television remote 122d (e.g., by removing ER batteries from the ER television remote 122d) by manipulating the television remote affordance 140d.

[0068] As represented by block 330a, in some implementations, the action includes the

ER representation of the VIA breaking the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 breaking the ER coffee table 122c by manipulating the coffee table affordance 140c.

[0069] As represented by block 330b, in some implementations, the action includes the

ER representation of the VIA changing a state of the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 opening/closing the ER door 122e by manipulating the door affordance 140e.

[0070] As represented by block 330c, in some implementations, the action includes the

ER representation of the VIA smelling an odor associated with (e.g., emanating from) the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 smelling an odor emanating from the ER couch 122b.

[0071] As represented by block 330d, in some implementations, the action includes the

ER representation of the VIA hearing a sound generated by (e.g., emitted by) the ER representation of the physical element. For example, in some implementations, the action 114, shown in Figure ID, includes the ER representation 126 of the VIA 110 hearing sounds generated by the ER television 122a.

[0072] As represented by block 350, in some implementations, the method includes obtaining, by the VIA, a second PPV for an ER representation of a second physical element. For example, as shown in Figure 1A, the VIA 110 obtains respective PPVs 130 for the ER representations of physical elements 122. The second PPV includes one or more perceptual characteristic values characterizing the ER representation of the second physical element. For example, as shown in Figure 1A, each PPV 130 includes a set of one or more perceptual characteristic values 132. In some implementations, the method 300 includes displaying a second affordance that is associated with the ER representation of the second physical element. For example, as shown in Figure ID, the electronic device 100 displays respective affordances 140 in association with the ER representations of physical elements 122. In some implementations, the method 300 includes generating, by the VIA, a second action for the ER representation of the VIA based on the second PPV. For example, the action 114 shown in Figures 1C and ID includes multiple actions. In some implementations, the method 300 includes displaying a manipulation of the second affordance by the ER representation of the VIA in order to effectuate the second action generated by the VIA. For example, as shown in Figure ID, the ER representation 126 of the VIA 110 manipulates one of the affordances 140 in order to effectuate the action(s) 114.

[0073] Figure 4 is a block diagram of a device 400 (e.g., the electronic device 100 shown in Figures 1A-1D and/or the electronic device 200 shown in Figure 2) in accordance with some implementations. While certain specific features are illustrated, those of ordinary skill in the art will appreciate from the present disclosure that various other features have not been illustrated for the sake of brevity, and so as not to obscure more pertinent aspects of the implementations disclosed herein. To that end, as a non-limiting example, in some implementations the device 400 includes one or more processing units (CPUs) 401, a network interface 402, a programming interface 403, a memory 404, input/output (I/O) sensors 405 and one or more communication buses 406 for interconnecting these and various other components.

[0074] In some implementations, the network interface 402 is provided to, among other uses, establish and maintain a metadata tunnel between a cloud hosted network management system and at least one private network including one or more compliant devices. In some implementations, the one or more communication buses 406 include circuitry that interconnects and controls communications between system components. The memory 404 includes high-speed random access memory, such as DRAM, SRAM, DDR RAM or other random access solid state memory devices, and may include non-volatile memory, such as one or more magnetic disk storage devices, optical disk storage devices, flash memory devices, or other non-volatile solid state storage devices. The memory 404 optionally includes one or more storage devices remotely located from the one or more CPUs 401. The memory 404 comprises a non-transitory computer readable storage medium. [0075] In some implementations, the I/O sensor 405 includes an image sensor (e.g., a camera) that captures images and/or videos of a physical setting. In some implementations, the I/O sensor 405 includes a depth sensor that captures depth data for a physical setting.

[0076] In some implementations, the memory 404 or the non-transitory computer readable storage medium of the memory 404 stores the following programs, modules and data structures, or a subset thereof including an optional operating system 408, the data obtainer 210, the action generator 220, and the ER setting generator 230. As described herein, in various implementations, the data obtainer 210 obtains a PPV (e.g., the PPVs 130 shown in Figures 1A-2). To that end, the data obtainer 210 includes instructions 210a, and heuristics and metadata 210b. As described herein, in various implementations, the action generator 220 generates an action based on the PPV (e.g., the action 114 shown in Figures lC-2). To that end, the action generator 220 includes instructions 220a, and heuristics and metadata 220b. As described herein, in various implementations, the ER setting generator 230 displays a manipulation of the affordance by the ER representation of the VIA in order to effectuate the action. To that end, the ER setting generator 230 includes instructions 230a, and heuristics and metadata 230b.

[0077] In some implementations, the VIA 110 shown in Figures 1A-1D includes an objective-effectuator. In some implementations, an objective-effectuator performs an action in order to satisfy (e.g., complete or achieve) an objective. In some implementations, an objective- effectuator is associated with a particular objective, and the objective-effectuator performs actions that improve the likelihood of satisfying that particular objective. In some implementations, ER representations of the objective-effectuators are referred to as object representations, for example, because the ER representations of the objective-effectuators represent various objects (e.g., real objects, or fictional objects). In some implementations, an objective-effectuator representing a character is referred to as a character objective-effectuator. In some implementations, a character objective-effectuator performs actions to effectuate a character objective. In some implementations, an objective-effectuator representing an equipment is referred to as an equipment objective-effectuator. In some implementations, an equipment objective-effectuator performs actions to effectuate an equipment objective. In some implementations, an objective-effectuator representing an environment is referred to as an environmental objective-effectuator. In some implementations, an environmental objective- effectuator performs environmental actions to effectuate an environmental objective. [0078] While various aspects of implementations within the scope of the appended claims are described above, it should be apparent that the various features of implementations described above may be embodied in a wide variety of forms and that any specific structure and/or function described above is merely illustrative. Based on the present disclosure one skilled in the art should appreciate that an aspect described herein may be implemented independently of any other aspects and that two or more of these aspects may be combined in various ways. For example, an apparatus may be implemented and/or a method may be practiced using any number of the aspects set forth herein. In addition, such an apparatus may be implemented and/or such a method may be practiced using other structure and/or functionality in addition to or other than one or more of the aspects set forth herein.

[0079] It will also be understood that, although the terms“first”,“second”, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another. For example, a first node could be termed a second node, and, similarly, a second node could be termed a first node, which changing the meaning of the description, so long as all occurrences of the“first node” are renamed consistently and all occurrences of the“second node” are renamed consistently. The first node and the second node are both nodes, but they are not the same node.

[0080] The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the claims. As used in the description of the embodiments and the appended claims, the singular forms“a”,“an”, and“the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will also be understood that the term“and/or” as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items. It will be further understood that the terms“comprises” and/or“comprising” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.

[0081] As used herein, the term“if’ may be construed to mean“when” or“upon” or

“in response to determining” or“in accordance with a determination” or“in response to detecting” that a stated condition precedent is true, depending on the context. Similarly, the phrase“if it is determined [that a stated condition precedent is true]” or“if [a stated condition precedent is true]” or“when [a stated condition precedent is true]” may be construed to mean “upon determining” or“in response to determining” or“in accordance with a determination” or“upon detecting” or“in response to detecting” that the stated condition precedent is true, depending on the context.