Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PERFORMING CONTEXT-RICH ATTRIBUTE-BASED SERVICES ON A HOST
Document Type and Number:
WIPO Patent Application WO/2018/106612
Kind Code:
A1
Abstract:
Some embodiments provide a novel method for configuring a set of service one or more nodes on a host to perform context-rich, attribute-based services on the host computer, which executes several data compute nodes (DCNs) in addition to the set of service nodes. The method uses a context-filtering node on the host to collect a first set of attributes associated with service rules processed by the set of service nodes on the host computer. The context filter also collects a second set of attributes associated with at least one data message flow of a DCN (e.g., of a virtual machine (VM) or container) executing on the host. After collecting the first and second sets of attributes, the context filtering node on the host compares the first and second sets of attributes to generate a service tag to represent a subset of the first set of attributes associated with the data message flow. The method associates this service tag with the data message flow. This service tag can then be used to identify the subset of attributes associated with the data message flow when a service node needs to process its attribute-based service rules for the data message flow.

Inventors:
POON ARNOLD (US)
GUNDA LAXMIKANT (US)
JAIN JAYANT (US)
SENGUPTA ANIRBAN (US)
VAIDYA SACHIN (IN)
Application Number:
PCT/US2017/064557
Publication Date:
June 14, 2018
Filing Date:
December 04, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NICIRA INC (US)
International Classes:
G06F9/448; H04L29/06
Foreign References:
US201314070360A2013-11-01
Other References:
JULIUS SCHULZ-ZANDER: "Realizing Software-Defined Wireless Networks - Achieving fine-grained Wi-Fi programmability with off-the-shelf hardware", 20 October 2016 (2016-10-20), XP055470673, Retrieved from the Internet [retrieved on 20180425]
MAHALINGAM, MALLIK; DUTT, DINESH G. ET AL.: "VXLAN: A Framework for Overlaying Virtualized Layer 2 Networks over Layer 3 Networks", 8 May 2013, IETF
Attorney, Agent or Firm:
HEYMAN, Leonard et al. (US)
Download PDF:
Claims:
CLAIMS

We claim:

1. A method of configuring a set of service nodes on a host computer to provide a set of attribute-based services to data compute nodes (DCNs) on the host computer, the method comprising:

collecting a first set of attributes associated with service rules processed by the set of service nodes on the host computer;

collecting a second set of attributes associated with at least one data message flow of a DCN;

comparing the first and second sets of attributes to generate a service tag representing a subset of the first set of attributes associated with the data message flow; and

associating the service tag with the data message flow for the set of service nodes to use when processing the service rules for data messages of the data message flow.

2. The method of claim 1, wherein the subset of the first set of attributes represented by the service tag are a subset of the second set of attributes of the DCN data message flow that is part of the first set of attributes of the processed rules.

3. The method of claim 1, wherein

the first set of attributes comprise group identifiers,

comparing the first and second sets of attributes comprises comparing the second set of attributes with definitions of the group identifiers to identify one or more group identifiers associated with the second set of attributes;

the subset of first attributes represented by the service tag comprises the identified group identifiers.

4. The method of claim 1, wherein associating the service tag comprises providing the service tag to the DCN for the DCN to forward with at least one data message of the data message flow.

5. The method of claim 1, wherein associating the service tag comprises providing the service tag to the DCN for the DCN to forward through an out-of-band communication to an attribute-resolving engine executing on the host computer.

6. The method of claim 5, wherein the attribute-resolving engine is part of a service node.

7. The method of claim 5, wherein the attribute resolving engine is not part of a service node.

8. The method of claim 1 further comprising: supplying the service tag and the subset of attributes to an attribute-resolving engine that provides the subset of attributes to at least one service node when this service node (1) processes a data message of the data message flow and (2) provides the service tag to the attribute-resolving engine, the provided subset of attributes allowing the service node to process service rules that are defined in terms of one or more attributes of the subset of attributes.

9. The method of claim 8, wherein

each service rule comprises a rule identifier for matching with data message identifiers,

the first set of attributes are attributes used in the rule identifiers of the service rules of the set of service nodes,

the set of service nodes is a set of one or more service nodes for processing data messages associated with the DCN, and

the first and second set of attributes comprise L7 parameters associated with the data messages.

10. The method of claim 9, wherein the first and second set of attributes do not include L2-L4 parameters.

11. The method of claim 9, wherein the service operations comprise at least one of firewall operations, load balancing operations, intrusion detection operations, intrusion prevention operations, and encryption operations.

12. The method of claim 1, wherein collecting the second set of attributes comprises receiving the second set of attributes from an introspection agent operating within the DCN.

13. The method of claim 1, wherein the second set of attributes is associated with more than one data message flow, and the service tag is stored by the DCN for associating with more than one data message flow.

14. The method of claim 1 further comprising:

storing in the service tag an identifier that identifies the host computer; and transmitting a definition of the service tag with the subset of attributes and the host- computer identifier to other host computers for service nodes on other host computers to use to perform services on data messages sent by the DCN with the service tag.

15. The method of claim 1, wherein collecting the first set of attributes comprises collecting the first set of attributes from the set of service nodes.

16. A method of performing a service at a host computer that executes data compute nodes (DCNs), the method comprising: receiving a data message;

identifying a service tag for the received data message;

identifying a set of attributes associated with the service tag;

using the identified set of attributes to identify a service rule;

performing a service on the data message based on the identified service rule.

17. The method of claim 16, wherein identifying the set of attributes comprises using the service tag to identify a record in a storage that stores a plurality of service tags and a set of attributes for each service tag.

18. The method of claim 17, wherein using the identified attribute set comprises using the attribute set to identify a record in a storage that stores a plurality of attribute sets and a service action parameter set for each attribute set.

19. The method of claim 16, wherein identifying the service tag comprises using a set of identifiers of the data message to identify the service tag in a storage that stores a plurality of service tags for a plurality of data message identifiers.

20. The method of claim 19 further comprising:

receiving a record that identifies the service tag for a message identifier of the data message before identifying the service tag for the received data message.

21. The method of claim 20, wherein receiving the record comprises receiving the record from a guest introspection agent of a DCN that sends the data message.

22. The method of claim 16, wherein identifying the service tag comprises extracting the service tag from a header of the data message.

23. The method of claim 22, wherein the header is a tunnel header.

Description:
PERFORMING CONTEXT-RICH

ATTRIBUTE-BASED SERVICES ON A HOST

Arnold Poon, Laxmikant Gunda, Jayant Jain, Anirban Sengupta, Sachin M. Vaidya

BACKGROUND

[0001] Middlebox services have historically been hardware appliances that are implemented at one or more points in a network topology in an enterprise or a datacenter. With the advent of software defined networking (SDN) and network virtualization, traditional hardware appliances do not take advantage of the flexibility and control that is provided by SDN and network virtualization. Accordingly, in recent years, some have suggested various ways to provide middlebox services on hosts. Most of these middlebox solutions, however, do not take advantage of the rich-contextual data that can be captured for each data message flow on the host. One reason for this is that existing techniques do not provide an efficient, distributed scheme for filtering the thousands of captured-contextual attributes in order to efficiently process service rules that are defined in terms of much smaller sets of contextual attributes.

BRIEF SUMMARY

[0003] Some embodiments provide a novel method for configuring a set of one or more service nodes on a host to perform context-rich, attribute-based services on the host computer, which executes several data compute nodes (DCNs) in addition to the set of service nodes. The method uses a context-filtering node on the host to collect a first set of attributes associated with service rules processed by the set of service nodes on the host computer. The context filter also collects a second set of attributes associated with at least one data message flow of a DCN (e.g., of a virtual machine (VM) or container) executing on the host. In some embodiments, the first and second sets of attributes are not L2-L4 header parameters. For instance, in some embodiments, these sets of attributes include L7 parameters or group identifiers (e.g., network security group identifiers). Also, in some embodiments, each service rule includes a rule identifier for matching with data message identifiers, and the first set of attributes are attributes used in the rule identifiers of the service rules of the set of service nodes.

[0004] After collecting the first and second sets of attributes, the context filtering node on the host compares the first and second sets of attributes to generate a service tag to represent a subset of the first set of attributes associated with the data message flow. The method associates this service tag with the data message flow. This service tag can then be used to identify the subset of attributes associated with the data message flow when a service node needs to process its attribute-based service rules for the data message flow.

[0005] In some embodiments, the context filter provides the service tag associated with the data message flow to the DCN and to an attribute-resolving engine. When sending a data message for this flow, the DCN forwards the data message's service tag along in-band (i.e., with the data message flow) or out-of-band (i.e., separately from the data message flow), so that a service node can use this service tag to process its attribute-based service rules.

[0006] To process its attribute-based service rules, a service node provides the data message flow's service tag to the attribute-resolving engine. From the context-filtering engine, the attribute-resolving engine receives the service tag with the subset of attributes represented by this tag, and stores this information in a mapping data structure (e.g., mapping table) that associates each service tag with its corresponding attribute subset. The attribute-resolving engine is part of a service node in some embodiments, while it is separate from all service nodes in other embodiments.

[0007] When the attribute-resolving engine receives the service tag from the service node, this engine finds this service tag in its mapping data structure, retrieves the tag's associated attribute subset from the mapping structure, and returns the attribute subset to the service node. The service node then uses the returned attribute subset to identify a service rule that matches this subset and then performs a service operation based on the action parameters specified by this service rule. Examples of such service operations include firewall operations, load balancing operations, intrusion detection operations, intrusion prevention operations, encryption operations, and other types of middlebox operations.

[0008] As mentioned above, the first and second sets of attributes collected by the context- filtering node in some embodiments are L7 parameters and/or group identifiers. For instance, in some embodiments, a guest introspection (GI) agent runs on a DCN. Upon occurrence of an event (e.g., a login event or the start of a connection session), the guest introspection agent collects a number of L7 parameters relating to the event. Examples of these parameters include login identifiers, group identifiers to which subscribers (e.g., active directory SIDs) or applications belong, location, device information, access levels, etc. In some embodiments, the information collected by the GI relates to one data message flow or connection session, while in other embodiments, the collected information relates to more than one data message flow or connection session (e.g., relates to all data message flows after a login event).

[0009] The guest introspection agent passes this collected information to the context-filtering node. This information serves as the second set of attributes mentioned above. In some embodiments, the first set of attributes are used in the rule identifiers of the set of service nodes and the context-filtering node collects these attributes from the service node set. In some embodiments, the first and second sets of attributes are identical types of attributes and the context-filtering node just identifies the subset of the second set of attributes that are part of the first set of attributes, in order to identify the service tag for a data message flow. Conjunctively, or alternatively, the first set of attributes in some embodiments are group identifiers, and the context-filtering node uses the second set of attributes to identify one or more group identifiers associated with the data message flow. The context-filtering node then associates the identified group identifiers with the service tag for the data message flow.

[0010] When the DCN embeds the service tag in-band within the data message flow, the service tag can be used to perform service operations on other appliances and/or host computers (e.g., on host computers other than the host computer on which the DCN executes). In such cases, an attribute-resolving agent on the other appliance and/or host computer would translate the service tag to a set of attributes, which are then used to identify one or more service rules to process for the data message flow. [0011] To facilitate the processing of attribute-based service rules at the other appliance and/or host computer, the host computer that creates the service tag definition in some embodiments sends this definition to the other appliance/host computer. In some of these embodiments, a module (e.g., a context filter or an attribute-resolving agent) of the other device (i.e., other appliance or host computer) receives or retrieves the service tag definitions from the context-filtering agent of the host computer that generated the service tag definition. In some of these embodiments, this communication is a control channel communication between these two devices.

[0012] As mentioned above, the service tag definitions identify a set of attributes associated with each service tag in some embodiments. In some embodiments, each service tag's definition also includes a unique identifier (e.g., a host identifier). This unique identifier allows the attribute-resolving agent of the device that receives the service tag definition from multiple host computers to map the service tags to attributes correctly, because different host computers may use the same service tag and the attribute-resolving agent would need the unique identifiers embedded in the service tags to differentiate two similar service tags.

[0013] The preceding Summary is intended to serve as a brief introduction to some embodiments of the invention. It is not meant to be an introduction or overview of all inventive subject matter disclosed in this document. The Detailed Description that follows and the Drawings that are referred to in the Detailed Description will further describe the embodiments described in the Summary as well as other embodiments. Accordingly, to understand all the embodiments described by this document, a full review of the Summary, Detailed Description, the Drawings and the Claims is needed. Moreover, the claimed subject matters are not to be limited by the illustrative details in the Summary, Detailed Description and the Drawing.

BRIEF DESCRIPTION OF THE DRAWINGS

[0014] The novel features of the invention are set forth in the appended claims. However, for purposes of explanation, several embodiments of the invention are set forth in the following figures.

[0015] Figure 1 illustrates a host computer that in some embodiments is used to establish a distributed architecture for configuring and performing context-rich, attribute-based services in a datacenter.

[0016] Figure 2 illustrates a more-detailed example of a host computer that in some embodiments is used to establish a distributed architecture for configuring and performing context-rich, attributed-based services in a datacenter.

[0017] Figure 3 conceptually illustrates a filtering process that the context filter performs whenever it receives a set of attributes from a GI agent.

[0018] Figure 4 conceptually illustrates a process that the service engine performs when a port calls the engine for a received message associated with a VM connected to the port.

[0019] Figure 5 presents a process that conceptually illustrates a set of operations performed on a source VM's host computer and a destination VM's host computer to perform service operations on the destination VM's host computer based on a service tag defined on the source VM's host computer.

[0020] Figure 6 illustrates an example of how the hypervisor service engines are managed in some embodiments.

[0021] Figure 7 conceptually illustrates a computer system with which some embodiments of the invention are implemented.

DETAILED DESCRIPTION

[0022] In the following detailed description of the invention, numerous details, examples, and embodiments of the invention are set forth and described. However, it will be clear and apparent to one skilled in the art that the invention is not limited to the embodiments set forth and that the invention may be practiced without some of the specific details and examples discussed.

[0023] Some embodiments provide a novel distributed architecture for configuring and performing context-rich, attribute-based services in a datacenter with several host computers that execute data computer nodes (DCNs) and service nodes. This distributed architecture allows the hosts to perform context-rich, attribute-based services efficiently on data messages transmitted by and/or to the data compute nodes executing on the hosts. This architecture can efficiently process service rules that are defined by reference to a wide range of contextual attributes.

[0024] As used in this document, data messages refer to a collection of bits in a particular format sent across a network. One of ordinary skill in the art will recognize that the term data message may be used herein to refer to various formatted collections of bits that may be sent across a network, such as Ethernet frames, IP packets, TCP segments, UDP datagrams, etc. Also, as used in this document, references to L2, L3, L4, and L7 layers (or layer 2, layer 3, layer 4, layer 7) are references respectively to the second data link layer, the third network layer, the fourth transport layer, and the seventh application layer of the OSI (Open System Interconnection) layer model.

[0025] Figure 1 illustrates a host computer 100 that in some embodiments is used to establish a distributed architecture for configuring and performing context-rich, attributed- based services in a datacenter. As shown, host computer 100 includes several DCNs 105, a context filter module 110, several service nodes 115, and an attribute-resolving engine 120. It also includes an attribute-based, service-rule storage 125, a service-tag definition storage 130, and a mapping storage 135.

[0026] The context filter 110 on the host 100 collects a set of attributes associated with service rules processed by the set of service nodes on the host computer, in order to identify the attributes that are used to define the service rules, which it will then use to filter out the unnecessary attributes collected from the DCNs. In some embodiments, the context filter 110 interacts with the service node 115 through a policy engine (not shown). This policy engine serves as an interface between the service nodes 115 and the context filter 110. [0027] In some embodiments, the collected set of attributes may include L2-L4 header parameters, but also include attributes that are not L2-L4 header parameters. For instance, in some embodiments, these sets of attributes include L7 parameters or group identifiers (e.g., network security group identifiers). Also, in some embodiments, each service rule includes a rule identifier for matching with data message identifiers, and the collected set of attributes are attributes used in the rule identifiers of the service rules of the set of service nodes. The service nodes are service modules (also called service engines below) that execute on the host 100.

[0028] In some embodiments, the context filter 110 collects the set of attributes by interacting with the service nodes through a service node interface (e.g., a set of APIs, not shown). In these embodiments, the service nodes retrieve these attributes from the service rule storage 125, and provide them to the context filter. In other embodiments, the context filter 100 collects the set of service-rule attributes by directly accessing the attribute-based, service-rule storage 125, or accessing these rules through another module.

[0029] In addition to collecting service rule attributes, the context filter 110 also collects attribute sets from the DCNs 105 that execute on host 100. The DCNs are virtual machines in some embodiments, containers in other embodiments, and VMs and containers in yet other embodiments. In some embodiments, each time a DCN starts a new data message flow, or each time the DCN is about to start a set of new data message flows (e.g., upon occurrence of an event, such as a login event), the DCN 105 provides a set of attributes to the context filter 110. In some embodiments, a guest-introspection agent on the DCN 105 detects a new event on the DCN, captures a set of attributes associated with that event, and sends this set of attributes to the context filter. Like the attributes collected from the service rules, the attributes collected from the DCNs in some embodiments may include L2-L4 header parameters, but also include attributes that are not L2-L4 header parameters (such as L7 parameters or group identifiers). In other embodiments, the attributes collected from the DCNs do not include L2-L4 header parameters.

[0030] After collecting a set of attributes from a DCN 105, the context filtering node on the host compares this set of attributes with the set of attributes collected from the service rules to generate a service tag to represent a subset of the DCN-collected attribute set that is used in one or more service rules (i.e., that are within the attribute set collected from the service rules). The context filter then returns the service tag to the DCN 105 from which it received the attribute set, so that the DCN can associate this service tag for the data message flow or flows for which it collected the attribute set. The DCN (e.g., the GI agent of the DCN) in some embodiments stores the service tag in a cache storage (not shown) of the DCN, so that it can use the service tag for the data message flow or flows subsequently without having to re-access the context filter 110.

[0031] The service tag can be used to identify the subset of attributes associated with a data message flow when a service node needs to process its attribute-based service rules for the data message flow that is associated (e.g., by the DCN) with the service tag. For each service tag, the context filter 110 in some embodiments generates a service tag definition that identifies the service tag and the subset of attributes associated with the service tag. In some embodiments, the context filter 110 provides the generated service tag's definition to the attribute-resolving engine 120, and the attribute-resolving engine 120 stores this definition in a mapping data storage 135 (e.g., with database with one or more mapping tables) that associates each service tag with its corresponding attribute subset.

[0032] When sending a data message for this flow, the DCN 105 forwards the data message's service tag along in-band (i.e., with the data message flow) or out-of-band (i.e., separately from the data message flow), so that a service node can use this service tag to process its attribute-based service rules. To process its attribute-based service rules, a service node 115 provides the data message flow's service tag to the attribute-resolving engine 120. The attribute-resolving engine 120 is part of a service node in some embodiments, while it is separate from all service nodes in other embodiments. In still other embodiments, each service node 115 has its own attribute-resolving engine 120.

[0033] When the attribute-resolving engine 120 receives the service tag from a service node 115, the attribute-resolving engine finds this service tag in its mapping data storage 135, retrieves the tag's associated attribute subset from the mapping storage, and returns the attribute subset to the service node 115. The service node then uses the returned attribute subset to identify a service rule that matches this subset and then performs a service operation based on the action parameters specified by this service rule. Examples of such service operations include firewall operations, load balancing operations, intrusion detection operations, intrusion prevention operations, encryption operations, and other types of middlebox operations.

[0034] Figure 2 illustrates a more-detailed example of a host computer 200 that in some embodiments is used to establish a distributed architecture for configuring and performing context-rich, attribute-based services in a datacenter. This host computer 200 includes many of the same components as host computer 100, such as context filters 110, service nodes 115, attribute resolving engine 120, attribute-based service rule storage 125, service tag definition storage 130, and mapping storage 135. Also, on host 200, the DCNs 105 are VMs 205.

[0035] In addition to these components, the host computer 200 includes a software forwarding element 210, a connection state data storage 225, and a service engine interface 227. In some embodiments, the software forwarding element 210, the service engine 115, the rule data storage 125, the connection state data storage 225, attribute resolving engine 120, the mapping storage 135, and the service engine interface 227 are in the kernel space of the hypervisor, while the VMs 205, context filter 110, and the service tag definition storage 130 are in the hypervisor' s user space. In other embodiments, the context filter 110 and the service tag definition storage 130 are also part of the hypervisor kernel space. In some embodiments, one or more service nodes are kernel space modules, while one or more other service nodes are user space modules (e.g., are service VMs).

[0036] The VMs 205 are virtual machines executing on top of the hypervisor (not shown) that executes on the host. Examples of such machines include webservers, application servers, database servers, etc. In some cases, all the VMs belong to one entity, e.g., an enterprise that operates the host. In other cases, the host executes in a multi-tenant environment (e.g., in a multi -tenant data center), and different VMs may belong to one tenant or to multiple tenants.

[0037] Each VM 205 includes a guest introspection (GI) agent 215 that interacts with the context filter 110 to provide attribute sets to this filter, and to receive service tags for these attribute sets. As mentioned above, the sets of attributes collected by the context-filtering node from the service rules and the DCNs in some embodiments are L7 parameters and/or group identifiers. Upon occurrence of an event (e.g., a login event or the start of a connection session) on a VM, the GI agent of the VM in some embodiments collects a number of L7 parameters relating to the event. Examples of these parameters include login identifiers, group identifiers to which subscribers (e.g., active directory SIDs) or applications belong, location, device information, access levels, etc. In some embodiments, the information collected by the GI relates to one data message flow or connection session, while in other embodiments, the collected information relates to more than one data message flow or connection session (e.g., relates to all data message flows after a login event).

[0038] The GI agent 215 passes this collected set of attributes for the event to the context filter 110. The context filter 110 then compares these attributes to the attributes used in the service rule identifiers of the service rules in the service rule storage 125 to identify the subset of the GI provided attribute set that is used in the service rules. If it has not created a service tag for this identified subset of attributes, the context filter 110 then creates a service tag for this subset, and stores this service tag's definition in the service tag definition storage 130. The context filter then provides the service tag associated with the identified subset of attributes to the GI that provides the attribute set, so that the GI can associate its VM's transmitted data messages in-band or out-of-band with the service tag, which, in turn, allows one or more service nodes to subsequently use the service tag to identify the service operations that are needed for the data messages.

[0039] In some embodiments, the sets of attributes that the context filter collects from the service rules and from the GI agents are identical types of attributes and the context-filtering node just identifies the subset of the Gl-collected attributes that are part of the service rule identifier attributes, in order to identify the service tag for a data message flow. Conjunctively, or alternatively, the service rule identifier attributes in some embodiments can be group identifiers, and the context filter 110 uses the Gl-collected attributes to identify one or more group identifiers associated with a data message flow. The context filter 110 then associates the identified group identifiers with the service tag for the data message flow.

[0040] As shown, each VM 205 also includes a virtual network interface card (VNIC) 255 in some embodiments. Each VNIC is responsible for exchanging messages between its VM and the software forwarding element 210. Each VNIC connects to a particular port of the software forwarding element 210. The software forwarding element 210 also connects to a physical network interface card (NIC) (not shown) of the host. In some embodiments, the VNICs are software abstractions of a physical NIC (PNIC) that are implemented by the virtualization software (e.g., by the hypervisor).

[0041] In some embodiments, the software forwarding element (SFE) maintains a single port 260 for each VNIC of each VM. The software forwarding element 210 connects to a physical NIC (through a NIC driver (not shown)) to send outgoing messages and to receive incoming messages. In some embodiments, the software forwarding element 210 is defined to include a port 265 that connects to the PNIC's driver to send and receive messages to and from the PNIC.

[0042] The software forwarding element 210 performs message-processing operations to forward messages that it receives on one of its ports to another one of its ports. For example, in some embodiments, the software forwarding element tries to use data in the message (e.g., data in the message header) to match a message to flow based rules, and upon finding a match, to perform the action specified by the matching rule (e.g., to hand the message to one of its ports 260 or 265, which directs the message to be supplied to a destination VM or to the PNIC).

[0043] In some embodiments, the software forwarding element 210 is a software switch, while in other embodiments it is a software router or a combined software switch/router. The software forwarding element 210 in some embodiments implements one or more logical forwarding elements (e.g., logical switches or logical routers) with software forwarding elements executing on other hosts in a multi-host environment. A logical forwarding element in some embodiments can span multiple hosts to connect VMs that execute on different hosts but belong to one logical network.

[0044] Different logical forwarding elements can be defined to specify different logical networks for different users, and each logical forwarding element can be defined by multiple software forwarding elements on multiple hosts. Each logical forwarding element isolates the traffic of the VMs of one logical network from the VMs of another logical network that is serviced by another logical forwarding element. A logical forwarding element can connect VMs executing on the same host and/or different hosts. In some embodiments, the SFE extracts from a data message a logical network identifier (e.g., a VNI) and a MAC address. The SFE in these embodiments uses the extracted VNI to identify a logical port group, and then uses the MAC address to identify a port within the port group.

[0045] Software switches (e.g., software switches of hypervisors) are sometimes referred to as virtual switches because they operate in software and they provide the VMs with shared access to the PNIC(s) of the host. However, in this document, software switches are referred to as physical switches because they are items in the physical world. This terminology also differentiates software switches from logical switches, which are abstractions of the types of connections that are provided by the software switches. There are various mechanisms for creating logical switches from software switches. VXLAN provides one manner for creating such logical switches. The VXLAN standard is described in Mahalingam, Mallik; Dutt, Dinesh G.; et al. (2013-05-08), VXLAN: A Framework for Overlaying Virtualized Layer 2 Networks over Layer 3 Networks, IETF.

[0046] The ports of the software forwarding element 210 in some embodiments include one or more function calls to one or more modules that implement special input/output (I/O) operations on incoming and outgoing messages that are received at the ports. Examples of I/O operations that are implemented by the ports 260 include ARP broadcast suppression operations and DHCP broadcast suppression operations, as described in U.S. Patent Application 14/070,360. Other I/O operations (such as firewall operations, load balancing operations, network address translation operations, etc.) can be so implemented in some embodiments of the invention. By implementing a stack of such function calls, the ports can implement a chain of I/O operations on incoming and/or outgoing messages in some embodiments. Also, in some embodiments, other modules in the data path (such as the VNICs, etc.) implement the I/O function call operations (such as the firewall function calls), instead of the ports 260.

[0047] In some embodiments, one or more of function calls of the SFE ports 260 can be to one or more service nodes 115 that process attribute-based service rules in the attribute-based service rule storage 125. Each service node 115 in some embodiments has its own attribute- based service rule storage 125. To perform its attribute-based service operation for a data message flow, a service engine 115 in some embodiments has the attribute resolving engine 120 first convert the service tag associated with the data message flow to a subset of attributes, and then uses this subset of attributes to identify a service rule in its service rule data storage 125 for it to process.

[0048] Specifically, for a service engine 115 to perform its service check operation, the forwarding element port 260 that calls the service engine supplies a set of attributes of a message that the port receives. In some embodiments, the set of message attributes are message identifiers, such as traditional five tuple identifiers, which include the message's source identifier, destination identifier, source port, destination port, and protocol. In some embodiments, one or more of the identifier values can be logical values that are defined for a logical network (e.g., can be IP addresses defined in a logical address space). In other embodiments, all of the identifier values are defined in the physical domains. In still other embodiments, some of the identifier values are defined in the logical domain, while other identifier values are defined in the physical domain.

[0049] When the VMs send the service tags in-band, the set of attributes supplied to the service node by the SFE port also includes in some embodiments the service tag for the data message flow. Before supplying these identifiers to the service engine, the SFE port extracts these identifiers from a message that it receives. When the VMs send the service tags in- band, the SFE port 260 in some embodiments has one of the I/O modules that it calls extract the service tag from the data message flow, and provides this service tag to the service engine 115.

[0050] In other embodiments, the VMs do not send the service tags in-band with the data message flow, but send the service tags out-of-band to one or more modules executing on the host 200. In some of these embodiments, the SFE port does not provide the service tag to the service node 115 that it calls. Instead, the service node 115 identifies the service tag for the data message flow (1) from the message identifiers (e.g., five-tuple identifiers) that it receives from the SFE port, and (2) from a record that the service node receives out-of-band from the VM that identifies the service tag for the message identifier.

[0051] In other embodiments, the service node 115 provides the message identifiers (e.g., five-tuple identifiers) that it receives from the SFE port to another module, which then identifies the service tag for the data message flow (1) from the message identifiers (e.g., five-tuple identifiers) that it receives from the service node, and (2) from a record that this module receives out-of-band from the VM that identifies the service tag for the message identifier. In some embodiments, this module is the attribute-resolving engine 120. Hence, in some embodiments, the service node provides a set of identifiers (e.g., five-tuple identifier) for a message to the attribute-resolving engine 120, and this engine first maps this identifier set to a service tag, and then maps this service tag to a subset of attributes that it returns to the service node. In other embodiments, the service node either identifies the service tag for a data message itself or obtains it from another module, and then provides this service tag to the attribute-resolving engine 120, in order to obtain the subset of attributes for a data message flow.

[0052] After receiving the subset of attributes for a data message flow from the attribute resolving engine 120, the service engine 115 in some embodiments performs its service operation based on service rules that are stored in the service rule storage 125. To perform its service operation, the service engine 115 matches the received attribute subset with corresponding attribute sets that are stored for the service rules. In some embodiments, the service engine 115 receives the service rules that it stores in its storage 125 through the service engine interface (SEI) 227 from one or more controllers, as further described below.

[0053] In some embodiments, each service rule in the data storage 125 has a rule identifier and an action parameter set. As mentioned above, the rule identifier of a service rule in some embodiments can be defined in terms of one or more parameters that are not L2-L4 header parameters (e.g., are L7 parameters). In some embodiments, a rule identifier can also include L2-L4 header parameters. Also, in some embodiments, one or more parameters in a rule identifier can be specified in terms of an individual value or a wildcard value. Also, in some embodiments, a rule identifier can include a set of individual values or a group identifier, such as a security group identifier, a compute construct identifier, a network construct identifier, etc. [0054] To match a received attribute subset with the rules, the service engine compares the received attribute subset with the associated identifiers of the service rules stored in the service rule data storage 125. Upon identifying a matching rule, the service engine 115 performs a service operation (e.g., a firewall operation, a load balancing operation, an encryption operation, other middlebox operation, etc.), based on the action parameter (e.g., based on Allow/Drop parameter, the load balancing criteria, encryption parameters, etc.) of the matching rule

[0055] In some embodiments, the service rule data storage 125 is defined in a hierarchical manner to ensure that a message rule check will match a higher priority rule before matching a lower priority rule, when the message's attribute subset matches multiple rules. Also, in some embodiments, the service rule data storage 125 contains a default rule that specifies a default action for any message rule check that cannot identify any other service rules; this default rule will be a match for all possible attribute subsets in some embodiments, and ensures that the service rule engine will return an action for all received attribute subsets. In some embodiments, the default rule will specify no service.

[0056] Multiple messages can have the same message identifier attribute sets, e.g., when the messages are part of one flow that is associated with one communication session between two machines. Accordingly, after matching a data message with a service rule in the storage 125 based on an attribute subset that the attribute-resolving engine 120 provided for a service tag associated with a first data message of a data message flow, the service engine of some embodiments stores the service rule (or a reference to the service rule) in the connection state data storage 225, so that it can later use this service rule for subsequent data messages of the same flow.

[0057] In some embodiments, the connection state data storage 225 stores the service rule, or a reference to the service rule, that the service engine 115 identifies for different message identifier sets (e.g., for different five-tuple identifiers that identify different data message flows). In some embodiments, the connection state data storage 225 stores each service rule, or reference to the service rule, with an identifier (e.g., a five-tuple identifier and/or a hash value) that is generated from the matching message identifier set. Before checking with the service rule data storage 125 for a particular message identifier set, the service rule engine 115 of some embodiments checks the connection state data storage 225 to determine whether this storage has a cached service for this message identifier set. If not, the service rule engine then directs the attribute-resolving engine 120 to map the message flow's service tag to an attribute subset, and then checks the service rule data storage 125 for a service rule that matches the identified attribute subset. When the connection state data storage has an entry for the particular message identifier set, the service engine performs its service operation based on this service rule's action parameter set.

[0058] Figure 3 conceptually illustrates a filtering process 300 that the context filter 110 performs whenever it receives a set of attributes from a GI agent 215. This process identifies the subset of attributes that are relevant for the service rules processed by the attribute-based service engine(s), generates a service tag for the identified attribute subset when necessary, and provides this service tag to the GI agent 215. As shown, the process 300 initially receives (at 305) a set of attributes from the GI agent 215.

[0059] Next, at 310, the process determines whether it has already processed this same exact set of attributes. To make this determination, the process stores in some embodiments each set of attributes that it processes in a cache storage, along with the service tag that the process identifies for each attribute set. At 310, the process 300 in these embodiments examines this cache storage to determine whether it has processed the attribute set received at 305. When the process 300 determines (at 310) that it has previously processed the attribute set received at 310, it provides (at 340) to the GI agent 215 the service tag that the process 300 previously generated for this attribute set (e.g., retrieves this service tag from its cache storage and provides this service tag to the GI agent 215). After 340, the process ends.

[0060] When the process determines (at 310) that it has not previously processed the received attribute set, the process selects (at 315) an attribute in this received set, and determines (at 320) whether this selected attribute is in the set of attributes that are currently used in the service rules of the attribute-based service engines 115. In some embodiments, the context filter collects the attributes used in the service rule identifiers from the service engines before it performs the process 300. Also, in some embodiments, as each service rules are deleted, added or modified, the context filter modifies the set of attributes collected from the service rules to reflect these changes. In some of these embodiments, the process adjusts the service tag definitions to account for these service rule changes, and adjusts these definitions in its own service tag definition storage 130 and provides these new definitions to the attribute- resolving engine 120 so that this engine can store these new definitions in its storage 135.

[0061] When the process determines (at 320) that the selected attribute is in the attribute set collected from the service rules, the process adds (at 325) this attribute to the subset of attributes that it creates for the GI agent, and then transitions to 330. The process also transitions to 330 from 320, when it determines that the selected attribute is not in the attribute set collected from the service rules. By transitioning from 320 to 330 without adding the selected attribute to the defined attribute subset, the process effectively filters out the selected attribute from the service tag definition.

[0062] At 330, the process determines whether it has examined all the attributes in the attribute set received from the GI agent. If not, the process transitions back to 315 to select another attribute in the received attribute set, and then repeats operation 320 to either add this selected attribute to the attribute subset (at 325), or to filter out this selected attribute. When the process determines (at 330) that it has examined all the attributes in the received attribute set, the process determines (at 335) whether it has previously created a service tag for the attribute subset that it has just generated by examining each attribute in the received attribute set.

[0063] To make this determination (at 335), the process 300 in some embodiments examines the service tag storage 130 to determine whether any service tag in this storage has been created for the exact attribute subset that the process created in this iteration. When the process 300 determines (at 335) that it has previously generated a service tag for the exact same attribute subset that it just generated, it provides (at 340) to the GI agent 215 the service tag that it has stored in its service tag storage 130 for this attribute subset. After 340, the process ends. It should be noted that in some embodiments, the process 300 does not perform the check at 310, but performs the check at 335, while in other embodiments, the process does not perform the check at 335, but performs the check at 310.

[0064] When the process 300 determines (at 335) that it has not previously generated a service tag for the exact same attribute subset that it just generated, it (at 345) creates a service tag for this attribute subset, and stores this service tag's definition (which includes the service tag's identifier and its associated attribute subset) in the service tag definition storage 130. In some embodiments, the service tag definition also includes a host identifier that identifies the host on which the service tag was generated. As further described below, this host identifier can be used by attribute-resolving engines to differentiate identical service tags that are generated by different context filters 110 on different hosts 200.

[0065] At 345, the process also provides the service tag to the GI agent 215 that provided the attribute set at 305. Also, the process provides (at 345) a service tag definition (which includes the service tag's identifier, its associated attribute subset, and in some cases, the host identifier) to the attribute-resolving engine 120, so that it can store this definition in its mapping storage 135. After 345, the process ends. While Figure 3 illustrates that a service tag is created for the attribute set received from a GI agent 215, one of ordinary skill will realize that in some embodiments no service tag is generated when none of the supplied attributes are used in any of the service rules of the service node(s) of the host.

[0066] Figure 4 conceptually illustrates a process 400 that the service engine 115 performs when a port 260 calls the engine 115 for a received message associated with a VM 205 connected to the port 260. The received message in some embodiments can be a message sent by the port's VM. As shown, the process 400 initially receives (at 405) a set of identifiers (e.g., a set of L2, L3 and/or L4 header values) of the data message. In some embodiments, the received message identifiers are extracted from the message by the port or by a I/O module called by the port. For instance, in some embodiments, the received attributes set includes the message's extracted five tuples.

[0067] Next, at 410, the process 400 determines whether it has an entry for the received message identifier set in its connection state data storage 225. This is because the service engine may have previously performed a service rule check for another message with an identical set of attributes, and cached the result of this check in the connection state data storage 225. In some embodiments, the process examines the connection state data storage 225 for a record that has a record identifier that matches the received identifier set or a hash of the received identifier set.

[0068] When the process identifies (at 410) an entry in the connection state data storage 225 that matches the received message identifier set, the process retrieves (at 415) a service action parameter set that is stored in the identified entry of the connection state data storage 225. At 415, the process then performs the service action on the data message based on the retrieved service action parameter set. After 415, the process 400 ends.

[0069] On the other hand, when the process cannot identify (at 410) an entry in the connection state data storage 225, the process directs (at 420) the attribute-resolving engine 120 to identify the attribute subset for the data message from the service tag associated with the data message. To do this, the service tag associated with the received data message has to first be identified.

[0070] The service tag is identified differently in different embodiments. In some embodiments, the service engine receives the service tag for the data message from the SFE port that calls it (e.g., after this port extracts or has another module extract the service tag from the data message header or associated control messaging (e.g., SYN message)). In other embodiments, the service engine 115 provides the message identifier set (received at 405) to the attribute-resolving engine 120, and has this engine 120 identify the service tag before identifying the attribute subset based on this service tag. In some of these embodiments, the attribute resolving engine receives the service tag for the data message (e.g., receives a record that identifies the service tag for the data message's five tuple identifier) through an out-of- band communication, and stores this service tag and the data message's identifier in a storage, which the engine 120 later accesses when it receives the service tag from the service engine 115. In still other embodiments, the source VM's GI agent 215 provides the service tag and its associated data message identifier(s) to the service engine, which stores this tag and its associated message identifier(s) to use (at 420) subsequently to identify a service tag for a received data message.

[0071] At 425, the process 400 receives from the attribute-resolving engine the attribute subset associated with the message's service tag. The attribute-resolving engine 120 identifies the attribute subset by identifying a record in its mapping storage 135 that has the service tag in its record identifier, and retrieving the attribute subset from this matching record.

[0072] At 430, the process identifies a service rule in the service data storage 125 that matches the received attribute subset. To identify the service rule, the process 400 searches the service rule data storage 125 to identify the entry that has an attribute subset that matches the attribute subset received from the attribute-resolving engine. In some embodiments, the service rule data storage 125 is defined in a hierarchical manner to ensure that message rule check will match a higher priority rule before matching a lower priority rule, when the message's attribute subset matches multiple rules.

[0073] After identifying the matching service rule, the process 400 retrieves (at 435) a service action parameter set that is stored in the identified entry of the service rule data storage 125, and performs a service on the data message based on the retrieved service action parameter set. Examples of such actions include firewall operations, load balancing operations, encryption operations, other middlebox operations, etc.

[0074] After the service engine performs the service operation, the service engine provides (at 435) the processed data message to the SFE port when service operation does not result in the dropping of the data message. The SFE port can then call another service engine or provide this message to the SFE to forward the data message to the data message's destination. When the service operation requires the data message to be dropped (e.g., a firewall operation requires that the data message to be dropped) or re-directed, the service engine informs the SFE 210 that the data message should be dropped or re-directed. After 435, the process creates (440) an entry in the connection state data storage 225 for the received attribute set and stores the identified service action in this entry. The process ends after 440.

[0075] When a DCN (e.g., VM) embeds the service tag in-band within the data message flow, the service tag can be used to perform service operations on other appliances and/or host computers (e.g., on host computers other than the host computer on which the DCN executes). In such cases, an attribute-resolving agent on the other appliance and/or host computer would translate service tag to an attribute subset, which is then used to identify one or more service rules to process for the data message flow.

[0076] To facilitate the processing of attribute-based service rules at the other appliance and/or host computer, the host computer that creates the service tag definition in some embodiments sends this definition to the other appliance/host computer. In some of these embodiments, a module (e.g., context filter or attribute-resolving engine) of the other device (i.e., other appliance or host computer) receives or retrieves the service tag definitions from the context-filtering agent of the host computer that generated the service tag definition. In some of these embodiments, this communication is a control channel communication between these two devices. In other embodiments, this communication is through a set of controllers (e.g., a set of controllers that configure the devices with service rules).

[0077] As mentioned above, the service tag definitions identify an attribute subset associated with each service tag in some embodiments. In some embodiments, each service tag's definition also includes a unique identifier (e.g., a host identifier) that allows the attribute- resolving agent of the device that receives the service tag definition from multiple host computers to map the service tags to the attributes correctly, because different host computers may use the same service tag and the attribute-resolving agent would need the unique identifiers embedded in the service tags to differentiate two similar service tags.

[0078] Figure 5 presents a process 500 that conceptually illustrates a set of operations performed on a source VM's host computer and a destination VM's host computer to perform service operations on the destination VM's host computer based on a service tag defined on the source VM's host computer. A source VM is a VM that is a source of a data message (i.e., is the VM that sends the data message), while a destination VM is a VM that is a destination of the data message (i.e., is the VM that receives the data message). In the flowchart illustrated in this figure, dashed arrows are shown for operations that might have an arbitrary amount of delay between them.

[0079] As shown, the process 500 starts (at 505) when the context filter 110 of the source VM's host computer generates a service tag for an attribute set provided by the source VM's GI agent 215. One detailed example of how a service tag is created for an attribute set was explained above with reference to Figure 3. The generated service tag in some embodiments includes the host identifier of the source VM's host computer, so that the attribute-resolving engine at the destination VM's host computer can uniquely identify the service tag.

[0080] The context filter 110 provides (at 505) the generated service tag to the GI agent that provided the attribute set, and the GI agent then ensures that the source VM includes this service tag with any new data message flow that relates to the attribute data set that the GI provided initially. In some embodiments, the GI agent 215 embeds the service tag into a SYN packet of a tunnel encapsulation header (e.g., a GENEVE header, VXLAN header, GRE header, etc.). In other embodiments, the GI agent 215 sends the service tag to one or more modules on the host computer along with a set of one or more data message identifiers that relate to a set of one or more data flows related to this service tag. One of these modules then embeds the service tag in a tunnel header of a tunnel that is established between two tunnel endpoint modules on the source and host computers.

[0081] The agent also caches this service tag for any future data flows that relate to the same set of attributes. This optimizes the agent to context filtering node interactions. Some embodiments expire this cache based on a time interval policy. For example, for identity- based firewall services, once a user has logged in, the Subscriber group to which the user belongs typically remains the same and hence the end point need not continue asking for the new service tag. In other cases, it may keep generating context for every new event/connection.

[0082] After generating (at 505) the service tag and providing this tag to the GI agent 215, the context filter distributes (at 510) the service tag to the context filters of other host computers. In some embodiments, the source computer's context filter sends the definition of a service tag, each time it defines a service tag, while in other embodiments, the context filter waits a predetermined time interval to try to collect a batch of service tags to distribute. Also, in some embodiments, the context filters of the host computers directly send the service tag definitions to each other (through intervening network fabric of the datacenter). In other embodiments, the source computer's context filter sends its new service tag definitions to a set of one or more controllers (e.g., a controller that configures the service rules of the service nodes of the host computers), which then distributes the service tag definitions to context filters of other host computers.

[0083] Some embodiments limit the span and scope of the distribution of the service tag, so that the control plane is not bogged down by this traffic. Some of these embodiments limit the distribution's scope by factors such as the span of the logical switch or network, or the presence of address-sets on certain hypervisor. By transporting the service tags, in the overlay header of the underlying flow, some embodiments are able to easily move the full dynamic context around in the datacenter.

[0084] Figure 5 illustrates that after the source computer's context filter 110 distributes (at 510) the service tag definition for the service tag that was generated (at 505), the context filter 110 of the destination VM's host computer receives (at 515) the service tag definition. As mentioned above, the definition of the service tag includes the service tag identifier, a subset of one or more attributes, and host identifier of the host computer at which the service tag was defined. The destination computer's context filter 110 provides the service tag definitions that it receives to its computer's attribute resolving engine 120.

[0085] After service tag is provided (at 505) to the GI agent, the source VM sends (at 520) the service tag along with a data message. For example, in some embodiments, the source VM sends the data message to the destination VM or a module on the destination VM's host computer along a tunnel, and inserts the service tag in the tunnel header. In other embodiments, instead of configuring the source VM to insert the service tag in the tunnel header, the GI agent 215 configures one of the I/O modules (called by the source VM's SFE port or VNIC) to insert this service tag in the tunnel header. In some of these embodiments, this I/O module is the module that performs the tunnel encapsulation operation for the VM. In other words, in these embodiments, the source VM does not establish the tunnel to the destination computer, but rather the source VM's encapsulation I/O module establishes this tunnel.

[0086] Once the data message is sent (at 520) along with the service tag, the destination VM's SFE port receives (at 525) this data message. The destination VM's SFE port then calls (at 530) an I/O module of the destination VM to decapsulate the tunnel header from the received data message. In decapsulating the tunnel header, the I/O module extracts (at 530) the service tag from the header and provides this service tag to the destination VM's SFE port. This port then calls (at 530) a service node 115 to perform a service operation (e.g., a firewall operation, a load balancing operation, an encryption operation, or any other middlebox operation) on the data message based on the extracted service tag. In some embodiments, the destination VM's SFE port also provides other attributes of the received data message (such as the received data message's five tuple identifiers) to the service node 115, which selects its service rule not only based on the service tag but also based on these other attributes. [0087] At 535, the service node then provides the service tag to the attribute-resolving engine 120, which then maps this service tag to an attribute subset based on the mapping records in its mapping data storage 135. This storage stores all the service tag definitions that its computer's context filter generates or receives from context filters of other host computers. When the destination computer's mapping data storage 135 stores two service tags with the same service tag identifier, the attribute-resolving engine uses the host identifier of the received service tag to select the stored service tag that has the same host identifier. After identifying the stored service tag that matches the received data message's service tag, the attribute resolving engine retrieves the attribute subset of the stored service tag, and returns (at 535) this attribute subset to the service node that called it.

[0088] Next, at 540, the service engine of the destination VM's host computer then identifies a service rule in its service rule storage 125 for the data message. To do this, the service engine 115 uses the returned attribute subset to identify the highest priority service rule in the service rule storage 125 that has a rule identifier with a matching subset of attributes. In some embodiments, the service engine 115 not only identifies the matching service rule by using the returned attribute subset, but also uses one or more other identifiers of the received data message (e.g., uses one or more of the received data message's five tuple identifiers).

[0089] Once the service engine 115 identifies the service rule that matches the service tag's associated attribute subset, the destination VM's service engine 115 of some embodiment stores this rule or a reference to this rule in a cache connection state storage 225 (along with the data message's identifier (e.g., five tuple identifier), so that this engine can subsequently access this connection storage for other data messages that are part of the same flow to identify this service rule. As mentioned above, the service engines in some embodiments first check their connection state storage 225 before asking the attribute-resolving engine to map a service tag to an attribute subset when the connection state storage 225 does not store any record for the received data message's flow.

[0090] Also, after identifying (at 540) the matching service rule, the service engine 115 performs a service operation on the received data message based on the service action parameter set of the matching service rule. Examples of such service operations include firewall operations, load balancing operations, intrusion detection operations, intrusion prevention operations, encryption operations, and other types of middlebox operations.

[0091] Once the service engine 115 performs the service operation of the data message, the service engine notifies the destination VM's SFE port. When the service operation does not result in the data message being discarded or re-directed, the destination VM's SFE port provides the data message to the destination VM. In some embodiments, the destination VM's SFE port successively calls multiple service nodes 115 to perform sequentially multiple service operations on the data message based on the service tag and/or identifier, before providing the data message to the destination VM, in cases when the data message is not discarded or redirected as a result of these operations.

[0092] Figure 6 illustrates an example of how the service engines 115 are managed in some embodiments. This figure illustrates multiple hosts 600 in a datacenter. As shown, each host includes a service engine 115, a context filter 110, attribute-resolving engine 120, several VMs 205, and an SFE 210. It also illustrates a set of controllers 610 for managing the service engines 115, VMs 205, and SFEs 210. As mentioned above, the context filters 110 in some embodiments pass service tag definitions to each other through the controller set, while in other embodiments these filters pass these definitions directly to each other. The controller set communicates with the hosts through a network 650 (e.g., through a local area network, a wide area network, a network of networks (such as the Internet), etc.). The hosts are also communicatively connected to each other through this network 650.

[0093] Some embodiments use the following service tag data structure:

Service Tag n = { uid,

SN1 = <nsgl, nsg2, nsg3>,

SN2 = <nsg2, nsg4, nsg5>,

SN3 = <nsg3, nsg5> },

where SN1, SN2, and SN3 are three service nodes (e.g., a firewall, a load balancer and an encryptor), uid is the host identifier, nsg stands for network security group, and nsgl-nsg5 are five network security group identifiers. As shown by this example, the service tags of some embodiments define different attribute subsets for different service nodes. In this example, nsgl, nsg2, and nsg3 are the attribute subset for service node SN1, nsg2, nsg4, and nsg5 are the attribute subset for service node SN2, and nsg3 and nsg5 are the attribute subset for service node SN3.

[0094] A nsg can be composed of AD (active directory) groups for the user identity use cases. For example, a firewall rule can be specified as

src:nsgroup:Nurse, dest:Internet_NSG, actiomblock, HTTP, which indicates that http packets from VMs used by Nurses that are addressed to destination Internet NSG should be dropped. [0095] In some embodiments, the context filtering node upon startup gathers all the identity related nsgroups and their membership information that pertains to the service rules (e.g., firewall rules, encryption rules, load balancing rules, etc.) and stores it internally. It interacts with the service engines to gather this information, or with a higher level policy engine that serves as an interface between the context filtering node and the service engines.

[0096] One specific implementation of some embodiments is as follows. On every network connect operation in the VM, the GI agent (e.g., epsec agent) running inside the VM passes the context (e.g., AD user id, set of AD group ids, source ip, source port, destination ip, destination port) to the context filtering node. Based on these AD groups within the context, the context filtering node determines the associated nsgroup(s) for this context and generates a service tag with a unique host identifier. It passes this service tag back to the GI agent. The GI agent embeds this service tag into the SYN packet GENEVE header. The agent also caches this service tag for any future connections from this user. This optimizes the agent to context filtering node interactions. For identity firewall use cases, the context filtering node programs the firewall service node with the generated service tags for every connection that the guest makes. The firewall service node maintains all the service tags and uses them whenever it receives a SYN packet with a service tag embedded in the GENEVE header from the guest. It uses the service tag to map to the nsgroups and then matches them with the rule.

[0097] The service tags generated by the context filter node might change when the AD group membership or the policy nsgroup membership changes. In some embodiments, the context filter node registers for AD group membership change notifications in the management plane. For any changes to the membership the management plane notifies the context filter, which then invalidates the service tags that are affected, and these service tags are re-generated on an as-needed basis.

[0098] By placing a context filtering node and attribute-resolving engine on each host, the above-described embodiments provide an efficient, scalable approach for providing context- rich service in a data center. Huge amounts of context are typically generated at various points in a datacenter by different applications and events. Contexts such as Login IDs, Group IDs to which subscribers / applications belong, multiple attributes related to a connection such as location, device information, access levels etc.

[0099] The context itself is greatly important/precious to various service nodes but because of a variety of issues, this context information is not usually consumed within a datacenter. First, at times, this context (such as Active Directory Group SID numbers to which a subscriber belongs) can run to hundreds or thousands of values. However, typically, only a portion of the context information is used by the service rules, such as a few of the SIDs might be consumed. Second, the voluminous nature of the context makes the transportation of the context to various points of interpretation and enforcement impossible and hence renders it useless. Third, in a typical datacenter, thousands of endpoints generate the context. It becomes impossible for a central node to process this context in its entirety and distribute it to the various points of consumption.

[00100] Accordingly, the distributed context filtering and mapping of some embodiments allows the amount of context to be consumed to just the right amount required for consumption, hence doing away with the unnecessary contextual data. This approach also tokenizes the contextual data so that the context can be transported to any point within the datacenter network or beyond.

[00101] Many of the above-described features and applications are implemented as software processes that are specified as a set of instructions recorded on a computer readable storage medium (also referred to as computer readable medium). When these instructions are executed by one or more processing unit(s) (e.g., one or more processors, cores of processors, or other processing units), they cause the processing unit(s) to perform the actions indicated in the instructions. Examples of computer readable media include, but are not limited to, CD- ROMs, flash drives, RAM chips, hard drives, EPROMs, etc. The computer readable media does not include carrier waves and electronic signals passing wirelessly or over wired connections.

[00102] In this specification, the term "software" is meant to include firmware residing in read-only memory or applications stored in magnetic storage, which can be read into memory for processing by a processor. Also, in some embodiments, multiple software inventions can be implemented as sub-parts of a larger program while remaining distinct software inventions. In some embodiments, multiple software inventions can also be implemented as separate programs. Finally, any combination of separate programs that together implement a software invention described here is within the scope of the invention. In some embodiments, the software programs, when installed to operate on one or more electronic systems, define one or more specific machine implementations that execute and perform the operations of the software programs.

[00103] Figure 7 conceptually illustrates a computer system 700 with which some embodiments of the invention are implemented. The computer system 700 can be used to implement any of the above-described hosts, controllers, and managers. As such, it can be used to execute any of the above described processes. This computer system includes various types of non-transitory machine readable media and interfaces for various other types of machine readable media. Computer system 700 includes a bus 705, processing unit(s) 710, a system memory 725, a read-only memory 730, a permanent storage device 735, input devices 740, and output devices 745.

[00104] The bus 705 collectively represents all system, peripheral, and chipset buses that communicatively connect the numerous internal devices of the computer system 700. For instance, the bus 705 communicatively connects the processing unit(s) 710 with the read-only memory 730, the system memory 725, and the permanent storage device 735.

[00105] From these various memory units, the processing unit(s) 710 retrieve instructions to execute and data to process in order to execute the processes of the invention. The processing unit(s) may be a single processor or a multi-core processor in different embodiments. The read-only-memory (ROM) 730 stores static data and instructions that are needed by the processing unit(s) 710 and other modules of the computer system. The permanent storage device 735, on the other hand, is a read-and-write memory device. This device is a non-volatile memory unit that stores instructions and data even when the computer system 700 is off. Some embodiments of the invention use a mass-storage device (such as a magnetic or optical disk and its corresponding disk drive) as the permanent storage device 735.

[00106] Other embodiments use a removable storage device (such as a floppy disk, flash drive, etc.) as the permanent storage device. Like the permanent storage device 735, the system memory 725 is a read-and-write memory device. However, unlike storage device 735, the system memory is a volatile read-and-write memory, such a random access memory. The system memory stores some of the instructions and data that the processor needs at runtime. In some embodiments, the invention's processes are stored in the system memory 725, the permanent storage device 735, and/or the read-only memory 730. From these various memory units, the processing unit(s) 710 retrieve instructions to execute and data to process in order to execute the processes of some embodiments.

[00107] The bus 705 also connects to the input and output devices 740 and 745. The input devices enable the user to communicate information and select commands to the computer system. The input devices 740 include alphanumeric keyboards and pointing devices (also called "cursor control devices"). The output devices 745 display images generated by the computer system. The output devices include printers and display devices, such as cathode ray tubes (CRT) or liquid crystal displays (LCD). Some embodiments include devices such as a touchscreen that function as both input and output devices.

[00108] Finally, as shown in Figure 7, bus 705 also couples computer system 700 to a network 765 through a network adapter (not shown). In this manner, the computer can be a part of a network of computers (such as a local area network ("LAN"), a wide area network ("WAN"), or an Intranet, or a network of networks, such as the Internet. Any or all components of computer system 700 may be used in conjunction with the invention.

[00109] Some embodiments include electronic components, such as microprocessors, storage and memory that store computer program instructions in a machine-readable or computer-readable medium (alternatively referred to as computer-readable storage media, machine-readable media, or machine-readable storage media). Some examples of such computer-readable media include RAM, ROM, read-only compact discs (CD-ROM), recordable compact discs (CD-R), rewritable compact discs (CD-RW), read-only digital versatile discs (e.g., DVD-ROM, dual-layer DVD-ROM), a variety of recordable/rewritable DVDs (e.g., DVD-RAM, DVD-RW, DVD+RW, etc.), flash memory (e.g., SD cards, mini- SD cards, micro-SD cards, etc.), magnetic and/or solid state hard drives, read-only and recordable Blu-Ray® discs, ultra density optical discs, any other optical or magnetic media, and floppy disks. The computer-readable media may store a computer program that is executable by at least one processing unit and includes sets of instructions for performing various operations. Examples of computer programs or computer code include machine code, such as is produced by a compiler, and files including higher-level code that are executed by a computer, an electronic component, or a microprocessor using an interpreter.

[00110] While the above discussion primarily refers to microprocessor or multi-core processors that execute software, some embodiments are performed by one or more integrated circuits, such as application specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs). In some embodiments, such integrated circuits execute instructions that are stored on the circuit itself.

[00111] As used in this specification, the terms "computer", "server", "processor", and

"memory" all refer to electronic or other technological devices. These terms exclude people or groups of people. For the purposes of the specification, the terms display or displaying means displaying on an electronic device. As used in this specification, the terms "computer readable medium," "computer readable media," and "machine readable medium" are entirely restricted to tangible, physical objects that store information in a form that is readable by a computer. These terms exclude any wireless signals, wired download signals, and any other ephemeral or transitory signals.

[00112] While the invention has been described with reference to numerous specific details, one of ordinary skill in the art will recognize that the invention can be embodied in other specific forms without departing from the spirit of the invention. For instance, several figures conceptually illustrate processes. The specific operations of these processes may not be performed in the exact order shown and described. The specific operations may not be performed in one continuous series of operations, and different specific operations may be performed in different embodiments. Furthermore, the process could be implemented using several sub-processes, or as part of a larger macro process. Thus, one of ordinary skill in the art would understand that the invention is not to be limited by the foregoing illustrative details, but rather is to be defined by the appended claims.