vtep — hardware_vtep database schema
This schema specifies relations that a VTEP can use to integrate physical ports into logical switches maintained by a network virtualization controller such as NSX.
Glossary:
- VTEP
VXLAN Tunnel End Point, an entity which originates and/or terminates VXLAN tunnels.
- HSC
Hardware Switch Controller.
- NVC
Network Virtualization Controller, e.g. NSX.
- VRF
Virtual Routing and Forwarding instance.
Common Column
Some tables contain a column, named other_config. This column has the same form and purpose each place that it appears, so we describe it here to save space later.
- other_config: map of string-string pairs
-
Key-value pairs for configuring rarely used or proprietary features.
Some tables do not have other_config column because no key-value pairs have yet been defined for them.
Table Summary
The following list summarizes the purpose of each of the tables in the hardware_vtep database. Each table is described in more detail on a later page.
- Table
Purpose
- Global
Top-level configuration.
- Manager
OVSDB management connection.
- Physical_Switch
A physical switch.
- Tunnel
A tunnel created by a physical switch.
- Physical_Port
A port within a physical switch.
- Logical_Binding_Stats
Statistics for a VLAN on a physical port bound to a logical network.
- Logical_Switch
A layer-2 domain.
- Ucast_Macs_Local
Unicast MACs (local)
- Ucast_Macs_Remote
Unicast MACs (remote)
- Mcast_Macs_Local
Multicast MACs (local)
- Mcast_Macs_Remote
Multicast MACs (remote)
- Logical_Router
A logical L3 router.
- Arp_Sources_Local
ARP source addresses for logical routers
- Arp_Sources_Remote
ARP source addresses for logical routers
- Physical_Locator_Set
Physical_Locator_Set configuration.
- Physical_Locator
Physical_Locator configuration.
- ACL_entry
ACL_entry configuration.
- ACL
ACL configuration.
Table Relationships
The following diagram shows the relationship among tables in the database. Each node represents a table. Tables that are part of the “root set” are shown with double borders. Each edge leads from the table that contains it and points to the table that its value represents. Edges are labeled with their column names, followed by a constraint on the number of allowed values: ? for zero or one, * for zero or more, + for one or more. Thick lines represent strong references; thin lines represent weak references.
linethick = 1; linethick = 0.500000; box at 0.174802,3.159404 wid 0.349605 height 0.233070 "Global" box at 0.174802,3.159404 wid 0.294049 height 0.177514 linethick = 1.000000; box at 1.414595,3.334206 wid 0.420822 height 0.233070 "Manager" linethick = 1.000000; box at 1.414595,2.984601 wid 0.692731 height 0.233070 "Physical_Switch" linethick = 1.000000; box at 2.923071,3.405432 wid 0.595634 height 0.233070 "Physical_Port" linethick = 1.000000; box at 2.923071,1.204179 wid 0.349605 height 0.233070 "Tunnel" linethick = 0.500000; box at 4.862306,2.479585 wid 0.666860 height 0.233070 "Logical_Switch" box at 4.862306,2.479585 wid 0.611304 height 0.177514 linethick = 0.500000; box at 4.862306,3.230630 wid 0.349605 height 0.233070 "ACL" box at 4.862306,3.230630 wid 0.294049 height 0.177514 linethick = 1.000000; box at 4.862306,3.774429 wid 0.919321 height 0.233070 "Logical_Binding_Stats" linethick = 1.000000; box at 4.862306,0.958197 wid 0.725127 height 0.233070 "Physical_Locator" linethick = 0.500000; box at 2.923071,1.935786 wid 0.789828 height 0.233070 "Ucast_Macs_Local" box at 2.923071,1.935786 wid 0.734272 height 0.177514 linethick = 0.500000; box at 2.923071,1.566743 wid 0.867533 height 0.233070 "Ucast_Macs_Remote" box at 2.923071,1.566743 wid 0.811978 height 0.177514 linethick = 0.500000; box at 1.414595,2.563770 wid 0.809266 height 0.233070 "Mcast_Macs_Local" box at 1.414595,2.563770 wid 0.753710 height 0.177514 linethick = 1.000000; box at 2.923071,0.116535 wid 0.880492 height 0.233070 "Physical_Locator_Set" linethick = 0.500000; box at 1.414595,2.071713 wid 0.886971 height 0.233070 "Mcast_Macs_Remote" box at 1.414595,2.071713 wid 0.831416 height 0.177514 linethick = 0.500000; box at 2.923071,2.971643 wid 0.660381 height 0.233070 "Logical_Router" box at 2.923071,2.971643 wid 0.604825 height 0.177514 linethick = 0.500000; box at 2.923071,0.466140 wid 0.815745 height 0.233070 "Arp_Sources_Local" box at 2.923071,0.466140 wid 0.760189 height 0.177514 linethick = 0.500000; box at 2.923071,0.815745 wid 0.886971 height 0.233070 "Arp_Sources_Remote" box at 2.923071,0.815745 wid 0.831416 height 0.177514 linethick = 0.500000; box at 6.247674,3.230630 wid 0.504969 height 0.233070 "ACL_entry" box at 6.247674,3.230630 wid 0.449414 height 0.177514 linethick = 1.000000; spline -> from 0.349805,3.184062 to 0.349805,3.184062 to 0.574098,3.215667 to 0.962579,3.270485 to 1.203154,3.304373 "managers*" at 0.660381,3.305072 linethick = 1.000000; spline -> from 0.352514,3.107196 to 0.352514,3.107196 to 0.389805,3.097640 to 0.429068,3.088504 to 0.466140,3.081698 to 0.663970,3.045432 to 0.887624,3.021706 to 1.068113,3.006789 "switches*" at 0.660381,3.130270 linethick = 1.000000; spline -> from 1.762569,3.081652 to 1.762569,3.081652 to 2.023607,3.154463 to 2.377174,3.253098 to 2.625207,3.322320 "ports*" at 2.168856,3.298593 linethick = 1.000000; spline -> from 1.685376,2.866994 to 1.685376,2.866994 to 1.747046,2.831800 to 1.808810,2.788916 to 1.858081,2.738573 to 2.194820,2.394561 to 2.180836,2.225958 to 2.363050,1.780375 to 2.431293,1.613544 to 2.359274,1.526142 to 2.479585,1.391941 to 2.548807,1.314748 to 2.655040,1.268274 to 2.746171,1.240911 "tunnels*" at 2.168856,2.657651 linethick = 1.000000; spline -> from 3.221214,3.323858 to 3.221214,3.323858 to 3.305492,3.302369 to 3.397555,3.280367 to 3.483091,3.262980 to 3.836845,3.191055 to 3.969602,3.313882 to 4.285878,3.139966 to 4.407214,3.073261 to 4.647276,2.767613 to 4.776070,2.596540 "vlan_bindings value*" at 3.884484,3.311552 linethick = 1.000000; spline -> from 3.222472,3.422446 to 3.222472,3.422446 to 3.499686,3.433168 to 3.922661,3.436757 to 4.285878,3.385994 to 4.422363,3.366929 to 4.572554,3.325722 to 4.685173,3.290529 "acl_bindings value*" at 3.884484,3.473396 linethick = 1.000000; spline -> from 3.221307,3.486354 to 3.221307,3.486354 to 3.305632,3.507843 to 3.397648,3.529985 to 3.483091,3.547838 to 3.787108,3.611373 to 4.131818,3.668009 to 4.400595,3.708796 "vlan_stats value*" at 3.884484,3.732336 linethick = 1.000000; spline -> from 3.100017,1.189916 to 3.100017,1.189916 to 3.360403,1.167914 to 3.861970,1.121952 to 4.285878,1.061774 to 4.354307,1.052031 to 4.427071,1.040191 to 4.496992,1.028025 "local" at 3.884484,1.200963 linethick = 1.000000; spline -> from 3.098852,1.119762 to 3.098852,1.119762 to 3.259158,1.042942 to 3.472650,0.941090 to 3.483091,0.938759 to 3.820623,0.862825 to 4.216469,0.878767 to 4.497272,0.907108 "remote" at 3.884484,0.987331 linethick = 1.000000; spline -> from 5.037575,3.230630 to 5.037575,3.230630 to 5.279502,3.230630 to 5.716275,3.230630 to 5.993628,3.230630 "acl_entries+" at 5.658473,3.279155 linethick = 1.000000; spline -> from 3.320688,2.014471 to 3.320688,2.014471 to 3.375227,2.025099 to 3.430557,2.035820 to 3.483091,2.045842 to 3.839549,2.113898 to 3.941587,2.080150 to 4.285878,2.194727 to 4.415092,2.237752 to 4.551997,2.304456 to 4.659955,2.362491 "logical_switch" at 3.884484,2.243299 linethick = 1.000000; spline -> from 3.318031,1.913318 to 3.318031,1.913318 to 3.690570,1.889405 to 4.203930,1.848385 to 4.285878,1.799813 to 4.569244,1.631910 to 4.744839,1.263706 to 4.819888,1.074872 "locator" at 3.884484,1.951961 linethick = 1.000000; spline -> from 3.357560,1.572337 to 3.357560,1.572337 to 3.695418,1.583571 to 4.131818,1.615315 to 4.285878,1.702716 to 4.551717,1.853512 to 4.730855,2.182561 to 4.811963,2.360207 "logical_switch" at 3.884484,1.751241 linethick = 1.000000; spline -> from 3.358166,1.480367 to 3.358166,1.480367 to 3.723200,1.406764 to 4.198616,1.307942 to 4.285878,1.275406 to 4.424974,1.223524 to 4.571388,1.142556 to 4.681910,1.075385 "locator" at 3.884484,1.498780 linethick = 1.000000; spline -> from 1.820230,2.553888 to 1.820230,2.553888 to 2.508159,2.537060 to 3.890871,2.503312 to 4.527385,2.487789 "logical_switch" at 2.923071,2.586424 linethick = 1.000000; spline -> from 1.629299,2.446070 to 1.629299,2.446070 to 1.709615,2.393582 to 1.796270,2.325572 to 1.858081,2.246515 to 2.233510,1.766204 to 2.198456,1.558166 to 2.363050,0.971109 to 2.445837,0.675996 to 2.286230,0.529209 to 2.479585,0.291338 to 2.497345,0.269504 to 2.518228,0.250387 to 2.541022,0.233648 "locator_set" at 2.168856,2.133243 linethick = 1.000000; spline -> from 3.364039,0.109594 to 3.364039,0.109594 to 3.643117,0.122511 to 4.003257,0.172169 to 4.285878,0.323706 to 4.512748,0.445350 to 4.696360,0.692311 to 4.790987,0.839705 "locators+" at 3.884484,0.372264 linethick = 1.000000; spline -> from 1.859992,2.174543 to 1.859992,2.174543 to 1.898635,2.181955 to 1.937278,2.188807 to 1.974616,2.194727 to 2.892539,2.340302 to 3.987175,2.424021 to 4.527012,2.459401 "logical_switch" at 2.923071,2.411622 linethick = 1.000000; spline -> from 1.454217,1.954059 to 1.454217,1.954059 to 1.565671,1.640067 to 1.911081,0.773699 to 2.479585,0.291338 to 2.504850,0.269923 to 2.533051,0.250928 to 2.562605,0.234128 "locator_set" at 2.168856,0.896667 linethick = 1.000000; spline -> from 3.254729,2.897853 to 3.254729,2.897853 to 3.531430,2.835063 to 3.935620,2.740624 to 4.285878,2.647955 to 4.364469,2.627118 to 4.448654,2.603392 to 4.527758,2.580411 "switch_binding value*" at 3.884484,2.890721 linethick = 1.000000; spline -> from 3.253611,2.964790 to 3.253611,2.964790 to 3.531383,2.964744 to 3.937671,2.978168 to 4.285878,3.042869 to 4.423482,3.068413 to 4.573626,3.118104 to 4.686105,3.159916 "acl_binding value*" at 3.884484,3.091394 linethick = 1.000000; spline -> from 3.333880,0.440129 to 3.333880,0.440129 to 3.610394,0.435832 to 3.979111,0.456109 to 4.285878,0.563237 to 4.456485,0.622810 to 4.623689,0.746850 to 4.733652,0.840078 "locator" at 3.884484,0.611809 linethick = 1.000000; spline -> from 3.368747,0.724475 to 3.368747,0.724475 to 3.636778,0.684247 to 3.983493,0.658283 to 4.285878,0.718648 to 4.407027,0.742794 to 4.534517,0.792951 to 4.638699,0.841150 "locator" at 3.884484,0.767173
Global TABLE
Top-level configuration for a hardware VTEP. There must be exactly one record in the Global table.
Summary
- switches
set of Physical_Switchs
- Database Configuration:
- managers
set of Managers
- Common Column:
- other_config
map of string-string pairs
Details
- switches: set of Physical_Switchs
-
The physical switch or switches managed by the VTEP.
When a physical switch integrates support for this VTEP schema, which is expected to be the most common case, this column should point to one Physical_Switch record that represents the switch itself. In another possible implementation, a server or a VM presents a VTEP schema front-end interface to one or more physical switches, presumably communicating with those physical switches over a proprietary protocol. In that case, this column would point to one Physical_Switch for each physical switch, and the set might change over time as the front-end server comes to represent a differing set of switches.
Database Configuration:
These columns primarily configure the database server (ovsdb-server), not the hardware VTEP itself.
- managers: set of Managers
Database clients to which the database server should connect or to which it should listen, along with options for how these connection should be configured. See the Manager table for more information.
Common Column:
The overall purpose of this column is described under Common Column at the beginning of this document.
other_config: map of string-string pairs
Manager TABLE
Configuration for a database connection to an Open vSwitch Database (OVSDB) client.
The database server can initiate and maintain active connections to remote clients. It can also listen for database connections.
Summary
- Core Features:
- target
string (must be unique within table)
- Client Failure Detection and Handling:
- max_backoff
optional integer, at least 1,000
- inactivity_probe
optional integer
- Status:
- is_connected
boolean
- status : last_error
optional string
- status : state
optional string, one of ACTIVE, BACKOFF, CONNECTING, IDLE, or VOID
- status : sec_since_connect
optional string, containing an integer, at least 0
- status : sec_since_disconnect
optional string, containing an integer, at least 0
- status : locks_held
optional string
- status : locks_waiting
optional string
- status : locks_lost
optional string
- status : n_connections
optional string, containing an integer, at least 2
- Connection Parameters:
- other_config : dscp
optional string, containing an integer
Details
Core Features:
- target: string (must be unique within table)
-
Connection method for managers.
The following connection methods are currently supported:
- ssl:host[:port]
-
The specified SSL port (default: 6640) on the given host, which can either be a DNS name (if built with unbound library) or an IP address.
SSL key and certificate configuration happens outside the database.
- tcp:host[:port]
The specified TCP port (default: 6640) on the given host, which can either be a DNS name (if built with unbound library) or an IP address.
- pssl:[port][:host]
Listens for SSL connections on the specified TCP port (default: 6640). If host, which can either be a DNS name (if built with unbound library) or an IP address, is specified, then connections are restricted to the resolved or specified local IP address.
- ptcp:[port][:host]
Listens for connections on the specified TCP port (default: 6640). If host, which can either be a DNS name (if built with unbound library) or an IP address, is specified, then connections are restricted to the resolved or specified local IP address.
Client Failure Detection and Handling:
- max_backoff: optional integer, at least 1,000
Maximum number of milliseconds to wait between connection attempts. Default is implementation-specific.
- inactivity_probe: optional integer
Maximum number of milliseconds of idle time on connection to the client before sending an inactivity probe message. If the Open vSwitch database does not communicate with the client for the specified number of seconds, it will send a probe. If a response is not received for the same additional amount of time, the database server assumes the connection has been broken and attempts to reconnect. Default is implementation-specific. A value of 0 disables inactivity probes.
Status:
- is_connected: boolean
true if currently connected to this manager, false otherwise.
- status : last_error: optional string
A human-readable description of the last error on the connection to the manager; i.e. strerror(errno). This key will exist only if an error has occurred.
- status : state: optional string, one of ACTIVE, BACKOFF, CONNECTING, IDLE, or VOID
-
The state of the connection to the manager:
- VOID
Connection is disabled.
- BACKOFF
Attempting to reconnect at an increasing period.
- CONNECTING
Attempting to connect.
- ACTIVE
Connected, remote host responsive.
- IDLE
Connection is idle. Waiting for response to keep-alive.
These values may change in the future. They are provided only for human consumption.
- status : sec_since_connect: optional string, containing an integer, at least 0
The amount of time since this manager last successfully connected to the database (in seconds). Value is empty if manager has never successfully connected.
- status : sec_since_disconnect: optional string, containing an integer, at least 0
The amount of time since this manager last disconnected from the database (in seconds). Value is empty if manager has never disconnected.
- status : locks_held: optional string
Space-separated list of the names of OVSDB locks that the connection holds. Omitted if the connection does not hold any locks.
- status : locks_waiting: optional string
Space-separated list of the names of OVSDB locks that the connection is currently waiting to acquire. Omitted if the connection is not waiting for any locks.
- status : locks_lost: optional string
Space-separated list of the names of OVSDB locks that the connection has had stolen by another OVSDB client. Omitted if no locks have been stolen from this connection.
- status : n_connections: optional string, containing an integer, at least 2
-
When target specifies a connection method that listens for inbound connections (e.g. ptcp: or pssl:) and more than one connection is actually active, the value is the number of active connections. Otherwise, this key-value pair is omitted.
When multiple connections are active, status columns and key-value pairs (other than this one) report the status of one arbitrarily chosen connection.
Connection Parameters:
Additional configuration for a connection between the manager and the database server.
- other_config : dscp: optional string, containing an integer
The Differentiated Service Code Point (DSCP) is specified using 6 bits in the Type of Service (TOS) field in the IP header. DSCP provides a mechanism to classify the network traffic and provide Quality of Service (QoS) on IP networks. The DSCP value specified here is used when establishing the connection between the manager and the database server. If no value is specified, a default value of 48 is chosen. Valid DSCP values must be in the range 0 to 63.
Physical_Switch TABLE
A physical switch that implements a VTEP.
Summary
- ports
set of Physical_Ports
- tunnels
set of Tunnels
- Network Status:
- management_ips
set of strings
- tunnel_ips
set of strings
- Identification:
- name
string (must be unique within table)
- description
string
- Error Notification:
- switch_fault_status : mac_table_exhaustion
none
- switch_fault_status : tunnel_exhaustion
none
- switch_fault_status : lr_switch_bindings_fault
none
- switch_fault_status : lr_static_routes_fault
none
- switch_fault_status : lr_creation_fault
none
- switch_fault_status : lr_support_fault
none
- switch_fault_status : unspecified_fault
none
- switch_fault_status : unsupported_source_node_replication
none
- Common Column:
- other_config
map of string-string pairs
Details
- ports: set of Physical_Ports
The physical ports within the switch.
- tunnels: set of Tunnels
Tunnels created by this switch as instructed by the NVC.
Network Status:
- management_ips: set of strings
IPv4 or IPv6 addresses at which the switch may be contacted for management purposes.
- tunnel_ips: set of strings
-
IPv4 or IPv6 addresses on which the switch may originate or terminate tunnels.
This column is intended to allow a Manager to determine the Physical_Switch that terminates the tunnel represented by a Physical_Locator.
Identification:
- name: string (must be unique within table)
Symbolic name for the switch, such as its hostname.
- description: string
An extended description for the switch, such as its switch login banner.
Error Notification:
An entry in this column indicates to the NVC that this switch has encountered a fault. The switch must clear this column when the fault has been cleared.
- switch_fault_status : mac_table_exhaustion: none
Indicates that the switch has been unable to process MAC entries requested by the NVC due to lack of table resources.
- switch_fault_status : tunnel_exhaustion: none
Indicates that the switch has been unable to create tunnels requested by the NVC due to lack of resources.
- switch_fault_status : lr_switch_bindings_fault: none
Indicates that the switch has been unable to create the logical router interfaces requested by the NVC due to conflicting configurations or a lack of hardware resources.
- switch_fault_status : lr_static_routes_fault: none
Indicates that the switch has been unable to create the static routes requested by the NVC due to conflicting configurations or a lack of hardware resources.
- switch_fault_status : lr_creation_fault: none
Indicates that the switch has been unable to create the logical router requested by the NVC due to conflicting configurations or a lack of hardware resources.
- switch_fault_status : lr_support_fault: none
Indicates that the switch does not support logical routing.
- switch_fault_status : unspecified_fault: none
Indicates that an error has occurred in the switch but that no more specific information is available.
- switch_fault_status : unsupported_source_node_replication: none
Indicates that the requested source node replication mode cannot be supported by the physical switch; this specifically means in this context that the physical switch lacks the capability to support source node replication mode. This error occurs when a controller attempts to set source node replication mode for one of the logical switches that the physical switch is keeping context for. An NVC that observes this error should take appropriate action (for example reverting the logical switch to service node replication mode). It is recommended that an NVC be proactive and test for support of source node replication by using a test logical switch on vtep physical switch nodes and then trying to change the replication mode to source node on this logical switch, checking for error. The NVC could remember this capability per vtep physical switch. Using mixed replication modes on a given logical switch is not recommended. Service node replication mode is considered a basic requirement since it only requires sending a packet to a single transport node, hence it is not expected that a switch should report that service node mode cannot be supported.
Common Column:
The overall purpose of this column is described under Common Column at the beginning of this document.
other_config: map of string-string pairs
Tunnel TABLE
A tunnel created by a Physical_Switch.
Summary
- local
Physical_Locator
- remote
Physical_Locator
- Bidirectional Forwarding Detection (BFD):
- BFD Local Configuration:
- bfd_config_local : bfd_dst_mac
optional string
- bfd_config_local : bfd_dst_ip
optional string
- BFD Remote Configuration:
- bfd_config_remote : bfd_dst_mac
optional string
- bfd_config_remote : bfd_dst_ip
optional string
- BFD Parameters:
- bfd_params : enable
optional string, either true or false
- bfd_params : min_rx
optional string, containing an integer, at least 1
- bfd_params : min_tx
optional string, containing an integer, at least 1
- bfd_params : decay_min_rx
optional string, containing an integer
- bfd_params : forwarding_if_rx
optional string, either true or false
- bfd_params : cpath_down
optional string, either true or false
- bfd_params : check_tnl_key
optional string, either true or false
- BFD Status:
- bfd_status : enabled
optional string, either true or false
- bfd_status : state
optional string, one of admin_down, down, init, or up
- bfd_status : forwarding
optional string, either true or false
- bfd_status : diagnostic
optional string
- bfd_status : remote_state
optional string, one of admin_down, down, init, or up
- bfd_status : remote_diagnostic
optional string
- bfd_status : info
optional string
Details
- local: Physical_Locator
Tunnel end-point local to the physical switch.
- remote: Physical_Locator
Tunnel end-point remote to the physical switch.
Bidirectional Forwarding Detection (BFD):
BFD, defined in RFC 5880, allows point to point detection of connectivity failures by occasional transmission of BFD control messages. VTEPs are expected to implement BFD.
BFD operates by regularly transmitting BFD control messages at a rate negotiated independently in each direction. Each endpoint specifies the rate at which it expects to receive control messages, and the rate at which it’s willing to transmit them. An endpoint which fails to receive BFD control messages for a period of three times the expected reception rate will signal a connectivity fault. In the case of a unidirectional connectivity issue, the system not receiving BFD control messages will signal the problem to its peer in the messages it transmits.
A hardware VTEP is expected to use BFD to determine reachability of devices at the end of the tunnels with which it exchanges data. This can enable the VTEP to choose a functioning service node among a set of service nodes providing high availability. It also enables the NVC to report the health status of tunnels.
In many cases the BFD peer of a hardware VTEP will be an Open vSwitch instance. The Open vSwitch implementation of BFD aims to comply faithfully with the requirements put forth in RFC 5880. Open vSwitch does not implement the optional Authentication or ``Echo Mode’’ features.
BFD Local Configuration:
The HSC writes the key-value pairs in the bfd_config_local column to specify the local configurations to be used for BFD sessions on this tunnel.
- bfd_config_local : bfd_dst_mac: optional string
Set to an Ethernet address in the form xx:xx:xx:xx:xx:xx to set the MAC expected as destination for received BFD packets. The default is 00:23:20:00:00:01.
- bfd_config_local : bfd_dst_ip: optional string
Set to an IPv4 address to set the IP address that is expected as destination for received BFD packets. The default is 169.254.1.0.
BFD Remote Configuration:
The bfd_config_remote column is the remote counterpart of the bfd_config_local column. The NVC writes the key-value pairs in this column.
- bfd_config_remote : bfd_dst_mac: optional string
Set to an Ethernet address in the form xx:xx:xx:xx:xx:xx to set the destination MAC to be used for transmitted BFD packets. The default is 00:23:20:00:00:01.
- bfd_config_remote : bfd_dst_ip: optional string
Set to an IPv4 address to set the IP address used as destination for transmitted BFD packets. The default is 169.254.1.1.
BFD Parameters:
The NVC sets up key-value pairs in the bfd_params column to enable and configure BFD.
- bfd_params : enable: optional string, either true or false
True to enable BFD on this Tunnel. If not specified, BFD will not be enabled by default.
- bfd_params : min_rx: optional string, containing an integer, at least 1
The shortest interval, in milliseconds, at which this BFD session offers to receive BFD control messages. The remote endpoint may choose to send messages at a slower rate. Defaults to 1000.
- bfd_params : min_tx: optional string, containing an integer, at least 1
The shortest interval, in milliseconds, at which this BFD session is willing to transmit BFD control messages. Messages will actually be transmitted at a slower rate if the remote endpoint is not willing to receive as quickly as specified. Defaults to 100.
- bfd_params : decay_min_rx: optional string, containing an integer
An alternate receive interval, in milliseconds, that must be greater than or equal to bfd_params:min_rx. The implementation should switch from bfd_params:min_rx to bfd_params:decay_min_rx when there is no obvious incoming data traffic at the tunnel, to reduce the CPU and bandwidth cost of monitoring an idle tunnel. This feature may be disabled by setting a value of 0. This feature is reset whenever bfd_params:decay_min_rx or bfd_params:min_rx changes.
- bfd_params : forwarding_if_rx: optional string, either true or false
When true, traffic received on the Tunnel is used to indicate the capability of packet I/O. BFD control packets are still transmitted and received. At least one BFD control packet must be received every 100 * bfd_params:min_rx amount of time. Otherwise, even if traffic is received, the bfd_params:forwarding will be false.
- bfd_params : cpath_down: optional string, either true or false
Set to true to notify the remote endpoint that traffic should not be forwarded to this system for some reason other than a connectivity failure on the interface being monitored. The typical underlying reason is ``concatenated path down,’’ that is, that connectivity beyond the local system is down. Defaults to false.
- bfd_params : check_tnl_key: optional string, either true or false
Set to true to make BFD accept only control messages with a tunnel key of zero. By default, BFD accepts control messages with any tunnel key.
BFD Status:
The VTEP sets key-value pairs in the bfd_status column to report the status of BFD on this tunnel. When BFD is not enabled, with bfd_params:enable, the HSC clears all key-value pairs from bfd_status.
- bfd_status : enabled: optional string, either true or false
Set to true if the BFD session has been successfully enabled. Set to false if the VTEP cannot support BFD or has insufficient resources to enable BFD on this tunnel. The NVC will disable the BFD monitoring on the other side of the tunnel once this value is set to false.
- bfd_status : state: optional string, one of admin_down, down, init, or up
Reports the state of the BFD session. The BFD session is fully healthy and negotiated if UP.
- bfd_status : forwarding: optional string, either true or false
Reports whether the BFD session believes this Tunnel may be used to forward traffic. Typically this means the local session is signaling UP, and the remote system isn’t signaling a problem such as concatenated path down.
- bfd_status : diagnostic: optional string
A diagnostic code specifying the local system’s reason for the last change in session state. The error messages are defined in section 4.1 of [RFC 5880].
- bfd_status : remote_state: optional string, one of admin_down, down, init, or up
Reports the state of the remote endpoint’s BFD session.
- bfd_status : remote_diagnostic: optional string
A diagnostic code specifying the remote system’s reason for the last change in session state. The error messages are defined in section 4.1 of [RFC 5880].
- bfd_status : info: optional string
A short message providing further information about the BFD status (possibly including reasons why BFD could not be enabled).
Physical_Port TABLE
A port within a Physical_Switch.
Summary
- vlan_bindings
map of integer-Logical_Switch pairs, key in range 0 to 4,095
- acl_bindings
map of integer-ACL pairs, key in range 0 to 4,095
- vlan_stats
map of integer-Logical_Binding_Stats pairs, key in range 0 to 4,095
- Identification:
- name
string
- description
string
- Error Notification:
- port_fault_status : invalid_vlan_map
none
- port_fault_status : invalid_ACL_binding
none
- port_fault_status : unspecified_fault
none
- Common Column:
- other_config
map of string-string pairs
Details
- vlan_bindings: map of integer-Logical_Switch pairs, key in range 0 to 4,095
Identifies how VLANs on the physical port are bound to logical switches. If, for example, the map contains a (VLAN, logical switch) pair, a packet that arrives on the port in the VLAN is considered to belong to the paired logical switch. A value of zero in the VLAN field means that untagged traffic on the physical port is mapped to the logical switch.
- acl_bindings: map of integer-ACL pairs, key in range 0 to 4,095
Attach Access Control Lists (ACLs) to the physical port. The column consists of a map of VLAN tags to ACLs. If the value of the VLAN tag in the map is 0, this means that the ACL is associated with the entire physical port. Non-zero values mean that the ACL is to be applied only on packets carrying that VLAN tag value. Switches will not necessarily support matching on the VLAN tag for all ACLs, and unsupported ACL bindings will cause errors to be reported. The binding of an ACL to a specific VLAN and the binding of an ACL to the entire physical port should not be combined on a single physical port. That is, a mix of zero and non-zero keys in the map is not recommended.
- vlan_stats: map of integer-Logical_Binding_Stats pairs, key in range 0 to 4,095
Statistics for VLANs bound to logical switches on the physical port. An implementation that fully supports such statistics would populate this column with a mapping for every VLAN that is bound in vlan_bindings. An implementation that does not support such statistics or only partially supports them would not populate this column or partially populate it, respectively. A value of zero in the VLAN field refers to untagged traffic on the physical port.
Identification:
- name: string
Symbolic name for the port. The name ought to be unique within a given Physical_Switch, but the database is not capable of enforcing this.
- description: string
An extended description for the port.
Error Notification:
An entry in this column indicates to the NVC that the physical port has encountered a fault. The switch must clear this column when the error has been cleared.
- port_fault_status : invalid_vlan_map: none
Indicates that a VLAN-to-logical-switch mapping requested by the controller could not be instantiated by the switch because of a conflict with local configuration.
- port_fault_status : invalid_ACL_binding: none
Indicates that an error has occurred in associating an ACL with a port.
- port_fault_status : unspecified_fault: none
Indicates that an error has occurred on the port but that no more specific information is available.
Common Column:
The overall purpose of this column is described under Common Column at the beginning of this document.
other_config: map of string-string pairs
Logical_Binding_Stats TABLE
Reports statistics for the Logical_Switch with which a VLAN on a Physical_Port is associated.
Summary
- Statistics:
- packets_from_local
integer
- bytes_from_local
integer
- packets_to_local
integer
- bytes_to_local
integer
Details
Statistics:
These statistics count only packets to which the binding applies.
- packets_from_local: integer
Number of packets sent by the Physical_Switch.
- bytes_from_local: integer
Number of bytes in packets sent by the Physical_Switch.
- packets_to_local: integer
Number of packets received by the Physical_Switch.
- bytes_to_local: integer
Number of bytes in packets received by the Physical_Switch.
Logical_Switch TABLE
A logical Ethernet switch, whose implementation may span physical and virtual media, possibly crossing L3 domains via tunnels; a logical layer-2 domain; an Ethernet broadcast domain.
Summary
- Per Logical-Switch Tunnel Key:
- tunnel_key
optional integer
- Replication Mode:
- replication_mode
optional string, either service_node or source_node
- Identification:
- name
string (must be unique within table)
- description
string
- Common Column:
- other_config
map of string-string pairs
Details
Per Logical-Switch Tunnel Key:
Tunnel protocols tend to have a field that allows the tunnel to be partitioned into sub-tunnels: VXLAN has a VNI, GRE and STT have a key, CAPWAP has a WSI, and so on. We call these generically ``tunnel keys.’’ Given that one needs to use a tunnel key at all, there are at least two reasonable ways to assign their values:
-
Per Logical_Switch+Physical_Locator pair. That is, each logical switch may be assigned a different tunnel key on every Physical_Locator. This model is especially flexible.
In this model, Physical_Locator carries the tunnel key. Therefore, one Physical_Locator record will exist for each logical switch carried at a given IP destination.
-
Per Logical_Switch. That is, every tunnel associated with a particular logical switch carries the same tunnel key, regardless of the Physical_Locator to which the tunnel is addressed. This model may ease switch implementation because it imposes fewer requirements on the hardware datapath.
In this model, Logical_Switch carries the tunnel key. Therefore, one Physical_Locator record will exist for each IP destination.
- tunnel_key: optional integer
-
This column is used only in the tunnel key per Logical_Switch model (see above), because only in that model is there a tunnel key associated with a logical switch.
For vxlan_over_ipv4 encapsulation, when the tunnel key per Logical_Switch model is in use, this column is the VXLAN VNI that identifies a logical switch. It must be in the range 0 to 16,777,215.
Replication Mode:
For handling L2 broadcast, multicast and unknown unicast traffic, packets can be sent to all members of a logical switch referenced by a physical switch. There are different modes to replicate the packets. The default mode of replication is to send the traffic to a service node, which can be a hypervisor, server or appliance, and let the service node handle replication to other transport nodes (hypervisors or other VTEP physical switches). This mode is called service node replication. An alternate mode of replication, called source node replication involves the source node sending to all other transport nodes. Hypervisors are always responsible for doing their own replication for locally attached VMs in both modes. Service node replication mode is the default and considered a basic requirement because it only requires sending the packet to a single transport node.
- replication_mode: optional string, either service_node or source_node
This optional column defines the replication mode per Logical_Switch. There are 2 valid values, service_node and source_node. If the column is not set, the replication mode defaults to service_node.
Identification:
- name: string (must be unique within table)
Symbolic name for the logical switch.
- description: string
An extended description for the logical switch, such as its switch login banner.
Common Column:
The overall purpose of this column is described under Common Column at the beginning of this document.
other_config: map of string-string pairs
Ucast_Macs_Local TABLE
Mapping of unicast MAC addresses to tunnels (physical locators). This table is written by the HSC, so it contains the MAC addresses that have been learned on physical ports by a VTEP.
Summary
- MAC
string
- logical_switch
Logical_Switch
- locator
Physical_Locator
- ipaddr
string
Details
- MAC: string
A MAC address that has been learned by the VTEP.
- logical_switch: Logical_Switch
The Logical switch to which this mapping applies.
- locator: Physical_Locator
The physical locator to be used to reach this MAC address. In this table, the physical locator will be one of the tunnel IP addresses of the appropriate VTEP.
- ipaddr: string
The IP address to which this MAC corresponds. Optional field for the purpose of ARP supression.
Ucast_Macs_Remote TABLE
Mapping of unicast MAC addresses to tunnels (physical locators). This table is written by the NVC, so it contains the MAC addresses that the NVC has learned. These include VM MAC addresses, in which case the physical locators will be hypervisor IP addresses. The NVC will also report MACs that it has learned from other HSCs in the network, in which case the physical locators will be tunnel IP addresses of the corresponding VTEPs.
Summary
- MAC
string
- logical_switch
Logical_Switch
- locator
Physical_Locator
- ipaddr
string
Details
- MAC: string
A MAC address that has been learned by the NVC.
- logical_switch: Logical_Switch
The Logical switch to which this mapping applies.
- locator: Physical_Locator
The physical locator to be used to reach this MAC address. In this table, the physical locator will be either a hypervisor IP address or a tunnel IP addresses of another VTEP.
- ipaddr: string
The IP address to which this MAC corresponds. Optional field for the purpose of ARP supression.
Mcast_Macs_Local TABLE
Mapping of multicast MAC addresses to tunnels (physical locators). This table is written by the HSC, so it contains the MAC addresses that have been learned on physical ports by a VTEP. These may be learned by IGMP snooping, for example. This table also specifies how to handle unknown unicast and broadcast packets.
Summary
- MAC
string
- logical_switch
Logical_Switch
- locator_set
Physical_Locator_Set
- ipaddr
string
Details
- MAC: string
-
A MAC address that has been learned by the VTEP.
The keyword unknown-dst is used as a special ``Ethernet address’’ that indicates the locations to which packets in a logical switch whose destination addresses do not otherwise appear in Ucast_Macs_Local (for unicast addresses) or Mcast_Macs_Local (for multicast addresses) should be sent.
- logical_switch: Logical_Switch
The Logical switch to which this mapping applies.
- locator_set: Physical_Locator_Set
The physical locator set to be used to reach this MAC address. In this table, the physical locator set will be contain one or more tunnel IP addresses of the appropriate VTEP(s).
- ipaddr: string
The IP address to which this MAC corresponds. Optional field for the purpose of ARP supression.
Mcast_Macs_Remote TABLE
Mapping of multicast MAC addresses to tunnels (physical locators). This table is written by the NVC, so it contains the MAC addresses that the NVC has learned. This table also specifies how to handle unknown unicast and broadcast packets.
Multicast packet replication may be handled by a service node, in which case the physical locators will be IP addresses of service nodes. If the VTEP supports replication onto multiple tunnels, using source node replication, then this may be used to replicate directly onto VTEP-hypervisor or VTEP-VTEP tunnels.
Summary
- MAC
string
- logical_switch
Logical_Switch
- locator_set
Physical_Locator_Set
- ipaddr
string
Details
- MAC: string
-
A MAC address that has been learned by the NVC.
The keyword unknown-dst is used as a special ``Ethernet address’’ that indicates the locations to which packets in a logical switch whose destination addresses do not otherwise appear in Ucast_Macs_Remote (for unicast addresses) or Mcast_Macs_Remote (for multicast addresses) should be sent.
- logical_switch: Logical_Switch
The Logical switch to which this mapping applies.
- locator_set: Physical_Locator_Set
The physical locator set to be used to reach this MAC address. In this table, the physical locator set will be either a set of service nodes when service node replication is used or the set of transport nodes (defined as hypervisors or VTEPs) participating in the associated logical switch, when source node replication is used. When service node replication is used, the VTEP should send packets to one member of the locator set that is known to be healthy and reachable, which could be determined by BFD. When source node replication is used, the VTEP should send packets to all members of the locator set.
- ipaddr: string
The IP address to which this MAC corresponds. Optional field for the purpose of ARP supression.
Logical_Router TABLE
A logical router, or VRF. A logical router may be connected to one or more logical switches. Subnet addresses and interface addresses may be configured on the interfaces.
Summary
- switch_binding
map of string-Logical_Switch pairs
- static_routes
map of string-string pairs
- acl_binding
map of string-ACL pairs
- Identification:
- name
string (must be unique within table)
- description
string
- Error Notification:
- LR_fault_status : invalid_ACL_binding
none
- LR_fault_status : unspecified_fault
none
- Common Column:
- other_config
map of string-string pairs
Details
- switch_binding: map of string-Logical_Switch pairs
Maps from an IPv4 or IPv6 address prefix in CIDR notation to a logical switch. Multiple prefixes may map to the same switch. By writing a 32-bit (or 128-bit for v6) address with a /N prefix length, both the router’s interface address and the subnet prefix can be configured. For example, 192.68.1.1/24 creates a /24 subnet for the logical switch attached to the interface and assigns the address 192.68.1.1 to the router interface.
- static_routes: map of string-string pairs
One or more static routes, mapping IP prefixes to next hop IP addresses.
- acl_binding: map of string-ACL pairs
Maps ACLs to logical router interfaces. The router interfaces are indicated using IP address notation, and must be the same interfaces created in the switch_binding column. For example, an ACL could be associated with the logical router interface with an address of 192.68.1.1 as defined in the example above.
Identification:
- name: string (must be unique within table)
Symbolic name for the logical router.
- description: string
An extended description for the logical router.
Error Notification:
An entry in this column indicates to the NVC that the HSC has encountered a fault in configuring state related to the logical router.
- LR_fault_status : invalid_ACL_binding: none
Indicates that an error has occurred in associating an ACL with a logical router port.
- LR_fault_status : unspecified_fault: none
Indicates that an error has occurred in configuring the logical router but that no more specific information is available.
Common Column:
The overall purpose of this column is described under Common Column at the beginning of this document.
other_config: map of string-string pairs
Arp_Sources_Local TABLE
MAC address to be used when a VTEP issues ARP requests on behalf of a logical router.
A distributed logical router is implemented by a set of VTEPs (both hardware VTEPs and vswitches). In order for a given VTEP to populate the local ARP cache for a logical router, it issues ARP requests with a source MAC address that is unique to the VTEP. A single per-VTEP MAC can be re-used across all logical networks. This table contains the MACs that are used by the VTEPs of a given HSC. The table provides the mapping from MAC to physical locator for each VTEP so that replies to the ARP requests can be sent back to the correct VTEP using the appropriate physical locator.
Summary
- src_mac
string
- locator
Physical_Locator
Details
- src_mac: string
The source MAC to be used by a given VTEP.
- locator: Physical_Locator
The Physical_Locator to use for replies to ARP requests from this MAC address.
Arp_Sources_Remote TABLE
MAC address to be used when a remote VTEP issues ARP requests on behalf of a logical router.
This table is the remote counterpart of Arp_sources_local. The NVC writes this table to notify the HSC of the MACs that will be used by remote VTEPs when they issue ARP requests on behalf of a distributed logical router.
Summary
- src_mac
string
- locator
Physical_Locator
Details
- src_mac: string
The source MAC to be used by a given VTEP.
- locator: Physical_Locator
The Physical_Locator to use for replies to ARP requests from this MAC address.
Physical_Locator_Set TABLE
A set of one or more Physical_Locators.
This table exists only because OVSDB does not have a way to express the type ``map from string to one or more Physical_Locator records.’’
Summary
- locators
immutable set of 1 or more Physical_Locators
Details
locators: immutable set of 1 or more Physical_Locators
Physical_Locator TABLE
Identifies an endpoint to which logical switch traffic may be encapsulated and forwarded.
The vxlan_over_ipv4 encapsulation, the only encapsulation defined so far, can use either tunnel key model described in the ``Per Logical-Switch Tunnel Key’’ section in the Logical_Switch table. When the tunnel key per Logical_Switch model is in use, the tunnel_key column in the Logical_Switch table is filled with a VNI and the tunnel_key column in this table is empty; in the key-per-tunnel model, the opposite is true. The former model is older, and thus likely to be more widely supported. See the ``Per Logical-Switch Tunnel Key’’ section in the Logical_Switch table for further discussion of the model.
Summary
- encapsulation_type
immutable string, must be vxlan_over_ipv4
- dst_ip
immutable string
- tunnel_key
optional integer
Details
- encapsulation_type: immutable string, must be vxlan_over_ipv4
The type of tunneling encapsulation.
- dst_ip: immutable string
-
For vxlan_over_ipv4 encapsulation, the IPv4 address of the VXLAN tunnel endpoint.
We expect that this column could be used for IPv4 or IPv6 addresses in encapsulations to be introduced later.
- tunnel_key: optional integer
-
This column is used only in the tunnel key per Logical_Switch+Physical_Locator model (see above).
For vxlan_over_ipv4 encapsulation, when the Logical_Switch+Physical_Locator model is in use, this column is the VXLAN VNI. It must be in the range 0 to 16,777,215.
ACL_entry TABLE
Describes the individual entries that comprise an Access Control List.
Each entry in the table is a single rule to match on certain header fields. While there are a large number of fields that can be matched on, most hardware cannot match on arbitrary combinations of fields. It is common to match on either L2 fields (described below in the L2 group of columns) or L3/L4 fields (the L3/L4 group of columns) but not both. The hardware switch controller may log an error if an ACL entry requires it to match on an incompatible mixture of fields.
Summary
- sequence
integer
- L2 fields:
- source_mac
optional string
- dest_mac
optional string
- ethertype
optional string
- L3/L4 fields:
- source_ip
optional string
- source_mask
optional string
- dest_ip
optional string
- dest_mask
optional string
- protocol
optional integer
- source_port_min
optional integer
- source_port_max
optional integer
- dest_port_min
optional integer
- dest_port_max
optional integer
- tcp_flags
optional integer
- tcp_flags_mask
optional integer
- icmp_type
optional integer
- icmp_code
optional integer
- direction
string, either egress or ingress
- action
string, either deny or permit
- Error Notification:
- acle_fault_status : invalid_acl_entry
none
- acle_fault_status : unspecified_fault
none
Details
- sequence: integer
The sequence number for the ACL entry for the purpose of ordering entries in an ACL. Lower numbered entries are matched before higher numbered entries.
L2 fields:
- source_mac: optional string
Source MAC address, in the form xx:xx:xx:xx:xx:xx
- dest_mac: optional string
Destination MAC address, in the form xx:xx:xx:xx:xx:xx
- ethertype: optional string
Ethertype in hexadecimal, in the form 0xAAAA
L3/L4 fields:
- source_ip: optional string
Source IP address, in the form xx.xx.xx.xx for IPv4 or appropriate colon-separated hexadecimal notation for IPv6.
- source_mask: optional string
Mask that determines which bits of source_ip to match on, in the form xx.xx.xx.xx for IPv4 or appropriate colon-separated hexadecimal notation for IPv6.
- dest_ip: optional string
Destination IP address, in the form xx.xx.xx.xx for IPv4 or appropriate colon-separated hexadecimal notation for IPv6.
- dest_mask: optional string
Mask that determines which bits of dest_ip to match on, in the form xx.xx.xx.xx for IPv4 or appropriate colon-separated hexadecimal notation for IPv6.
- protocol: optional integer
Protocol number in the IPv4 header, or value of the "next header" field in the IPv6 header.
- source_port_min: optional integer
Lower end of the range of source port values. The value specified is included in the range.
- source_port_max: optional integer
Upper end of the range of source port values. The value specified is included in the range.
- dest_port_min: optional integer
Lower end of the range of destination port values. The value specified is included in the range.
- dest_port_max: optional integer
Upper end of the range of destination port values. The value specified is included in the range.
- tcp_flags: optional integer
Integer representing the value of TCP flags to match. For example, the SYN flag is the second least significant bit in the TCP flags. Hence a value of 2 would indicate that the "SYN" flag should be set (assuming an appropriate mask).
- tcp_flags_mask: optional integer
Integer representing the mask to apply when matching TCP flags. For example, a value of 2 would imply that the "SYN" flag should be matched and all other flags ignored.
- icmp_type: optional integer
ICMP type to be matched.
- icmp_code: optional integer
ICMP code to be matched.
- direction: string, either egress or ingress
Direction of traffic to match on the specified port, either "ingress" (toward the logical switch or router) or "egress" (leaving the logical switch or router).
- action: string, either deny or permit
Action to take for this rule, either "permit" or "deny".
Error Notification:
An entry in this column indicates to the NVC that the ACL could not be configured as requested. The switch must clear this column when the error has been cleared.
- acle_fault_status : invalid_acl_entry: none
Indicates that an ACL entry requested by the controller could not be instantiated by the switch, e.g. because it requires an unsupported combination of fields to be matched.
- acle_fault_status : unspecified_fault: none
Indicates that an error has occurred in configuring the ACL entry but no more specific information is available.
Acl Table
Access Control List table. Each ACL is constructed as a set of entries from the ACL_entry table. Packets that are not matched by any entry in the ACL are allowed by default.
Summary
- acl_entries
set of 1 or more ACL_entrys
- acl_name
string (must be unique within table)
- Error Notification:
- acl_fault_status : invalid_acl
none
- acl_fault_status : resource_shortage
none
- acl_fault_status : unspecified_fault
none
Details
- acl_entries: set of 1 or more ACL_entrys
A set of references to entries in the ACL_entry table.
- acl_name: string (must be unique within table)
A human readable name for the ACL, which may (for example) be displayed on the switch CLI.
Error Notification:
An entry in this column indicates to the NVC that the ACL could not be configured as requested. The switch must clear this column when the error has been cleared.
- acl_fault_status : invalid_acl: none
Indicates that an ACL requested by the controller could not be instantiated by the switch, e.g., because it requires an unsupported combination of fields to be matched.
- acl_fault_status : resource_shortage: none
Indicates that an ACL requested by the controller could not be instantiated by the switch due to a shortage of resources (e.g. TCAM space).
- acl_fault_status : unspecified_fault: none
Indicates that an error has occurred in configuring the ACL but no more specific information is available.
Referenced By
ovn-architecture(7), ovn-controller-vtep(8), ovsdb(7), vtep-ctl(8).