Infrastructure-as-Code, NETCONF and REST API

Wednesday, September 19, 2018 08:18 +0200

Infrastructure-as-Code, NETCONF and REST API

This is the third blog post in “thinking out loud while preparing Network Infrastructure as Code presentation for the network automation course” series. You might want to start with Network-Infrastructure-as-Code Is Nothing New and Adjusting System State blog posts.

As I described in the previous blog post, the hardest problem any infrastructure-as-code (IaC) tool must solve is “how to adjust current system state to desired state described in state definition file(s)”… preferably without restarting or rebuilding the system.

There are two approaches to adjusting system state:

Data model based approach: overall system state is described as a data model that can be read and manipulated with API calls. NETCONF is often used to manipulate data model of networking devices.
CRUD (Create, Read, Update, Delete) approach where API provides a gazillion calls to list, read, and manipulate individual objects. OpenStack, AWS, NSX, ACI… are typical examples of this approach.

Replacing a Data Model

Implementing IaC for devices that allow you to replace system state is trivial (see previous blog post), although one might get into interesting challenges when dealing with devices where system state is described in arcane domain-specific language (example: Cisco IOS configuration) instead of easy-to-parse data structures (like XML or JSON).

Even if your tool doesn’t generate full system state that can be used as a replacement for existing state it can:

Read system state;
Adjust the resulting data structure based on desired (sub)state of the system;
Replace system state.

Adjusting System State Data Model

Adjusting system state for devices that use a data model based approach but don’t support an equivalent of configure replace functionality is reasonable easy assuming we’re dealing with data structures:

Read system state;
Compare current state with desired state – keep in mind that we’re dealing with data structures, and implementing a recursive data structure diff is a simple programming exercise;
Tell the device to implement the differences between current and desired state.

Obviously this approach only works if the device is able to (A) generate current state as a data structure and (B) accept differences as a data structure and implement them.

Even if you have to deal with devices that use arcane data manipulation languages (DML) like Cisco IOS configuration syntax you can use tools like (now long gone) napalm-yang that parse the configuration into a data model, create a difference between two data models, and transform the differences back into configuration commands.

The CRUD Hell

The worst API model you might have to deal with when implementing infrastructure-as-code is the CRUD approach: device (or system) API has calls to list, inspect, and manipulate individual objects. Using a CRUD API an infrastructure-as-code tool has to:

List existing objects;
Inspect the state of existing objects;
Figure out which objects to delete, modify, or create based on the desired final state of the system;
Perform create/delete/modify operations in just the right order based on inter-object dependencies. For example, you cannot attach a VM to a virtual network before the virtual network is created.

Does this remind you of the way we were configuring network devices in the past… executing carefully orchestrated dance of configuration commands (now API calls) to get from where we were to where we wanted to be? One has to wonder why some people prefer to call this progress and why they’re so obsessed with REST API.

Finally, let me mention that most orchestration systems, cloud management systems, SDN controllers and intent-based networking products I’ve seen use this approach. Go figure.

More Information

We talked about network infrastructure-as-code in the Network Automation Concepts webinar.

5 comments:

Chris Tomkins 19 September 2018 11:21

You're bang on the money here Ivan. Declarative > imperative. Idempotency.

This is why in my experience Ansible works so much better with Junos than with Cisco via NAPALM. Idempotency and atomic commits built in.

Ivan Pepelnjak 19 September 2018 13:04

"Ansible works so much better with Junos than with Cisco via NAPALM" << Yeah, I keep telling people what they should consider when buying new gear, but it's as successful as fighting windmills with RITA (see RFC 2321).

https://blog.ipspace.net/2016/10/network-automation-rfp-requirements.html

Anonymous 19 September 2018 11:23

I think people want to use rest so that there is an ability to maintain consistency of configuration. Cisco IOS CLI doesnt lend itself to consistency i.e what you push is what you get if you implement diff logic for replacing config. The idea is not REST it is about well defined models which allow idempotency in network operations

Ivan Pepelnjak 19 September 2018 13:02

REST is just a pretty inconvenient transport mechanism (because it maintains no state across calls).

The real question is whether you deal with the whole data model at once (which we seem to agree on) or whether you have to carefully move around like ninja in a dark room full of tripwires.

As for Cisco IOS, "configure replace" is a far cry from what Junos or Arista have, but it's continuously getting better... not that I would use it for a new IOS feature without testing it first.

Unknown 19 September 2018 14:19

I work in the Internet of Things space more than Network Automation, but it strikes me that there are a lot of parallels.
NETCONF/YANG is widely acknowledged as a powerful tool. For example, the IETF "YANG of Things" working group would like to see YANG used all the way from SDN down to IoT edge devices.
But yet, the industry lags behind. The oneM2M "standard" gives 3 options for Device Management: TR-069 (BroadBand Forum), OMA-DM (Open Mobile Alliance), and LWM2M (Open Mobile Alliance). All use HTTP-based protocols and CRUD data models.
Sigh.

Replacing a Data Model

Adjusting System State Data Model

The CRUD Hell

More Information

Latest blog posts in Network Infrastructure as Code series

Recent posts in the same categories

automation

AWS

5 comments: