New Concepts in Syslogd2

(Rethinking the 'Common Knowledge' of Syslog Event Processing)

New Concepts

Forward

In Base Feature-Set

Application Mode
Delayed Resolution
Deployment
Extra Facilities
Hostname Management
Input Processing
Multi-Homed Hosts
Network Awareness
Output Processing
Prorammable Interrupts
Static Data Parameters

In Optional Features

Command Tool
Configuration Display
Filters
Advanced Multi-Threading
Name Cache (w/ pre-load)
Named-Pipe Input
Spooling
TCP Support
Text-File Input
Variable Buffer-Lengths

External Links

Syslogd2 Project Site
DBD2 Home Page
DBD2 Project Site

Other References

RFC 3164 (The BSD Syslog Protocol)
RFC 3339 (Internet Time Format)
RFC 5424 (Syslog Version 1)

Focus on Network

Syslogd2 Input Options
Syslogd2 Output Options
Queueing and Data Loss

Forward

[Top of page]

Syslogd2 implements several features based on technologies and techniques never before integraed into a system logging daemon. Some of these features are based on computer-science theory (such as capacity-analysis, and queueing-theory), some on operations-research techniques (such as human-nature-based tendencies and traffic-flow-management) and some features are based on simple common-sense, but ALL are based on experience (specifically MY experience) attempting to manage significant numbers of Linux servers and network devices.
To some extent, Syslogd2 is also somewhat of an experiment to discover what features would be most useful in network- and host-administration personnel as well as the result of about 2 decades of hobby programming...

To fully understand Syslogd2, it is necessary to question every assumption currently held about how syslog is used and configured in today's networks, as compared to its original purpose when it was first devised in the early 1970s. Syslogd2 is more of a 'feature grab-bag' and 'network-data-collector' than it is a 'one-size-fits-all' 'host-based-logger' daemon (like rsyslog or syslog-ng).

This (New Concepts) page attempts to summarize most of the new features and concepts and to provide links to the web pages where each feature or concept is discussed in more detail.

A brief history of the development of syslog processors sice 1970.

Syslog-ng existed for a brief time for Linux, but due to it's radically different configuration structure that only a programmer could love, it was never widely accepted.

Rsyslog introduced a mult-threaded version of a syslog daemon in the early 2000s that supports TCP/IP and IPv6 (and now also supports reading the systemd journal interface to obtain syslog messages). Rsyslog is Linux-only as it relies on system calls that are unique to the Linux OS.

Syslogd2 is being released in 2023/2024 and supports a variety of features. Written in 'C' with no external support libraries required, Syslogd2 is portable to any Unix-like OS. (It HAS been ported to MAX OSX though that port is currently out-of-date.)

It is my hope that the addtional functionality of Syslogd2 and its ability to be ported to other systems will be able to overcome the industry's usual inertia against anything new or different.

That was a pretty brief summary - wasn't it ?

Syslogd2's contribution

Syslogd2 was designed and developed expressly to address all the issues I experienced while trying to use the syslog protocol in both network and host administration environments.

It has the speed to handle busy firewalls,
It has facilities to read and process external text-files
It has filtering capabilities to select messages by key words or phrases
It's filtering capabilities can act to 'guard' the network from excessive traffic by discarding all unwanted messagges instead of sending them to remote hosts.
Filtering allows Syslogd2 to extract and forward only selected events which virtually eliminates network-congestion concerns.
Transmission over TCP with connection-spooling facilitates reliable (and firewall-friendly) delivery of syslog events. Should a network failure occur:
- The connection disruption is detected and traffic can be automatcially spooled until such time as the link is restored.
- Once the link is restored, the traffic is automatically spooled to the originally-intended receiver. All de-spooled traffic is subject to an 'age'check that will discard any traffic that has 'expired' while spooled. The 'age' check is to prevent forwarding spurious alerts about no-longer-relevant network or application alerts.
Syslogd2 can be scaled up to gainfully use hundreds of threads or scaled down to use as few as 2 threads -- or anywhere in between.
Syslogd2 provides excellent support for multi-homed hosts allowing each input source (each text-file, each socket, each named-pipe in addition to the local kernel) to have it's own individualized configuration (and it's own entire threadpool if the administrator so desires).

New Concepts Implemented in the Base Feature-Set

Application Mode

[Top of page]

'Application Mode' refers to the ability of Syslogd2 to run along side of another syslog daemon, sharing the same host yet not interfering with (or being interfered with by) the existing syslog processor. This might be desirable where a network-management data-collection capability is desired on a host that is under the control of a different organization or where the existing syslog processor is providing some output-formatting or other service that Syslogd2 is not designed to support.

This 'application' role allows Syslogd2 to enhance or supplement (not replace) the abilities of the host's existing syslog processor in the role of of syslog data colleciton, processing and logging. For example, Syslogd2 might be deployed on a cloud-based server or a remotely-hosted server to monitor one or more log files, reporting back to 'home base' only selected events instead of the entire contents of facility/priority data streams or it may allow Syslogd2 to be deployed to collect and concentrate TCP syslog streams and log-file input on a host whose native syslog processor does not support TCP connections.

Running Syslogd2 in Applciation Mode is not without drawbacks. One issue is how to set up the data-transfer from the native syslog processor into Syslogd2 where filters and other processing steps can be applied for purposes of event filtering and 'noise reduction' (reducing the congestion of network-links and down-stream processors). Running in 'Application Mode' is the primary use-case for Syslogd2's support of named-pipe input.

Not running as the primary syslog processor for a host is generally a less efficient method of syslog data collection (both because many message will have to be processed twice and because Syslogd2 may not recieve 'clean' or complete data either through administrative configuration error or if the other processor is unable to keep up with the incoming traffic and the system buffers drop messages that it is not fast enough to read).

There may also be instances where a host (perhaps not even a Linux host) processes syslog data in a manner that is 'foreign' to Syslogd2 due either to the proprietary nature of the host operating system or due to some unique capability that Syslogd2 does not possess. (One example I've already found is the unique format of the OSX operationg system's syslog file output).

For all the above reasons, Syslogd2 provides the --ApplicationMode 'macro' that simplifies the process of 'converting' Syslogd2 from a system-default-syslog-service daemon to simply a backgrounding application process. '--ApplicationMode' makes the following changes as if they had been to compile-time settings (meaning they may still be individually over-ridden by run-time settings without running afoul of Syslogd2 first-come-first-serve parsing policy).

Automatic creation of the default IP socket is disabled to prevent conflict with the other syslog processor.
~ --disable DefaultIp
The 'default' IP address may still be used, but the input-socket will have to be explicitly declared.
Automatic creation of the default Linux socket is disabled to prevent conflict with the other syslog processor.
~ --disable Syslog
The 'default' Linux socket may still be used, but the input-socket will have to be explicitly declared.
All Kernel-logging by Syslogd2 is disabled by default, as is activation of the kernel threadpool (if declared at compile-time).
~ --disable KernelThreads, KernelLogging
All User-Terminal logging by Syslogd2 is disabled by default, as is activation of the User threadpool (if declared at compile-time).
~ --disable UserThreads, UserLogging
Logging to the system console is disabled (even if configured). This an attempt to prevent duplicate messages to the console device.
~ --disable Console
Syslogd2 is instructed not to use the 'Last Message Repeated Times...' mode of logging to local files, but to log every message individually no matter how many times it is repeated.
~ --enable AllMessages
IP support and IP Forwarding as well as HouseKeeping and the internal Name-Cache are enabled.
~ --enable Inet, Forwarding, HouseKeeping, NameCache

Application Mode may be invoked in using either of two methods. As a Syslogd2 keyword, it is non-case-sensitive either way:

As an independent command-line parameter on either the actual command-line or from inside the configuration file:
--ApplicationMode
As a global boolean value on the actual command-line or from the configuration file:
~ --enable applicationmode

The promise of Syslogd2 running in ApplicationMode on a UNIX or Linux host is not without potential failure. There are technical and implementation issues that may restrict the ability of a Syslogd2 application to receive syslog data from a legacy syslog processor running on the same host.
Because there is at least one method of communication between rsyslo (legacy syslog processo) and Syslogd2 on Linux, it is feasible to run both rsyslog and Syslogd2 on the same host (if Syslogd2 is in 'ApplicationMode'). This allows Syslogd2 to function as a supplement to rsyslog for network-management purposes (reading log-file (and perhaps IP or kernel input) input while receiving other syslog data from rsyslog). This feasibility is enhanced because rsyslog is designed with pluggable modules for (at least) kernel input and IP support, so it appears that these input functions can be disabled on either system and enabled on the other.

Delayed Resolution

[Top of page]

Traditional (single-threaded) syslog processors (host-loggers) assume that the network will be up and stable at all times so if a host canot be resolved at startup, they mark that host unusable and move on. At no time do they go back and attempt to re-resolve a hostname that did not initialize at startup. A big part of the reason for this is that most traditional syslog processors are not multi-threaded so once they finish startup, they start a processing loop that only terminates when they shut down. To leave that processing loop for any purpose would be to stop processing syslog traffic and to potentially miss incoming traffic due to system-buffer overflows.

Syslogd2 (being multi-threaded) has a specialized threadpool set aside to run background operations while the primary threadpools are focused on processing syslog input. This new threadpool is called a 'housekeeping' threadpool because it does various 'maintenance' chores for the Syslogd2 applicaiton. Two of the 'maintenance chores' that it is called upon to do (by the parent-thread running as the main-scheduler) are to periodically check the network state and run the CheckSources routine and the CheckDesitnations routine. In the event of an 'upwards' network state change (Down->Local or Local->Other), these routines call upon a 2nd pair of routines to 'walk' the list of input and output connections and to resolve any previously unresolved entries. After resolving all entries that can be resolved, the proposed connections are checked for conflicts. Those entries not marked 'in-conflict' either with the network state or with each other, are then opened. Those connections that succeed in being opened at this time (either newly resolved or previously opened, but closed due to communication error) are returned to service -- all while the primary threads continue to process incoming syslog input.

The term 'Delayed Resolution' refers to the (possibly lengthy) delay between system startup and when individual IP hosts can be resolved and activated. Syslogd2's awareness of the state of the network combined with periodic attempts to resolve IP hostnames implements both delayed resolution and standard network-recovery. (Note this can only occur if the network is in state 'Other' if DNS servcie is required to resolve addresses because otherwise the DNS is assumed to be unreachable/unusable.) In addition to just checking and reopening input/output connections, CheckSources and CheckDestinations do other connection-housekeeping chores as well. Among these chores are:

CheckSources verifies that the filesystem entries are present for Linux socket and pipe input. If not, the file-system entries for the entries are re-created.
CheckDestinations looks for spoolfiles and if it finds one for a file that is now open, it schedules the FlushSpoolFiles routine to flush that file to its (now reachable) destination.
CheckDestinations will also re-create (and/or re-open) any output-file (or log-file) that has been deleted since it was last run.

New Concepts

In Base Feature-Set

In Optional Features

External Links

Other References

Focus on Network

Forward

A brief history of the development of syslog processors sice 1970.

Syslogd2's contribution

New Concepts Implemented in the Base Feature-Set

Application Mode

Delayed Resolution

Deployment

Extra Facilities

Hostname Management

Input Processing

Multi-Homed Hosts

Network Awareness

Output Processing

Programmable Interrupts

Static Data Parameters (SD-String)

New Concepts Implemented in Optional Feature-Sets

Command-Tool

Configuration Display

Filters

Advanced Multi-Threading

Integrated Name Cache (with pre-load file)

Named-Pipe Input

Spooling

TCP Support

Text-File Input

Variable Buffer-Lengths