Tickless lguest

Rusty recently committed my patch to convert lguest over to the modern hrtimers API. This allows guests to run tickless (also known as dynamic ticks), which means that instead of being fed a constant stream of timer ticks, a guest will only receive a timer interrupt when it really needs one. As a bonus, these ticks will likely arrive with a much higher resolution than every 1/1000 s (or whatever HZ was set to).

Tickless operation is important for virtualized systems. Imagine if you had ten guests, each running at 1000HZ. With static ticks, the host would need to generate 10,000 virtual ticks per second, and each guest would be handling 1000 synthetic hardware interrupts/s, the vast majority of which would very likely be unnecessary. Instead, lguest now allows guests to request clock ticks to occur only at some appropriate point in the future, with nanosecond resolution.

A virtual clock event device is instantiated for each running guest. Instead of being a programmable piece of hardware (such as an HPET), the virtual device is quite simply a high-resolution timer attached to the guest. When the guest needs to program a clock tick event, the smarts of which are thankfully buried in the hrtimer code, it issues a new hypercall, HCALL_SET_CLOCKEVENT, with an argument in nanoseconds representing the relative amount of time until the event is required. The hypercall traps into the host, which converts the relative delta into an absolute value (we know when ‘now’ is in the guest’s virtual time, as we’re in its hypercall context, which is similar to being in the syscall context of a userland application), then starts a highres timer running in the host. When this fires, the host simply sets the timer interrupt for the guest. When the guest runs next, its own hrtimer mechanism is then invoked to process the timer interrupt.

Conceptually, it’s quite simple, which keeps the puppies happy. This is due in large part to the great hrtimer subsystem by Thomas Gleixner and Ingo Molnar. Check out the 2006 OLS paper for more info on hrtimers, as well as the LWN articles here and here.

Article on SELinux Networking

Josh Brindle has written an in-depth article, Secure Networking with SELinux, which covers some of the different network security mechanisms in SELinux. Of particular value is its discssion of labeled IPSec, which is currently not well documented. Josh includes some worked examples of use with explanations, and discussion of potential uses:

For example, one possible application of this technology is to have an ‘internal’ and ‘external’ browser on employee workstations. The internal browser would run in a domain that is allowed to access internal company web application servers that contain confidential customer information while the external browser can access the internet. This reduces the risk that rogue internet content can compromise your internal data. This kind of separation would be much more difficult (or impossible) without SELinux’ advanced networking controls.

Linux now has some very powerful and expressive network security capabilities; perhaps unmatched anywhere. Better usability is an important next step with this, and I suspect from comments in the article that Josh may have more for us on this soon.

Mitigation of Samba vulnerabilities by SELinux

Dan Walsh writes in detail about how SELinux mitigates recent vulnerabilities in Samba. The referenced vulnerabilities potentially allow remote arbitrary code execution due to coding errors: exactly the kind of thing SELinux was designed to protect against.

It’s notable is that the SELinux memory execution controls are key here, as they are often the cause of problems with third-party software installation vs. SELinux issues. More often than you’d expect, applications and/or libraries will try to do highly questionable things like making areas of the heap executable, which SELinux will typically prevent. This tends to be seen more in closed software, which is also unfortunately harder to fix. In any case, SELinux is usually doing the right thing for you when this happens, as explained by Ulrich Drepper.

Generally, the best solution for software such as this is to fix it and make it safe; not to disable security protections. Ulrich provides example code of how to make such fixes in the referenced article.

SELinux and Solaris Trusted Extensions

Recently, Glenn Faden of Sun posted a comparison of SELinux and Solaris Trusted Extensions from an MLS point of view.

Karl MacMillan has now published a response, starting with a discussion of classic “trusted operating systems” and their manifest inadequacies. Then, he outlines the role of SELinux in providing a more general and flexible approach to security in mainstream OSs, meeting a wider range of requirements in a more comprehensive manner, and to do this not as a stale fork of an OS, but as a first class component, available by default. Karl also addresses some of the pervasive technical errors in the Sun article, which need rebuttal, even though they sidetrack more essential issues.

On trusted systems:

The worst of it, though, is that these systems were only useful for a small set of customers. Namely, government customers that maintain classified data (you know, “Top Secret”, “Secret”, etc.). The systems weren’t super secure versions of their general purpose counterparts as many people mistakenly assume. Instead, they were only secure in terms of a very narrow set of requirements.

Indeed, these old MLS models are full of problems, although generally mitigated with subsequent band-aids, people may be surprised to know, for example, that one of the foundational MLS policies, BLP, provides no integrity at all and can be trivially subverted.

Frederick Cohen, in his now fascinatingly dated A Short Course on Computer Viruses describes with great simplicity a virus which would allow the least trusted user on a BLP system to infect software run by the most trusted user. This was not exploiting an implementation flaw; merely an inherent limitation of the narrowly focused security policy. Another feature of classic MLS or trusted systems is that a trusted application can be defined as one that violates the security policy. That is, the policy is so inflexible that you must break it with a privileged application to make the system work.

This is not to get into a critique of MLS, but to highlight the role of SELinux in providing a flexible security framework with integrity built in from the ground up, and that thinking in these areas has advanced significantly since the 1970s and 1980s.

There are still some cases where MLS is indicated, particularly where there are very large numbers of distinct compartments, and for these cases, SELinux provides an integrated MLS capability. Typically, however, users who claim to need MLS quite often don’t—it’s all they’ve ever known—and they’re better off with a more flexible security policy which meets their specific requirements and does not need to be subverted by trusted applications to work.

Framing a comparison of SELinux and Solaris Trusted Extensions in terms of how each meets classic MLS requirements ignores the essential issue of SELinux as a fundamental paradigm shift in security. With SELinux, mandatory access control is available for virtually any class of user.

The case for generalized MAC is compelling, and I would invite the OpenSolaris project to consider integrating the Flask model (of which SELinux is an implementation) into their OS.

The SE-BSD and SE-Darwin efforts prove that SELinux components can be integrated into other OSs (where licensing allows), or otherwise re-implemented. I believe this would further improve and refine the technology due to greater diversity of implementations, with the potential to standardize flexible mandatory access control, similarly to the rough consensus of DAC. We’d then have greater interoperability and normalization of strong security mechanisms, which would be greatly beneficial to general users and thus computer security in general.

SELinux Developer Summit 2007

The SELinux Symposium proper finished yesterday, with another WIP session, then some interesting applications of SELinux talks, and finally, talks on MLS. It’s interesting to see the conference evolve from being mostly about research and theory, to having an increasing orientation toward case studies and similarly practical topic areas.

Daniel Frye, an IBM VP, gave the second keynote and highlighted the growth of Linux as an enterprise OS from a security point of view, and the persistent challenge to make security compelling outside of the public sector.

Something which should help general users is setroubleshoot, which is now shipping in the RHEL5 and Fedora distros. It was developed by John Dennis, who gave a talk at the symposium. setroubleshoot detects when SELinux encounters a problem and tries to explain it to the user and provide a course of action to follow. It does this by popping an alert up in the system tray and optionally sending an email notification:

setroubleshoot_email

The symposium concluded with the usual drawing of prizes, this time including iPods, SELinux books, RHEL5, a Mac Mini and a HP laptop. Guess which one everyone wanted.

Today is the Developer Summit: a focused discussion on current and future directions by around twenty of the core developers.

SELinux Developer Summit

In the photo, Karl MacMillan (2nd from the left) is talking, which is not unusual.

SELinux Symposium 2007

I’m in Baltimore for the 2007 SELinux Symposium. The past two days have been for tutorials, with two days now for the conference, and then a final developer summit day.

SELinux Symposium 2007

This morning’s keynote featured a talk by Richard Schaeffer, Director of Information Assurance at the NSA. Richard has been in this business for a very long time, and he provided some very interesting perspectives on computer security (I think the slides will eventually go up on the web site). We then had two sessions of solid technical talks, and are currently in the first of two WIP sessions. There’s a lot of interesting work happening now extending SELinux out past the base OS, with increasingly mature analysis and development tools, as well as continued refinements to the core technology.

Chris Vance’s SEDarwin talk was particularly interesting: apparently the next version of OSX will ship with the TrustedBSD MAC framework. This won’t initially include the SE Darwin work, but hopefully it’ll be possible to get it running without too much trouble.

There’s also good progress being made on extending SELinux to the desktop, as described in NSA talks on GConf integration and X.org integration. Xinwen Zhang of Samsung gave an interesting talk on extending SELinux to mobile platforms (such as cell phones), and related research into platform integrity.

The SELinux community is growing and looking very healthy, although I think we can still do a lot more to encourage wider participation.

Snippets

Steve Rostedt and Glauber de Oliveira Costa have just posted initial patches for a 64-bit (x86-64) version of lguest. Looks like the next steps will be to consolidate the code into common and per-arch components. Initial feedback from Rusty seems good.

I’ve been working on converting lguest over to the new clockevent and dyntick frameworks. (Thomas Gleixner’s OLS paper, Hrtimers and Beyond: Transforming the Linux Time Subsystems, is also a great reference on the topic).

Tickless operation is particularly useful for virtual machines, allowing clock events (for timers etc.) to be programmed on demand, with events only being delivered to VMs as required, rather than say, generating a synthetic tick stream for each VM. (Or in the case of lguest, switching out of each guest on each host clock tick). It’ll be interesting to re-run the networking benchmarks again with tickless & high-resolution timer support.

There may be some scope to consolidate common clock code between several HV projects (Xen & lguest have nearly identical clockevent code in progress), although it’s not entirely clear yet how much can be usefully gained, as the new clock APIs make the client code fairly simple.

Máirín Duffy has created a new logo for lguest (also now known as the puppyvisor):

new lguest logo

Via Val Henson: some hilarious slides from a talk on network protocols by Radia Perlman. I’m sad to say I haven’t seen Radia give a talk as yet.

The SELinux Symposium is on next week!

Snippets

The rustyvisor is now segmentless, paving the way for x86_64, nested interrupts (useful for oprofile) and probably simpler guest SMP. It’s amazing how much can happen in Linux over a weekend. I was happily surprised to see my simple patch to enable bridging was applied in the midst of the rewrite.

Early registration for the 2007 SELinux Symposium ends in a few days. The WIP program is now quite full — I’ll be interested to see KaiGai Kohei’s talk on what’s happening with SELinux in Japan.