Archive for » November, 2008 «

Thursday, November 20th, 2008 | Author: admin

Setup:

15Mbit download (tests to 11Mbit at http://speedtest.net) via FTTH connection.

46″ 1080P Sony Bravia (not XBR, year-old model)

So, I’ve signed up for the free netflix trial to test out streaming to the 360. Frankly, after renting titles from the video marketplace the netflix version looks horrible in comparison. In subjective terms it looks to be about the same quality as youtube.  If you’re familiar with the netflix instant play option via PC, it looks about the same. In playing, it’s quicker than the xbox video marketplace as it doesn’t have to buffer as much. I suspect that on a standard definition television it would be quite sufficient and look similar to VHS.  It gave me ‘two bars’ on video quality testing. I recently requested an upgrade to 30Mbit down (they offer up to 60Mbit but it’s $100/mo), so hopefully that will go through soon and I can update you on whether that improves things, though I doubt the majority of people have such speeds available in the US.

UPDATE (12/05/08): Ok, so there are a few select titles that they currently offer in HD, which buffers quickly and looks great in comparison to the above. It’s not very straightforward on the netflix page which titles can stream in HD, but basically it seems that you have to go to the ‘blu-ray’ genre and look for the disks that offer ‘play it now’. I guess that would seem to make sense, but I don’t imagine they get their HD streaming content off of the Blu-Ray disc itself so I’m not sure why they’re coupled. For example, they could just as easily offer a link to the HD streaming from the DVD version of the title, or offer a section where you can browse just movies that can stream in HD.  If the plain streaming weren’t so unwatchable I wouldn’t car as much, but at this point the only streaming worth watching are the HD titles offered through Netflix and the titles offered through XBOX Live marketplace.  I believe they’re just getting the hang of this and just came out of beta, so I look forward to improvements in this regard.

Category: Recreation  | Leave a Comment
Wednesday, November 19th, 2008 | Author: admin

Ok, so this is more of a recreational post, but we can have one of these once in awhile, right?  I, like many others, downloaded the much awaited fall XBOX update, dubbed the ‘NXE’, or New XBOX Experience.  I don’t care much for the new avatar system, I could take it or leave it, but it’s not too bad and not really overbearing and ‘in your face’ aside from actually forcing you to create one. If I had to like one thing about it, it’s the fact that my gamer pic is now of the avatar looking heroically upward and to the right, rather than the blue snail that I so entusiastically chose from the anemic default options of the older system.

I haven’t had a lot of time to play with it, but I like the new interface much better than the blades. It opens the functionality up. For example, the ‘Iron Man’ rental from the video marketplace shows up and is visible without being out of place or looking like it’s in a designated ad spot that we’ve all been trained to ignore, whereas before I don’t think most people even realized they could rent movies.

I look forward to trying out the Netflix streaming, too.  I may update this post with my experiences on that, since it seems that most people have been looking forward to that feature the most.

I did run into a bit of a bug, I had previously had it connected to my linux server for streaming my movie collection. It still works, but it had me download the codecs package again, which there was some weirdness with. It acted as though I needed to download it, I got prompted to install it, then it showed it was installed but did not work. Then I went into the prompt to install again, where it was checked as downloaded and installed already. I selected it, and got ‘to use this feature, please launch the game that it was intended for’, or something to that effect. Instead, I opted to reinstall, and that time it actually showed it downloading, installing, and then it worked.

In all, though, I’d say this refresh was a good move.

Category: Recreation  | Leave a Comment
Monday, November 17th, 2008 | Author: admin

What’s up world?  Things aren’t so great, eh?  Well, we feel the pain where I work as well, but it’s not so bad because we’ve always been stretched pretty thin.  Instead, people are leaving for greener pastures.  As for me, I’m sticking around for the moment.

Mainly I just wanted to post an update and stay in the habit of writing. I’ve got some documentation on LVM that I’m working on, but I took a tangent by deciding to refresh my 3D graphics hobby and am creating visuals for the write-up. It will primarily be a primer on the concept of LVM, followed by a how-to and hopefully some performance metrics.

As far as work is concerned, we’ve been working on several projects, such as migrating one of our VMware clusters off of SAN storage and on to Filer with plain old NFS.  The environment was way over-engineered, and we’ve found that we’ve got a huge, expensive piece of storage sitting there almost idle.  So off to the cheaper stuff with free (built-in, that is) de-duplication!  The other major project we’ve been working on is migrating one of our Oracle RAC clusters from one data center to another (both on-site) and in the process going from cheap, low performance SAN to more expensive, high performance SAN.  We’ve done the first portion and the systems are now running over ISL, tomorrow we’ll take a few nodes down and move them,  bring them up in the new data center, then bring the other nodes down and move them.  In order to do this we had to fiber two switches together across rooms to provide the private cluster interconnects. Fun stuff.

I’ve also taken a look at the RHCE book, I’m about a fifth of the way through it, but haven’t done a whole lot as far as prep. Still looking forward to it, though.

Category: Stuff  | Leave a Comment
Thursday, November 06th, 2008 | Author: admin

A few of us from work were invited to meet with Vice Chairman Tom Mendoza from NetApp downtown this morning.  He seems to have sort of a side job being a fairly popular inspirational speaker in the business world. Apparently he stopped by between flights from the east to west coast and the local NetApp folks talked him into holding an informal seminar over breakfast with some of us customers.  There were only 12 or 15 of us, and it was in a lofty conference room with a wonderful city view, which gave us the feeling that it was a special event.

He spoke a lot about NetApp, but I didn’t get the feeling that he was doing it as a salesman so much as he was doing it because they were valid experiences to illustrate his point. He spoke a lot about how they’ve run their business, their culture, and the problems he’s seen in other companies.

He pointed out a few things he’s seen, such as companies saying that people are their greatest resource, but not spending any time in board meetings discussing how they show that as a company.  He talked about how NetApp has a program to let the executives know if you feel that someone has done a good job and given extra effort, and various ways in which they recognize those extraordinary people.  He even talked about the one layoff they had, where 50 of the 70 affected individuals wrote thank-you letters to the company for how they handled it, which was first to let the people know that it wasn’t their fault, that the company had to do it, second to compensate them fairly in their severance, and last to be involved in helping them find jobs elsewhere.  He said that later on, many of the individuals came back.

He also spoke of their business culture, and how they’re more than happy to save their customers money by coming up with new technologies, for example when Oracle asked them for read/write snapshots, creating what they now calll Flex Clone. It allowed Oracle to buy fewer NetApp products, but it made NetApp products better and made them money in the long run.  He spoke about the economic downturn, and how many companies throw their arms around what they’ve got and try to protect it, looking at what they need to cut to stay the same, when they should be meeting, changing, and figuring out new strategies that will allow them to adapt and grow. ‘Either you’re moving forward or you’re moving backward. If you’re standing still then you’re moving backward.’ On this same topic, he spoke about candor and about how it’s crucial to the company, that people shouldn’t be afraid to say what they think, and the productivity that comes along with that.

Last, he spoke about personal goals, and how the majority of people who become successful have them. He detailed a bit about how he thought one should go about managing their goals, and offered to e-mail any of us a more detailed outline  that he’s come up with.

In all, it was a pretty good speech and I’m glad that I went. Part of me did wonder whether it was a roundabout recruiting mechanism, since I’m pretty sure everyone there went away wishing they worked for NetApp, but at the same time  I think some of the information he shared really is valuable if  we implement what we can in our current environments.

Category: Stuff  | Leave a Comment
Wednesday, November 05th, 2008 | Author: admin

In the future I’d like to discuss some tests I’ve done with linux md-raid and hardware raid, as well as write up a few instructional documents on md-raid, but thought it might be interesting to talk a bit about parity before digging into some details on raid 5 performance, caching, and stripe size. This is a pretty basic explanation, but hopefully it will give some insight to those beginner/intermediate folks like me regarding exactly how we’re able to provide fault tolerance and redundancy simply by adding one extra drive to the array.

You may already be familiar with parity or at least heard of it, perhaps in the context of a parity bit, which works in a similar manner but is used for error detection. More on that some other time, perhaps. Most system admins have at least a general idea of what the parity data in a RAID array is, i.e. ‘extra data that can be used to rebuild an array’, but I find it interesting to go through the exercise of exactly how it works.

If you’re only somewhat familiar with how RAID 5 works, you may have at least heard something about XOR calculations or XOR hardware on your controller. XOR is a logic operation, which stands for “exclusive or”, meaning ‘I’ll take this or that but not both’. Without getting too much into what that means, it’s basically just binary addition. In other words, if we were to XOR bits ’0′ and ’0′, we’d get a ’0′ (0 + 0 = 0). If we XOR bits ’1′ and ’0′, we get 1. It doesn’t get tricky until we XOR bits ’1′ and ’1′, which basically rolls the digit over and we get a binary 2, or ’10′. We’re only interested in the least significant bit, however, so in this case 1 + 1 = 0. Using the following four equations, we should be able to XOR anything we need to:

  • 0 + 0 = 0 (nothing)
  • 1 + 0 = 1 (I’ll take this)
  • 0 + 1 = 1 (I’ll take that)
  • 1 + 1 = 0 (but not both)

Now, let’s apply this to our parity striped raid set. In the following example, we’ll look at a single stripe set across three disks. Two hold data and the third holds parity for that data.

If we lose Stripe 1, we can determine what it held by reversing the equation, or solving ? + 0 = 1.  Likewise, if we lose Stripe 2 we can do the same ( 1 + ? = 1). If we lose the Parity stripe we haven’t really lost any data and can just recalculate parity. Now, the interesting thing about the math is that rather than looking at it as, for example, 1 + ? = 1 when we lose Stripe 2, we can restore Stripe 2 by doing an XOR with Stripe 1 and the Parity stripe, 1(Stripe 1) + 1(Parity) = 0(Stripe 2)

Now, in reality, the stripes are much larger than one single bit as described above, you’ll likely have a stripe of 4 kilobytes up to 256 kilobytes or maybe more, but the mechanics are the same.  Let’s upgrade the previous example to a three bit stripe just to show how it works with multiple bits in a stripe.  Stripe 1 will contain ’101′, stripe 2 will contain ’011′. So, to calculate the parity, we match up the bit placements of stripe 1 and 2 and then XOR each set of bits individually. It helps to stack them vertically and work on each column one by one, as the following figure shows:

And again, an example of ‘losing’ stripe 2 and recalculating from parity.

Now, that’s great, but what about four, five, six disk arrays?  Well, once you get the idea of how XOR works, it’s pretty simple. If you understand binary math you can continue to use that to add four, five, six bits together, but another rule of thumb is to think of ’0′ as even and ’1′ as odd. For example, 1+1+1, an odd plus an odd plus an odd will be an odd number, so parity equals 1. Another example, 0+1+0+1, an odd plus an even will be odd, plus an even will again be odd, plus an odd will be even, so parity equals 0.  If that gets too confusing, you can simply count how many ’1′s you have. If you’ve got an odd number of ’1′s, then parity is 1, if you have an even number of ’1′s, parity is 0.

These parity calculations are fairly simple, yet interesting for the fact that we can provide protection for the data on any number of disks by simply adding a single disk.  No need to have a full copy of the data. The caveat, of course is that you can only lose 1 disk per parity stripe. Yes, that’s right, you can have dual parity as well, also known as RAID 6.

Another drawback is that while you’re missing one of your disks any stripe sets that had data on that disk will take a performance hit because they’ll be doing XOR calculations to read the data is lost, rather than just reading it from the now lost disk. Stripes that had parity on the lost disk won’t take a performance hit, but it will take some processing time to rebuild the parity once the disk is replaced. These performance hits are going to be an important part of later discussions regarding raid 5 performance.

To finish off the exersize, I’ve put together this simple RAID 5 array. Each disk has four stripe sets, each stripe set (in matching color), has three data stripes and one parity stripe. You can see that the parity stripe(???) alternates on to a different disk for each stripe, which is the primary difference between RAID 5 and RAID 3, which places all parity stripes on the same disk. If you’d like to test yourself and see just how your data is protected, first calculate the parity stripe for each set, then cover up a whole column (disk) at random and see if you can reconstruct its contents by doing the math.

As always, if anyone is reading this ;-) , feel free to leave your feedback, correct me, whatever.

Category: Storage  | Leave a Comment
Tuesday, November 04th, 2008 | Author: admin

When installing a new Sun M4000 recently, we got a bit of a surprise when we went to perform a wanboot on it. The wanboot binary downloaded, then the miniroot, and as soon as the miniroot completed, we got the following error:

krtld: load_exec: fail to expand cpu/$CPU
krtld: error during initial load/link phase
panic – boot: exitto64 returned from client program
Program terminated

Some searching led me to this document which is a bug report for OpenSolaris. It seems that at least one cause for this error is an out of date wanboot executable. I downloaded the latest Solaris DVD for sparc, and followed this Sun document regarding creating an updated wanboot executable and miniroot, then copied them to the web server that hosts the wanboot data by moving/renaming the old files and putting the new ones in their place.  After that, we were in business.

Monday, November 03rd, 2008 | Author: admin

Dilbert.com

I’ve seen this comic strip quoted several times during my search for information regarding professional IT certifications.  Like most Dilbert comics, it’s memorable in its use of hyperbole to illustrate points that ring true to so many of us out here in the professional world.  This particular strip emphasizes a phenomenon that has troubled our industry for the past ten years or so, that of the ‘paper tiger’, whose list of resume acronyms would put the entire faculty of your local college’s medical school to shame.  The drill is as follows; go to some website or buy some software that has remarkably accurate practice tests, cram for a few days, take the test, get the certificate, rinse and repeat. In a matter of a few weeks anyone remotely intelligent and motivated can rack up a resume that would get their foot in the door pretty much anywhere, while perhaps not really knowing enough to competently do anything… which brings me to the RHCE  (Red Hat Certified Engineer) Exam and how it attempts to circumvent the issue, but perhaps I’m getting ahead of myself. My interest in pursuing the certification came from an entirely different angle.

It started with my interest in professional progression.  I’ve been managing Solaris and Linux systems for several years now, and I find myself wanting to understand more about the deep dark details as I become  comfortable with the day to day issues that arise.  I began my quest for knowledge where I have so many times in the past, with searching for books and other self study materials, and ended up a bit disappointed with the selection.  I found plenty of cure-all books for the UNIX administrator that would serve as great reference material for the day to day tasks like Apache, FTP, NIS, file system management and etc, but not much on the engineering, programming, or architect level.  Somewhere in my searching I came across a book called the Red Hat Certified Engineer Linux Study Guide by Michael Jang.  While the book itself is probably best classified as a good reference of the kind which I described earlier, I began reading reports about the test that caught my interest, and made me want to try it out for myself. At this point I hit a tangent and embarked on a so-called ‘side quest’.

From what I’ve read, the RHCE Exam isn’t simply a brain-dump, cram session, multiple choice test.  It’s a hands-on test of skill, a gauntlet of sorts for the brave and daring Linux administrators of the world to test their mettle.  It’s a full day test. You show up in the morning, sit down in front of a broken server, fix it, eat lunch, come back, and build a new server to a given set of specifications. While there are study materials  that can give you an idea of what to expect and prepare you to pass the test (such as the book I mentioned, and even a Red Hat course), the real beauty of the approach is this: even if you know what’s going to be on the test in advance, you have to actually demonstrate your skills in order to pass.  Furthermore, your grade has nothing to do with whether you know what all of the awk flags do or other such questions that quizzes rely on, the grading is performed solely on the end result, not how you get there.

So I’m currently planning on taking the exam in January, the next available local test date. Not for the acronym, not for the resume bullet, but for the challenge and pure geekdom glory.  In the coming weeks I’ll try to keep you up to date on the status of my preparations (don’t worry, I’ve already begun), and will report on whether it was drake or dragon, whether I became the slayer or the slain.

In the mean time, feel free to share your certification thoughts and opinions. Have you taken the RHCE Exam? Have you gotten another certification that you’ve found useful (or not)?