
Does somebody have experience with dmraid + Intel Rapid Storage Technology on RedHat 5 for quant development?

(Last Updated On: May 16, 2012)


 

Hi

 

I'm configuring 11 Intel servers (model S2600JF) with 4 x 500 GB Toshiba disks per server that I'd like to set up as RAID0, or maybe RAID5 if I don't lose too much performance to the parity computation. Does anybody manage dmraid (so-called "FakeRAID" or "FirmwareRAID") on RedHat 5 in production? Do I really gain performance from RAID0? And when a RAID5 disk fails, how do I swap the broken disk and rebuild the RAID5 online?

Many thanks for reporting your experiences!

Hi, my usual experience with related configs,

 

* If you need to choose between a vendor-specific fakeraid (BIOS software RAID with an OS-level software RAID add-in) and generic JBOD disks with standard Linux software RAID, the stock Linux SW RAID usually gives you simpler management, fewer dependencies on custom configuration, less tie-in to the 'hardware', and the same performance (see the mdadm sketch after this list). That said, either route is usually functional; it is more a matter of preference.

 

* You don't want 'it all fails if one drive fails'; you probably want a RAID10 config, not a RAID0 config. With 4 drives, RAID5 is a not-great option IMHO: you will very likely get significantly better performance from a 4-drive RAID10 than from RAID5 or RAID6. Additionally, if your chassis supported it, you would get a roughly linear boost by going up to 6 or 8 drives in the RAID10; more spindles means more parallel bandwidth, which helps throughput approach controller saturation rather than being constrained well below the maximum possible. However, if you can only accommodate 4 drives, then you are stuck with 4 drives, clearly.
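
For reference, a minimal mdadm sketch of that 4-drive RAID10 route; the device names, filesystem and mount point below are placeholders, not from this thread, so adjust them to your hardware:

    # create a 4-disk RAID10 array out of whole disks (hypothetical device names)
    mdadm --create /dev/md0 --level=10 --raid-devices=4 /dev/sdb /dev/sdc /dev/sdd /dev/sde
    # put a filesystem on it and mount it as scratch space
    mkfs.ext3 /dev/md0
    mkdir -p /scratch && mount /dev/md0 /scratch
    # watch the initial resync progress
    cat /proc/mdstat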

 

Hope this helps a bit,

 

I was thinking about RAID0 because my end users don't care if they lose some jobs when a disk breaks; they will simply resubmit those jobs. Because of that, I believe I'll get the best performance by using RAID0 on my 4 disks.

 

That said, I'll still spend hours rebuilding and validating the broken server, so I was also looking for a resilient RAID layout with little impact on performance; RAID10 or RAID5 seem like a good compromise to me, with RAID10 probably faster than RAID5. I'll measure both.
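
For example, one crude way to compare the candidate layouts is a direct sequential dd pass against each array; the path and sizes below are only illustrative:

    # sequential write, bypassing the page cache
    dd if=/dev/zero of=/scratch/ddtest bs=1M count=8192 oflag=direct
    # drop caches, then sequential read back
    echo 3 > /proc/sys/vm/drop_caches
    dd if=/scratch/ddtest of=/dev/null bs=1M iflag=direct

Of course a sequential dd only approximates streaming throughput; if the real scratch workload is small random I/O, the RAID5 penalty will be larger than such a test suggests.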

 

Yes I can accommodate just 4 disks, no more.

 

About mdraid: that's Intel's official preference for this FakeRAID technology on RedHat 6, but my end users need RedHat 5 because of their legacy applications.

 

I'd like to get a comment about dmraid from someone who has been using it in production for some years; Googling for HOWTOs on this software didn't turn up much.

 

 

RAID10 is there to protect your time and sanity 🙂 (i.e., to avoid system rebuilds after the inevitable disk failures. Remember, every disk will fail eventually; it is just a matter of how long it takes, so you have to plan for the inevitable 🙂).

 

Re: the pick of MDRaid vs DMRaid. Ultimately, both will work, I think. There may be more effort involved with one than the other: Linux swraid/mdraid is easier to manage, more stable and doesn't require kernel rebuilds etc., while dmraid will likely be more drama to get installed and potentially carries more risk of failures and issues.
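
For what it's worth, the online rebuild you asked about is routine with mdraid; a sketch (again with hypothetical array and device names) looks like:

    # mark the dying member failed (if the kernel has not done so already) and pull it out
    mdadm --manage /dev/md0 --fail /dev/sdc
    mdadm --manage /dev/md0 --remove /dev/sdc
    # physically swap the disk, then add the replacement; the array rebuilds while online
    mdadm --manage /dev/md0 --add /dev/sdc
    cat /proc/mdstat        # shows rebuild progress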

 

I did a quick Google search and found a relevant thread that seems to have a similar (stronger) opinion re: DMRaid.

 

http://www.linuxquestions.org/questions/linux-kernel-70/centos-kernel-upgrade-breaks-dmraid-on-intel-software-raid-638450/

 

However, this is purely a 3rd party ref so I don’t really have personal experience to comment. Ultimately it will be your pick.

 

Good luck!

 

 

 

I have done extensive testing and debugging on Intel’s latest SAS/SATA interface found on the new SandyBridge-EP platforms like the Jefferson Pass board you are using (S2600JF). I realize using dmraid is usually a cost saving measure. Your data is important and if you can afford it I encourage you to invest in a PCIe hardware RAID card like LSI’s 9265-8i. The kernel driver options for the onboard interface are not yet reliable in my opinion, especially in a RHEL 5 variant (RHEL, CentOS, ScientificLinux, etc).

 

With the interface in RSTe mode you use the isci kernel driver. It doesn't appear until RHEL 6.2, or until kernel 2.6.18-234.4.1 in RHEL 5. Its scatter-gather requirements differ from other kernel drivers, and I've seen bug fixes in other areas of the kernel break this driver. In my opinion it's not fully baked, and I have seen problems with it.
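
A quick way to check whether a given RHEL 5 kernel even ships the isci module before betting on RSTe mode (generic commands, nothing Intel-specific assumed):

    uname -r                                      # running kernel version
    modinfo isci                                  # errors out if the module is not shipped
    find /lib/modules/$(uname -r) -name 'isci*'   # look for the module file directly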

 

In ESRT2 mode you will use the MegaSR dmraid-style driver. This is a binary-only driver, not available in source form, which restricts you to kernel patches or upgrades for which the author (LSI/Intel) has released a matching driver version.

 

You invested in these machines I assume to do meaningful computing. You should consider additional investment in a stable storage infrastructure, at least until the isci or MegaSR environment becomes more functional.

 

In addition to the LSI RAID cards, Intel has some custom hardware RAID mezzanine boards designed for the S2600JF that give true hardware RAID without taking up a PCI slot and they use standard MegaRAID kernel drivers.

 

I hope this helps you,

 

I don’t know enough about your use case to give highly specific advice … general advice is that dmraid isn’t good. MD software raid is reasonably good, and well written.

 

As for the drivers, as Jeff notes, they are flaky. We've seen all sorts of interesting half-baked drivers in Linux for the various motherboard functions, not just for EP. Your best bet would be a well-baked card and driver. The Intel motherboards all have mezzanine cards you can run in JBOD or RAID modes, and they are relatively inexpensive.
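
If you do go the mezzanine/add-in card route, a quick sanity check that the LSI-based controller is visible and bound to the standard MegaRAID driver might look like this (the grep patterns are just examples):

    lspci | grep -i -e lsi -e megaraid     # is the RAID chip visible on the PCI bus?
    lsmod | grep -i megaraid               # e.g. megaraid_sas loaded for true HW RAID modes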

 

4 drives isn't a whole lot. RAID5 performance will start at 3/4 of the full RAID0 performance (with 4 drives, a full stripe is 3 data chunks plus 1 parity chunk, so at best you get 3 spindles' worth of data bandwidth instead of 4), and that's only if you are doing full-stripe reads/writes. Do smaller I/O, and performance will suffer.

 

My concern, based on what you are describing, is that I hope you aren't going to run these as a parallel file system. This design (lots of small machines with a small number of disks each) is a bad design pattern for that kind of I/O (yes, even for Hadoop). Usually the motherboard ports are connected to some pretty weak controller chip, or share an oversubscribed PCIe link. We've seen 4-, 6-, and 8-port SATA setups hung off a single/dual-port PCIe-connected controller, where the disks could easily overwhelm the controller (lots of SM boards have had these issues).

 

RAID0 if you simply don’t care about data reliability. RAID10 if you do.

 

Thanks to both of you for your valuable remarks!

 

I'm testing dmraid, and so far the only configuration that worked as expected was RAID0. With RAID5 the performance was very poor, but the system survived when I unplugged one disk; with RAID10 the performance was acceptable (less than RAID0, of course), but Linux got stuck when I unplugged one disk.
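
For anyone reproducing this, the state of the dmraid sets after pulling a disk can be inspected with something along these lines (no specific set names assumed):

    dmraid -s            # discovered RAID sets and their status
    dmraid -r            # member disks dmraid currently sees
    dmsetup status       # device-mapper view of the mapped sets
    cat /proc/partitions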

 

I'm not using these nodes as a parallel filesystem; that would be a really weak architecture, full of single points of failure. They're going to be individual SGE nodes with a local fast scratch FS.

 

I definitely agree with Jeff's suggestion of a dedicated LSI RAID controller with 1 GB of cache, with which I've already had a lot of good experience, or the "poor man's" RAID offered by Intel; both options sound better than dmraid. I think the Intel RAID is already there on my motherboard, because once I modified the BIOS I got a RAID dialogue different from the RST one. What I found appealing in RST, though, is the interesting feature of building 2 different RAIDs on the same disks.

 

When I have something interesting to report I'll update this thread;

see you soon and thanks again for your comments

 

The ESRT2 mode allows making two RAIDs from the same disks, and Linux mdraid does as well (see the sketch below). In my opinion Linux mdraid is higher performing, more flexible and, above all, open (source code, community support, etc.). ESRT2 uses the MegaSR driver, which is closed source (binary only) and can be limiting.
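
As a sketch of the mdraid equivalent of that two-RAIDs-on-the-same-disks feature: partition each disk identically and build one md array per partition set. The device names and RAID levels below are only an example:

    # after creating two partitions on each of the four disks (small sdX1, large sdX2):
    mdadm --create /dev/md0 --level=1  --raid-devices=4 /dev/sdb1 /dev/sdc1 /dev/sdd1 /dev/sde1   # small mirrored set
    mdadm --create /dev/md1 --level=10 --raid-devices=4 /dev/sdb2 /dev/sdc2 /dev/sdd2 /dev/sde2   # large RAID10 scratch set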

 

 

Also, Intel's mezzanine SAS/SATA HBA and HRA options are not "poor man's" parts. They OEM LSI's 6Gb SAS HBA and RAID chips and put them on a custom mezzanine board designed for the S2600JF: high performance, stable, and using long-standing, vetted kernel drivers.
