TechOpsGuys.com Diggin' technology every day

June 4, 2013

InfoWorld suggests radical Windows 8 changes

Filed under: General — Tags: , — Nate @ 8:46 am

Saw this come across on Slashdot: an article over at InfoWorld on how Microsoft can fix Windows 8.

They suggest ripping out almost all of the new stuff (as defaults) and replacing it with a bunch of new options that users can pick from.

Perhaps I am lucky in that I’ve never used Windows 8 (I briefly held an MS Surface RT in my hands – a friend who is an MS employee got one for free, as did all employees I believe, and handed it to me to show me some pictures on it).

Some of the suggestions from InfoWorld sound pretty good to me, though it’s hard to have a firm opinion since I’ve never used the Metro UI (oh, sorry, they changed the name to something else).

Windows 8 (as it stands today) certainly sounds pretty terrible from a UI standpoint. The only positive I have read about Windows 8 is that people say it is faster. That isn’t worth much these days – machines have been fast enough for many years (which at least in part has led to the relative stagnation of the PC market). My computers have been fast enough for years (the laptop I am typing on is almost 3 years old, and I plan to keep it around for at least another year as my primary machine — I have another year of on-site support so I’m covered from that angle).

It has been interesting to see that, really since XP was released, there hasn’t been anything truly exciting on the Windows desktop front – it’s a mature product (the results show it: much like the economy, pretty much every OS launch they’ve done has had a weaker reception than the previous one – Windows 7 was sort of an exception among the hard core community, but in a broader sense it still seemed weak). It’s come a long way from the mess many of us dealt with in the 90s (instability in NT4 was one big driver for me to attempt Linux on my primary desktop 15 years ago, and I’m still with Linux today).

I don’t use Windows enough to be able to leverage the new features. I’m still used to the XP interface, so I am not fond of many of the new UI things MS has come up with over the years. Since I don’t use it much, it’s not critical.

The last time I did use Windows seriously was at a few different companies where I had Windows as my primary desktop. But you probably wouldn’t have known it if you saw it: it was customized with Cygwin and Blackbox for Windows. The most recent stint was about three years ago (the company was still on XP at the time). Most of the time my screen was filled with rxvt terminals (there is a native Windows port of rxvt in Cygwin that works wonderfully) and Firefox. Sometimes I had Outlook open, or Visio, or in rare cases IE.

Not even the helpdesk IT guy could figure my system out (“Can you launch Control Panel for me?”). It gave me a nice Linux look & feel (I would have killed for proper virtual desktop edge flipping but never found a solution for that) alongside the common Windows apps.
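For the curious, this is not my exact setup from back then, just a sketch of the sort of thing I mean – a Cygwin shortcut launching the native rxvt with a dark, Linux-ish look and a login shell (the font spec and geometry are just placeholders):

rxvt -geometry 120x40 -bg black -fg white -sl 5000 \
     -fn "Lucida Console-12" -e /bin/bash --login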

Ironically enough, I’ve purchased more copies of Windows 7 (I think I have 7 now – 2 or 3 licenses are not in use yet – I stocked up so I wouldn’t have to worry about Windows 8 for a long time) than all previous MS operating systems combined. I’ve bought more Microsoft software in the past 3-4 years (Visio Pro 2010 is another one) than in the previous decade combined. As my close friends will attest, I’m sure, I have not been a “hater” of Microsoft for some time now. (Twelve years ago I threatened to quit if they upgraded from NT4 to Windows 2000 – and they didn’t, at least not as long as I was there. Those were the days when I was pretty hard core anti-MS – I was working on getting Samba-TNG and LDAP to replace NT4. I never deployed the solution, and today of course I wouldn’t bother.)

Some new Linux UIs suck too

Microsoft is not alone in crappy UIs though. Linux is right up there too (many would probably argue it always was, and that very well could be true, though I myself was fine with what I have used over the years, from KDE 0.x to AfterStep to GNOME 1.x/2.x). GNOME 3 (and the new Unity stuff from Ubuntu) looks at least as terrible as the new Microsoft stuff (if not more so).

I really don’t like how organizations are trying to unify the UI between mobile and PC. Well maybe if they did it right I’d like it (not knowing what “right” would be myself).

By the same notion I find it ludicrous that LG would want to put WebOS on a TV! Maybe they know something I don’t, though, and they are actually going to accomplish something positive. I love WebOS (well, the concept – the implementation needs a lot of work and billions of investment to make it competitive), don’t get me wrong, but I just don’t see any advantage to WebOS on a device like a TV. The one exception is ecosystem – if there is an ecosystem of WebOS devices that can seamlessly inter-operate with each other. There isn’t such an ecosystem today; what’s left has been a rotting corpse for the past two years (yes, I still use my HP Pre3 and Touchpad daily). There’s no sign LG has a serious interest in building such an ecosystem, and even if they did, there’s no indication they have the resources to pull it off (I’d wager they don’t).

I haven’t used Unity, but last weekend I did install Debian 7 on my server at home (upgraded from 6). 99% of the time, from a UI perspective, this system just cycles through tens of thousands of images as a massive slide show (at some point I plan to get a 40″+ screen and hang it on my wall as a full-sized digital picture frame; I downloaded thousands of nice 1080p images from InterfaceLIFT as part of the library).

I was happy to see Debian 7 included a “GNOME 2 like” option, as a moderately customized GNOME 2 is really what I am used to, and I have absolutely, positively no interest in changing it.

It gets mostly there, maybe 50-75% of the way. The first thing I noticed was that the new GNOME did not seem to import any of the previous settings. I got a stock look – stock wallpaper, stock settings, and no desktop icons(?). I tried to right click on the desktop to change the wallpaper – that didn’t work. I tried to right click on the menu bar to add some widgets – that didn’t work either. I went from 0 to very annoyed almost immediately. This was with the “compatibility” GNOME desktop! Imagine if I had tried to log in to regular GNOME 3 – I probably would have thrown my laptop against the wall before it finished logging in! 🙂 (fortunately for my laptop’s sake I have never gotten to that point)

Eventually I found the way to restore the desktop icons and the right click on the desktop, and I managed to set one of my wonderful high-res NSFW desktop backgrounds. I still can’t add widgets to the menu bar – I assume it is not possible. I haven’t checked whether I can do virtual desktop edge flipping with Brightside (or with something built in); I’d wager that doesn’t work either.
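For the record, on a GNOME 3.4-era desktop like Debian 7’s, the usual way to get the desktop icons and a changeable wallpaper back is via gsettings – a sketch, not necessarily the exact route I took, and the image path is obviously made up:

gsettings set org.gnome.desktop.background show-desktop-icons true
gsettings set org.gnome.desktop.background picture-uri 'file:///home/nate/walls/nice-1080p.jpg'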

I’m not sure what I will do on my main laptop/desktop, which run Ubuntu 10.04, now unsupported. I hear there are distros/packages out there that are continuing to maintain/upgrade the old GNOME 2 stuff (or have replaced GNOME 3’s UI with a GNOME 2 style), so I will probably have to look into that; maybe it will be easy to integrate into Debian or Ubuntu 12.04 (or both).

I saw a fantastic comment on Slashdot recently that perfectly describes the typical OSS developer’s attitude on this stuff:

[..]

What X11 is, is old. And developers are bored with it. And they want something new and shiny and a chance to play with the hardware without abstraction throwing a wet blanket over their benchmark scores.

The benchmark of success for Wayland is that _users_ don’t actually notice that anything changed. They’ll fall short of that benchmark because too many people like using X11, and even the backward compatibility inevitably will cause headaches.

But developers will enjoy it more, and in the FOSS world those are the only consumers that matter.

(the last sentence especially)

That was in a conversation related to replacing X11 (the main GUI base for Linux) with something completely different – Wayland, apparently being developed by some of the same folks that worked on X11 – which has been under development for many, many years. Myself, I have no issues with X11; it works fine for me. The last time I had major issues with X11 was probably 10+ years ago.

As someone who has worked closely with developers for the past 13 years, I see a lot of this first hand. Oftentimes the outcome is good; many other times, not so much.

One system I worked with was so architecturally complex that two people on my team left the company within a year of starting, and their primary complaint was that the application was too complicated to learn (they had been working with it almost daily for their entire time there). It was complex for sure (many, many sleepless nights and long outages too) – though it didn’t make me want to throw my laptop against the wall like Chef does.

In the case of Microsoft, I found it really funny that one of (if not the) main managers behind Windows 8 suddenly resigned mere weeks after the Windows 8 launch.

June 2, 2013

Travel tips for Washington DC area?

Filed under: General — Nate @ 8:38 am

I am planning on being in the Washington DC area next week to visit a friend I haven’t seen in a couple of years. I have another friend in that area and will visit them too.

I’ve never been there before, so if anyone knows of something/someplace cool to visit please send a note my way! It doesn’t have to be in DC – anything within, say, a 2 or maybe even 3 hour drive is fine.

I arrive at 10 AM on Saturday and can’t check in till 3 PM, so my first thought to kill some time is to drive to Philadelphia and grab one of those famous original cheesesteak sandwiches at either Geno’s Steaks or Pat’s King of Steaks (which are apparently right across the street from each other).

Five hours round trip is a long way to drive, but in the grand scheme of life, who knows if I’ll ever be in the area again, so I figure it’s worth it. It would be a bonus if I could find someplace else to visit and take some pictures (preferably with a nice view – my camera has a 42X zoom). I’m not much for historical stuff or museums (though I may make some exceptions on this trip).

I browsed ~250 potential places on Trip Advisor in Philadelphia but sadly did not see anything that really interested me(except maybe this). Given the sheer number of ideas on that site I figure it may be difficult to find things on other sites that aren’t just repeats.

One day during the week(perhaps Sunday the 9th) I plan to visit Norfolk, VA – a full six hours round trip. But it looks like it will be worth it too — mainly to see the military stuff there. Of the three major locations it’s the one I am most looking forward to.

Possibilities in Norfolk include

One day hit Baltimore up for their Blue Crab, possibilities for this trip include

Then the rest of the time in DC – most of these are places to visit just so I can say I visited them; I’m really not excited about any of them (I specifically avoided any places that don’t allow pictures, like the Mint) –

Hopefully I can easily hit all the above sites in less than one day.

  • Sweetwater Tavern – local friend says there is good food there
  • Ted’s Montana Grill – enjoyed this place when I was in Atlanta – it was the first time I had Buffalo (that I can recall).
  • Tilted Kilt would be nice but is ~70 miles away so probably will wait till I’m in Atlanta next to hit that place up.

 

I’ll likely be working remotely for a couple days while there, not sure yet.

May 28, 2013

3PAR up 82% YoY – $1 Billion run rate

Filed under: Storage — Tags: — Nate @ 8:49 am

I came across this article on The Register which covered some of HP’s storage woes (short story: legacy storage is nose diving and 3PAR is shining). El Reg linked to the conference call transcript, and I just ran a quick keyword search for 3PAR and saw this:

This has been one of our most successful product introductions and 3PAR has now exceeded the $1 billion run-rate revenue mark.

[..]

Converged storage products were up 48% year-over-year and within that 3PAR was up 82%

Congratulations 3PAR! Woohoo! All of us over here at Techopsguys are really proud of you – keep up the good work! <voice=”Scotty”>Almost brings a tear to me eye.</voice>

For comparison, I dug up the 3PAR results on archive.org for the quarter immediately before they were acquired:

3PAR® (NYSE: PAR), the leading global provider of utility storage, today reported results for the first quarter of fiscal year 2011, which ended June 30th, 2010. Revenue for the first quarter was $54.3 million, an increase of 22% as compared to revenue of $44.5 million for the same period in the prior year, and an increase of 1% as compared to $53.7 million in the prior quarter, which ended March 31st, 2010.

I can’t help but wonder how well Compellent is doing for Dell these days by contrast, since Dell withdrew from the bidding war with HP for 3PAR and went for Compellent instead (side note: I once saw some value in Compellent as an alternative to 3PAR, but that all went away with the 3PAR 7000-series). I looked at the transcript for Dell’s latest conference call and the only thing they touch on about storage is a 10% decline, with no mention of any product lines as far as I could tell.

May 23, 2013

3PAR 7400 SSD SPC-1

Filed under: Storage — Tags: , , — Nate @ 10:41 am

I’ve been waiting to see these final results for a while, and now they are out! The numbers(performance + cost + latency) are actually better than I was expecting.

You can see a massive write up I did on this platform when it was released last year.

(last minute edit to add a new Huawei result that was released yesterday)

(more last minute edits to add an HP P6500 EVA SPC-1E result)

SPC-1 Recap

I’ll say this again in case this happens to be read by someone who is new here. Myself, I see value in SPC-1 as it provides a common playing field for reporting on performance in random transactional workloads (the vast majority of workloads are transactional). On top of the level playing field, the more interesting stuff comes in the disclosures of the various vendors. You get to see things like:

  • Cost (SPEC SFS, for example, doesn’t provide this, so vendors end up claiming high performance relative to others while sitting at a massive cost premium they never disclose – which is very sad)
  • Utilization (SPC-1 minimum protected utilization is 55%)
  • Configuration complexity (only available in the longer full disclosure report)
  • Other compromises the vendor might have made (see the note about disabling cache mirroring)
  • 3 year 24×7 4 hour on site hardware support costs

There is a brief executive summary as well as what is normally a 50-75 page full disclosure report with the nitty gritty details.

SPC-1 also has maximum latency requirements – no I/O request can take longer than 30ms to serve or the test is invalid.

There is another test suite – SPC-2 – which tests throughput in various ways. Far fewer systems participate in that test (3PAR never has, though I’d certainly like them to).

Having gone through several storage purchases over the years, I can say from personal experience it is a huge pain to try to evaluate gear under real workloads – oftentimes vendors don’t even want to provide evaluation gear (that is in fact a large part of why I am a 3PAR customer today). Even if you do manage to get something in house to test, there are many products out there, with wide-ranging performance/utilization ratios. At least with something like SPC-1 you can get some idea how a system performs relative to other systems at non-trivial utilization rates. This example is rather extreme but is a good illustration.

I have no doubt the test is far from perfect, but in my opinion at least it’s far better than the alternatives, like people running 100% read tests with IOMeter to show they can get 1 million IOPS.

I find it quite strange that none of the new SSD startups have participated in SPC-1. I’ve talked to a couple of different ones and they don’t like the test; they give the usual lines: it’s not real world, customers should take the gear and test it out themselves. Typical stuff. That usually means they would score poorly – especially those that leverage SSD as a cache tier, since with SPC-1’s high utilization rates you are quite likely to blow out that tier, and once that happens performance tanks. I have heard reports of some of these guys getting their systems yanked out of production because they fail to perform after utilization goes up. The system shines like a star during a brief evaluation – then after several months of usage and increasing utilization, performance no longer holds up.

One person said their system is optimized for multiple workloads and SPC-1 is a single workload. I don’t really agree with that – SPC-1 does a ton of reads and writes all over the system, usually from multiple servers simultaneously. I look to 3PAR specifically, who have been touting multiple-workload (and mixed-workload) support since their first array was released more than a decade ago. They have participated in SPC-1 for over a decade as well, so arguments that testing is too expensive etc. don’t hold water either. They did it when they were small, on systems designed from the ground up for multiple workloads (not just riding a wave of fast underlying storage and hoping that can carry them); these new small folks can do it too. If they can come up with a better test with similar disclosures, I’m all ears too.

3PAR Architecture with mixed workloads

The one place where I think SPC-1 could be improved is in failure testing. Testing a system in a degraded state to see how it performs.

The results below are from what I could find of all-SSD SPC-1 results. If there are one or more I have missed (other than TMS, see note below), let me know. I did not include the IBM servers with SSD, since those are.. servers.

Test Dates

System Name | Date Tested
HP 3PAR 7400 | May 23, 2013
HP P6500 EVA (SPC-1E) | February 17, 2012
IBM Storwize V7000 | June 4, 2012
HDS Unified Storage 150 | March 26, 2013
Huawei OceanStor Dorado2100 G2 | May 22, 2013
Huawei OceanStor Dorado5100 | August 13, 2012

I left out the really old TMS (now IBM) SPC-1 results as they were from 2011, too old for a worthwhile comparison.

Performance / Latency

System Name | SPC-1 IOPS | Avg Latency (all utilization levels) | Avg Latency (max utilization) | # of times above 1ms latency | # of SSDs
HP 3PAR 7400 | 258,078 | 0.66ms | 0.86ms | 0 / 15 | 32x 200GB
HP P6500 EVA (SPC-1E) | 20,003 | 4.01ms | 11.23ms | 13 / 15 | 8x 200GB
IBM Storwize V7000 | 120,492 | 2.6ms | 4.32ms | 15 / 15 | 18x 200GB
HDS Unified Storage 150 | 125,018 | 0.86ms | 1.09ms | 12 / 15 | 20x 200GB
Huawei OceanStor Dorado2100 G2 | 400,587 | 0.60ms | 0.75ms | 0 / 15 | 50x 200GB
Huawei OceanStor Dorado5100 | 600,052 | 0.87ms | 1.09ms | 7 / 15 | 96x 200GB

A couple of my own data points:

  • Avg latency (all utilization levels) – I just took the aggregate latency of “All ASUs” for each of the utilization levels and divided it by 6 (the number of utilization levels)
  • Number of times above 1ms of latency – I just counted the number of cells in the I/O throughput table for each of the ASUs (15 cells total) where the test reported above 1ms of latency (a quick sketch of the arithmetic follows below)
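To make that averaging concrete, here is a quick sketch – the six numbers below are made up, not pulled from any of the reports, purely to show the arithmetic:

# six hypothetical "All ASUs" average latencies (ms), one per utilization level
echo "0.45 0.52 0.58 0.66 0.74 0.86" | \
  awk '{ for (i = 1; i <= NF; i++) sum += $i; printf "avg across %d levels: %.2f ms\n", NF, sum / NF }'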

Cost

System Name | Total Cost | Cost per SPC-1 IOP | Cost per Usable TB
HP 3PAR 7400 | $148,737 | $0.58 | $133,019
HP P6500 EVA (SPC-1E) | $130,982 | $6.55 | $260,239
IBM Storwize V7000 | $181,029 | $1.50 | $121,389
HDS Unified Storage 150 | $198,367 | $1.59 | $118,236
Huawei OceanStor Dorado2100 G2 | $227,062 | $0.57 | $61,186
Huawei OceanStor Dorado5100 | $488,617 | $0.81 | $77,681

Capacity Utilization

System Name | Raw Capacity | Usable Capacity | Protected Application Utilization
HP 3PAR 7400 | 3,250 GB | 1,159 GB | 70.46%
HP P6500 EVA (SPC-1E) | 1,600 GB | 515 GB | 64.41%
IBM Storwize V7000 | 3,600 GB | 1,546 GB | 84.87%
HDS Unified Storage 150 | 3,999 GB | 1,717 GB | 85.90%
Huawei OceanStor Dorado2100 G2 | 10,002 GB | 3,801 GB | 75.97%
Huawei OceanStor Dorado5100 | 19,204 GB | 6,442 GB | 67.09%

Thoughts

The new utilization charts in the latest 3PAR/Huawei tests are quite nice to see – really good illustrations of where the space is being used. They consume a full 3 pages in the executive summary. I wish SPC would go back and revise previous reports so they have these new, easier forms of disclosure in them. The data is there for users to compute on their own.

HP EVA

This is an SPC-1E result rather than SPC-1 – I believe the workload is the same(?), they just measure power draw in addition to everything else. The stark contrast between the new 3PAR and the older P6500 is remarkable from every angle, whether it is cost, performance, capacity, or latency. Any way you slice it (well, except power – I am sure the 3PAR draws more power 🙂 )

It is somewhat interesting in the power results for the P6500 that there is only a 16 watt difference between 0% load and 100% load.

I noticed that the P6500 is no longer being sold (P6550 was released to replace it – and the 3PAR 7000-series was released to replace the P6550 which is still being sold).

Huawei

While I don’t expect Huawei to be a common rival for the other three outside of China, I find their configuration very curious. On the 5100, with such a large number of apparently low cost SLC(!) SSDs and “short stroking” (even though there are no spindles, I guess the term can still apply), they have managed to provide a significant amount of performance at a reasonable cost. I am confused, though: they claim SLC, yet they have so many disks (you’d think you would need fewer with SLC), and at a much lower cost. Doesn’t compute..

No software

Huawei appears to have absolutely no software options for these products – no thin provisioning, no snapshots, no replication, nothing. Usually vendors don’t include any software options as part of the testing since they are not used. In this case the options don’t appear to exist at all.

They seem to be more in line with something like the LSI/NetApp E-Series, or Infortrend, or something like that, rather than an enterprise storage system. Though looking at Infortrend’s site earlier this morning shows them supporting thin provisioning, snapshots, and replication on some arrays. Even NetApp seems to include thin provisioning on their E-Series.

3PAR

3PAR’s metadata

3PAR’s utilization in this test is hampered by (relatively) excessive metadata. The utilization results show only a 7% unused storage ratio, which on the surface is an excellent number. But this number excludes metadata, which in this case is 13% (418GB) of the system. Given the small capacity of the system this has a significant impact on utilization (compared to 3PAR’s past results). They are working to improve this.

The next largest metadata size among the above systems is IBM’s, at only 1GB of metadata (about 99.8% less than 3PAR). I would be surprised if 3PAR were not able to significantly slash the metadata size in the future.

3PAR 7400 SSD SPC-1 Configured Storage Capacity (one of the new charts from the SPC-1 report)

In the grand scheme of things this problem is pretty trivial. It’s not as if the meta data scales linearly with the system.

Only quad controller system

3PAR is the only SSD solution above tested with 4 controllers (totaling 4 Gen4 ASICs, 24 x 1.8GHz Xeon CPU cores, 64GB of data cache, and 32GB of control cache), meaning that with their Persistent Cache technology (which is included at no extra cost) you can lose a controller and keep a fully protected and mirrored write cache. I don’t believe any of the other systems are even capable of such a configuration, regardless of cost.

3PAR Persistent Cache mirrors cache from a degraded controller pair to another pair in the cluster automatically.

The 7400 managed to stay below 1 millisecond response times even at maximum utilization which is quite impressive.

Thin provisioning built in

The new license model of the 3PAR 7000 series means this is the first SPC-1 result to include thin provisioning, for a 3PAR system at least. I’m sure they did not use thin provisioning (there’s no point when you’re driving to max utilization), but from a cost perspective it is good to keep in mind. In the past thin provisioning would add significant cost to a 3PAR system. I believe thin provisioning is still a separate license on the P10000-series (though I would not be surprised if that changes as well).

Low cost model

They managed to do all of this while remaining a lower cost offering than the competition – the economics of this new 7000 series are remarkable.

IBM’s poor latency

IBM’s V7000 latency is really terrible relative to HDS and HP. I guess that is one reason they bought TMS. Though it may take some time for them to integrate TMS technology (assuming they even try) so that it has similar software/availability capabilities to their main enterprise offerings.

Conclusion

With these results I believe 3PAR is showing that they too can easily compete for all-SSD market opportunities without requiring excessive amounts of rack space or power circuits, as some of their previous systems did. All of that performance (only 32 of the 48 drive bays are occupied!) in a small 4U package. Previously you’d likely be looking at an absolute minimum of half a rack!

I don’t know whether or not 3PAR will release performance results for the 7000 series on spinning rust; it’s not too important at this point though. The system architecture is distributed and they have proven time and again that they can drive high utilization, so it’s just a matter of knowing the performance capacity of the controllers (which we have here) and throwing as much disk as you want at it. The 7400 tops out at 480 disks at the moment – even if you loaded it up with 15k spindles you wouldn’t come close to the peak performance of the controllers.

It is, of course, nice to see 3PAR trouncing the primary competition in price, performance, and latency. They have some work to do on utilization, as mentioned above.

May 21, 2013

SHOCKER! Power grid vulnerable to Cyberattack!

Filed under: Security — Tags: , — Nate @ 9:57 pm

Yeah, it shouldn’t be news.. but I guess I am sort of glad it is making some sort of headline. I have written in the past about how I think the concept of a smart grid is terrible due to security concerns. I just have no confidence in today’s security technology to properly secure such a system. If we can’t properly secure our bank transactions (my main credit card was compromised for at least the 2nd or 3rd time this year, and I am careful), how can anyone expect to secure the grid?

Just saw a new post on Slashdot which points to a new report being released that covers just how vulnerable our grid is to attack.

The report, released ahead of a House hearing on cybersecurity by Congressmen Edward Markey (D-Mass.) and Henry Waxman (D-Calif.), finds that cyberattacks are a daily occurrence, with one power company claiming it fights off 10,000 attempted intrusions each month.

[..]

Such attacks could cut power to large sections of the country and take months to repair.

Oh how I miss the days of early cyber security where the threat was little more than kids poking around and learning. These days there is really little defense against the organized military of the likes of China, sigh.

If they want to get you, most likely they are going to get you.

I’ve had a discussion or two with a friend who works with industrial control systems, and the security on those is generally even worse than what I had gathered from the various breaches around the world.

I don’t see any real value in the so-called smart grid – certainly nothing remotely resembling gains that would offset the massive growth in the number of network access points connected to the grid.

It’s probably already too late. All security is some form of obscurity at the end of the day, whether it is a password, encryption, or physical isolation. Obscuring the grid by reducing the number of network connections to it has got to provide some level of benefit…

May 20, 2013

When a server is a converged solution

Filed under: General — Tags: — Nate @ 3:56 pm

Thought this was kind of funny/odd/ironic/etc…

I got an email a few minutes ago talking about the HP AppSystem for Vertica, which, among other things, HP describes like this:

This solution delivers system performance and reduces implementation from months to hours.

I imagine they are referring to competing solutions and not comparing to running Vertica on bare metal. In fact it may be kind of misleading, as Vertica is very open – you can run it on physical hardware (any hardware really), virtual hardware, and even some cloud services (it is even supported in *shudder* Amazon..). So you can get a basic Vertica system implemented without buying anything new.

But if you are past the evaluation stage, and perhaps outgrew your initial deployment and want to grow into something more formal/dedicated, then you may need some new hardware.

HP pitches this as a converged solution, so I was sort of curious: what HP solutions are they converging here?

Basically it’s just a couple of base configurations of HP DL380 G8s with internal storage (these 2U servers support up to 25 2.5″ disks). They don’t even install Vertica for you:

HP Vertica Analytics Platform software installation is well documented and can be installed by customers.

They are kind enough to install the operating system though (no mention of any special tuning, other than they say it is “Standard” so I guess no tuning).

No networking is included (outside of what’s in the servers, as far as I can tell), and the only storage is the internal DAS. A minimum of three servers is required, so some sort of 10GbE switching is needed as well (the servers are 10GbE; you can run Vertica fine on 1GbE too for smaller data sets).

I would have expected the system to come with Vertica pre-installed, or automatically installed as part of setup, and to have a trial license built in.

Vertica is very easy to install and configure at a basic level (a rough sketch is below), so in the grand scheme of things this AppSystem might save the average Vertica customer a few minutes.
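If you have never done it, a basic three node install is roughly the following – this is from memory, so treat it as a sketch rather than gospel, and the host names, RPM version, and password are made up:

# as root on one node, with the RPM copied locally and passwordless SSH between the three nodes
rpm -Uvh vertica-6.x.x.x86_64.RHEL5.rpm
/opt/vertica/sbin/install_vertica -s node1,node2,node3 \
  -r vertica-6.x.x.x86_64.RHEL5.rpm -u dbadmin

# then, as the dbadmin user, create the database across the cluster
/opt/vertica/bin/admintools -t create_db -d mydb -p 'changeme' -s node1,node2,node3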

Vertica is normally licensed by the amount of data stored in the cluster (pre-compression/encoding). The node count, CPU count, memory, and spindles don’t matter. There is a community edition that goes up to 3 nodes and 3TB (it has some other software limitations – and as far as I know there is no direct migration path from community to enterprise without a data export/import).

Don’t get me wrong, I think this is a great solution – a very solid server with a lot of memory and plenty of I/O to provide a very powerful Vertica experience. Vertica’s design reduces I/O requirements by up to ~90% in some cases, so you’d probably be shocked at the amount of performance you’d get out of just one of these 3 node clusters, even without any tuning at the Vertica level.

Vertica does not require a fancy storage system; it’s really built with DAS in mind. Though I know there are plenty of customers out there that run it on big fancy storage because they like the extra level of reliability/availability.

I just thought some of the marketing behind it was kind of strange – saving months of time, being converged infrastructure and whatnot..

It makes me think (if I had not installed Vertica clusters before) that if I want Vertica and don’t get this AppSystem, then I am in a world of hurt when it comes to setting Vertica up. That is not the right message to send.

Here is this wonderful AppSystem that is in fact — just a server with RHEL installed.

For some reason I expected more.

May 17, 2013

Big pop in Tableau IPO

Filed under: General — Tags: , — Nate @ 9:35 am

I was first introduced to Tableau (and Vertica) a couple of years ago at a local event in Seattle. Both products really blew me away (and still do to this day). Though it’s not an area I spend a lot of time in – my brain struggles with anything analytics related (even when using Tableau; the same goes for Splunk, or SQL). I just can’t make the connections. When I come across crazy Splunk queries that people write, I just stare at them for a while in wonder (as in, I can’t possibly imagine how someone could have come up with such a query, even after working with Splunk for the past six years).. then I copy+paste it and hope it works.

Sample Tableau reports pulled from Google Images

But that doesn’t stop me from seeing an awesome combination that is truly ground breaking both in performance and ease of use.

I’ve seen people try to use Tableau with MySQL, for example, and they fairly quickly give up in frustration at how slow it is. I remember being told that Tableau used to get a bunch of complaints from users years ago saying how slow it seemed to be — but it really wasn’t Tableau’s fault, it was the slow back end data store.

Vertica unlocks Tableau’s potential by providing a jet engine to run your queries against. Millions of rows? Hundreds of millions? No problem.. billions? It’ll take a bit longer but shouldn’t be an issue either. Try that with most other back ends and, well, you’ll be waiting there for days if not weeks.

Tableau is a new generation of data visualization technology that is really targeted at the Excel crowd. It can read in data from practically anything (Excel files included), and it provides a seamless way to analyze your data and produce fancy charts, graphs, tables, and maps..

It’s not really for the hard core power users who want to write custom queries, though I still think it is useful for those folks. A great use case for Tableau is for the business users to play around with it and come up with the reports they find useful; then the data warehouse people can take those results and optimize the warehouse for those types of queries (if required). It’s a lot simpler and faster than the alternative..

I remember two years ago I was working with a data warehouse guy at a company and we were testing Tableau with MySQL at the time (small tables), just playing around; he poked around, created some basic graphs, and drilled down into them. In all we spent about 5 minutes on this task and we found some interesting information. He said if he had to do that himself with MySQL queries it would have taken him roughly two days – running query after query and then building new queries based on the results, etc. From two days to roughly five minutes — for a very experienced SQL/data warehouse person.

Tableau has a server component as well, to which you can publish your reports for others to see with a web browser or mobile device; the server can also, of course, link directly to your data to get updates as frequently as you want them.

You can have profiles and policies. One example Tableau gave me last year was a big customer that enforces certain color codes across their organization, so no matter what they are looking at they know blue means X and orange means Y. This is enforced at the server level, so it’s not something people have to worry about remembering. They can also enforce policies around reporting so that the term “XYZ” is always the result of “this+that”, so people get consistent results every time — not a situation where someone interprets something one way and another person another way. Again this is enforced at the server level, reducing the need for double checking and additional training.

They also have APIs – users are able to embed Tableau reports directly into their applications and web sites (through the server component). I know one organization where almost all of their customer reporting is presented with Tableau – I’m sure it saved them a ton of time they would otherwise have spent replicating the behavior in their own code. I’ve seen folks at past companies try to write reporting UIs, and usually what comes out is significantly subpar because it’s a complicated thing to get right. Tableau makes it easy, and is probably very cost effective relative to full time developers taking months or years to try to do it yourself.

It’s one of the few products out there that I am really excited about, and I’ve seen some amazing stuff done with the software in a very minimal amount of time.

Tableau has a 15 day evaluation period if you want to try it out — it really should be longer, but whatever. Vertica has a community edition which you can use as a sort of long term evaluation – it’s limited to 1TB of data and 3 cluster nodes. You can get a full-fledged enterprise evaluation from Vertica as well if you want to test all of the features.

I wrote some scripts at my current company to refresh/import about 150GB of data from our MySQL systems to Vertica each night. It is interesting to see MySQL struggle to read the data out while Vertica is practically idle as it ingests it (I’d normally expect the writing of the data to be more intensive than the reading). To improve performance I compiled a few custom MySQL binaries that let me run MySQL queries and pipe the results directly into Vertica (instead of writing tens of GBs to disk only to read them back again). The need for the custom binaries is that MySQL by default only supports tab delimited results, which was not sufficient for this data set (I actually compiled 3-4 different binaries with different delimiters depending on the tables – managed to get ~99.99999999% of the rows in without further effort). I also wrote a quick perl script to fix some of the invalid data, like invalid timestamps, which MySQL happily allows but Vertica does not.

Sample command:

$MYSQL --raw --batch --quick --skip-column-names \
  -u $DB_USERNAME --password="${DB_PASSWORD}" --host=${DB_SERVER} \
  $SOURCE_DBNAME -e "select * from $MY_TABLE" \
  | $DATA_FIX \
  | vsql -w $VERTICA_PASSWORD -c "COPY ${VERTICA_SCHEMA}.${MY_TABLE} FROM STDIN DELIMITER '|' RECORD TERMINATOR '##' NULL AS 'NULL' DIRECT"
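The $DATA_FIX step above is where the little perl cleanup filter plugs in. As a rough stand-in (not the actual script), something as simple as this handles the zero-timestamp case – it rewrites MySQL’s zero timestamps as NULLs so Vertica’s COPY doesn’t reject those rows, and passes everything else through untouched:

data_fix() {
  # MySQL happily stores '0000-00-00 00:00:00'; Vertica will not accept it
  sed -e 's/0000-00-00 00:00:00/NULL/g'
}
DATA_FIX=data_fix   # so the pipeline above can keep calling $DATA_FIX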

 

Oh and back to the topic of the post – Tableau IPO’d today (ticker is DATA) – as of last check it is up 55%.

So, congrats Tableau on a great IPO!

 

May 14, 2013

Some pictures of large scale Microsoft ITPAC Deployments

Filed under: Datacenter — Tags: — Nate @ 11:16 am

Just came across this at Data Center Knowledge. I have written about these in the past; from a high level perspective they are incredibly innovative, far more so than any other data center container I have seen, at least.

Once you walk outside, you begin to see the evolution of Microsoft’s data center design. The next building you enter isn’t really a building at all, but a steel and aluminum framework. Inside the shell are pre-manufactured ITPAC modules. Microsoft has sought to standardize the design for its ITPAC – short for Information Technology Pre-Assembled Component – but also allows vendors to work with the specifications and play with the design. These ITPACs use air side economization, and there are a few variations.

I heard about some of these being deployed a couple years ago from some friends, though this is the first time I’ve seen pictures of any deployment.

You can see a video on how the ITPAC works here.

May 10, 2013

Activist investors and not wanting to be public

Filed under: Random Thought — Nate @ 11:49 am

Another somewhat off topic post, but still kinda tech related. There has been somewhat of a rash of companies the past few years that say they don’t want to go public, or that are already public and want to go private again.

Obviously the leading reason behind this is often cited as companies not wanting to have to deal with the pressure of short term investors. Some are more problematic than others.

Two events caught my eye this morning. The first is Carl Icahn’s attempt to crush Dell. The second is apparently a dual-headed assault against the board of Emulex for not taking the buyout offer from Broadcom a few years ago.

The average amount of time people hold stocks has collapsed over the past decade or two (I saw some charts of this at one point but can’t find them). People are becoming less and less patient, demanding higher returns in shorter amounts of time. The damage is being done everywhere, from companies surrendering their intellectual property in order to be allowed to do business in China, to companies just squeezing their staff trying to get everything they can out of them. I saw someone who claimed to be an IBMer (a company widely regarded as giving investors what they want) claiming IBM is being gutted from the inside out, all the good people are leaving, and it’s only a matter of time.. This may not be true, I don’t know. But it wouldn’t surprise me given IBM’s performance.

Corporate profits are at record highs as many like to cite. Though I see severe long term damage being done across the board in order to get to these profits in the short term.

HP is probably a prime example of this; as a result, they felt they had to go spend a ton of money acquiring companies to make up for the lack of investment in R&D in previous years.

But of course HP is not alone. This problem is everywhere, and it’s really depressing to see. The latest attempt from Icahn to kill Dell is just sad. Here is Michael Dell, the founder and CEO, trying to save his company for the long term, and in comes Icahn, who just wants to squeeze it dry. Dell may not succeed if they go private – maybe they go bust and investors lose out. Maybe Icahn’s deal is better for investors. For some reason I’d rather view the situation in terms of what’s best for the company. Dell has made some smart acquisitions over the past few years; they need (more) time to sort through them. If investors don’t like what Dell is doing, they are free to sell their shares.

Not long ago BMC got acquired by private equity as well (I really have no knowledge of BMC’s products), and it’s quite possible the same thing happens there too.

HP attacked Dell when the plan to go private was announced, saying things like “Dell is distracted now” and that HP is a better fit for customers because they are not distracted. I’ve got news for anyone who believed that – HP has more distractions than Dell. I have absolutely no doubt that if HP could manage a way to go private they would do it in a heartbeat. I think HP will get past their distractions; they have some great products in ProLiant, 3PAR, Vertica, and others I’m sure I can’t cite off the top of my head.

It’s the same sort of short-sightedness I see all the time with folks wanting to embrace the cloud, and the whole concept of shifting things from CAPEX to OPEX (though some accountants and CFOs are wising up to this scam).

The same thing is happening in governments around the world as well. It’s everywhere.

It’s just one of those things that has kept my vision of the future of our economy at such low levels (well if I said what I really thought you might think I’m even more crazy than I already come off as being 🙂 )

anyway, sad to see….

May 9, 2013

Lost almost all respect for Consumer Reports today

Filed under: Random Thought — Tags: — Nate @ 10:30 pm

I was getting my morning dose of CNBC (I’m not an investor – I watch CNBC purely for entertainment purposes) when news came over the wire that Tesla had gotten a 99/100 rating from Consumer Reports for their ultra luxury green car.

I watched the interview with the guy at Consumer Reports and I was shocked, and I still am. A bit disgusted too.

Let me start out by saying I have no issues with the car itself. I’m sure it’s a fine automobile. My issue is with the double talking standard of Consumer Reports in this particular case.

(forgive me the quotes will not be precise, see the video link above for the full interview)

The guy who wrote this report starts off by saying it’s better than the other $90,000 cars in its price range… (he also goes on to say it’s better than pretty much ANY car they have EVER EVER EVER TESTED — not just better than any other electric car — ANY CAR)

..but..

It is an electric car; while it has a long range compared to other electric cars, I can take a Toyota Corolla and drive to Cleveland from New York — I can’t do that in this car yet.

You can only go about 200 miles before charging it up, that is a severe limitation. (those last two are his words)

CNBC goes on to quote him as saying that if you leave it unplugged, you experience what he describes as a parasitic loss of energy (Consumer Reports’ words) that amounts to 12-15 miles per day, and asks him about that. He responds –

The concern is this is a new vehicle from a new automaker and there’s going to be growing pains. If you’re really looking for something to be trouble free off the bat – look elsewhere (his words too!)

I’m really glad I saw the interview. My disgust here is not with the car. I have no doubt it is a fine car! I think many people should go buy it. But if these sorts of flaws knock only a single point off the score…. that just seems wrong. Very wrong. Especially the last bit — if you want something to be trouble free, look elsewhere — for something that got rated 99 out of 100!!!

He goes on to talk about how it takes 30 minutes to charge the battery to half strength at one of Tesla’s charging stations. He thinks people would be happier (duh) if they could fully charge it in four minutes.

CNBC half seriously asked him if you could charge a Tesla from a hotel room power outlet. The Consumer Reports guy said yes, but it would take a VERY long time.

People buying this $90k car obviously are not concerned about the price of gas. It’s really more about the image of trying to show you’re green than anything else, which is sad in itself (you can be more green by buying carbon offset credits – folks who can afford a $90k car should have no trouble buying some). But that’s fine.

Again, my issue isn’t with the car. It’s with the rating. Maybe it should be a 75, or an 85. I don’t know. If it were me I’d knock at least 20 points off for the lack of range and lack of charging stations alone. Now if gas were, say, fifteen dollars a gallon I could see giving it some good credit for saving on that.

I think what Tesla is doing is probably a good thing, it seems like decent technology, good (relative to other electric vehicles) range. Likely can’t take it on a road trip between SFO and Seattle any time soon..

I have relied at least in part on Consumer Reports for a few different purchases I have made (including my current car – which, if I recall right, Consumer Reports had no rating on at the time; it was too new. One of my friends just bought the 2013 model year of my car a few weeks ago). It was extremely disappointing to see this result today. Maybe I should not have been surprised. I don’t pay very close attention to what Consumer Reports does – I check them usually once every couple of years at most. This may be par for the course for them.

Internet tech review sites have often had a terrible reputation for giving incorrect ratings for one reason or another (most often it seems to be because they want to keep getting free stuff from the vendors). I had thought (hoped) Consumer Reports was in a league of its own for independent reporting.

I think at the end of the day the rating doesn’t matter – people who are in the market for a $90k car will do their homework. It just sort of reeks to me of a back room deal, or perhaps a bunch of hippies over at Consumer Reports dreaming of a future where all vehicles are electric, never mind the fact that the power grid can’t handle it (I can hear the voice of Cartman in the back of my head – Screw you hippies!). Tesla wants all the positive press they can get, after all.

So let’s see. We have a perfect score of 99/100, alongside which we have the words severe limitation, parasitic loss of energy, and look elsewhere if you want a trouble free experience ……..

I’ll say it one more time – my problem is with the perfect rating of 99/100 not with the car itself.

