May 04

So a little while back I had noticed that the Netgear DG834N that I had been using was getting quite hot at times with regards to the 240V Brick power supply – it even needed to be turned off at times to let it cool. So I hunted around and then decided to replace it with a fairly simple and effective Billion 5200N – nothing fancy – just a simple and cheap ADSL2+ capable Wireless Router/Modem combo.

So before removing the Netgear I had copied down all the specific settings information, powered up the Billion in isolation and applied all the settings OK, then tested and then finally swapped it out. And it ran very well – so well in fact that the lass who was working next door who was “borrowing” some Internet for her MacBook didn’t even realise there was a swap out.

So I thought I’d done pretty well with that – UNTIL – I came home to find a bill from Telstra (the incumbent Telco) for $73 for the month? WTF!!! We normally just pay the min. amount just to be connected to the copper and I have an account with PennyTel to redirect all phone comms on to the VoIP courtesy of a neat little Sipura SPA-3000 unit. Sipura has since been bought out by LinkSys and then Cisco – but that’s another story? ;-)

So I’m looking at this bill and then it occurs to me that I didn’t double-check the VoIP after the switch – DAMN!! So I check the Web Admin page, and sure enough I can see that it’s failing to register with the service… so I then started to try and configure a SIP service on the device for port 5060, and find that it’s not simple and not easy, and what the heck – why should it be so hard? So then I start looking around and consulting lot’s of forums, etc…  So I spent about 4 – 5 hours plus that night trying to get the bottom of it – no joy. Major bummer.

So when I came home the next day I was chatting to my brother and we go through all the obvious things…  then we decide that I really should try a SoftPhone from my PC, bloody hell, it works – so the Router *IS* passing VoIP and SIP anyway? I confirmed this with Wireshark – 5060 and SIP traffic is passing through no problems. So what is the cause of this problem? Why won’t my little box access and deliver VoIP traffic?

So I step through the settings from the top one more time – and then it hits me that I had set up the little SPA-3000 unit with a static IP – NOT DHCP!!!!
The older Netgear was set with 192.168.0.1 as the Gateway – the Billion came default with 192.168.1.254 – so I arbitrarily chose to make it 192.168.0.254 as the new gateway

AND – I had not updated the SPA-3000 to point to the .254 – it was still pointing the DNS and Gateway to the old .1 address
Problem solved, VoIP registration complete and Phone now working as it should.

  • Share/Bookmark

written by dcaddick

Dec 23

So as things slowdown in the lead up to the break I have had some time to get back and review some twitter posts and other blog articles that I have been meaning to catch up on. I must admit that I tend not to use Twitter that much but do find it quite useful as pointers to additional material for research and recently posted material on the topics that I’m interested in.

So one article that really caught my eye was this from Ruben Spruijt’s post at Brian Madden, because for a number of years now it is the Hard Drive that has not kept pace with advancements in technology for the rest of the PC/Laptop components, and in particular I have noted a number of people getting very enthusiastic about the performance improvements they have actually noticed when using Laptops with SSD’s

Runcore SSD’s – promises to boost computer performance vs. the traditional 1.8" HDD by more than 400% with read/write speeds up to 75/40MB/sec.
Experience with 128GB 1.8" ZIF in HP 2710p – Read/write performance for 4K random files has improved over 500% (even under Bitlocker). it just became the fastest computer I ever used… Word and Excel launch in one or two seconds (compared to 30-60 before)…

Just how fast does the storage component of VDI have to be?
So it stands to reason that if you are going to Virtualize your Desktops (and by definition centralize?) then you are going to need to give some serious throughput for the Disk I/O or IOPS? So for anyone who is embarking on this then I would seriously recommend giving this article a through read? Understanding how storage design has a big impact on your VDI!

Possibly the most interesting point that Ruben did bring to my attention with this is that in VDI implementation the Disk I/O is predominately Writes and not Reads – I know this sounds contradictory in some respects, but do review this for yourself – in particular is a good comment from Claudio Rodrigues regarding the nature of how the O/S’s deal with this, as well as confirmation from Dan Feller at Citrix who confirms that most of the numbers and math are in sync with his findings. Ruben has also supplied a Sizing Tool to get a good idea of what might be needed?


So it also starts to make sense that you make sure that you focus very sharply on the XP or Windows 7 Template VM’s in the first place to reduce the I/O required as much as possible?
Citrix has some great tips in this document - Best Practices – Citrix XenDesktop with Citrix Provisioning Server

So as this brought up some interesting thoughts I also found this article relating to sizing and best practices, VMware View sizing & best practices
which followed on from this original article Virtual Infrastructure best practices and in essence this confirms that the storage requirements will need to focused somewhere around an 20/80 split on the Read/Write (20% read and 80% write)

While this table gives a quick representation of what sort of sizing requirements are out there please make sure you research this subject thoroughly, and read the post in full? J

In all fairness I must also confess at this point that my weakest area in IT is storage, and as yet I have not had a functioning Lab running either Citrix’s XenDesktop or Vmware’s View to be able to play around with this and get my hands dirty with either Provisioning Server or Composer. From what I can see (based on Marketing so far J) both of these products have the ability to drastically minimize the storage needed for VDI and as a consequence I am assuming that this may indeed have a flow on effect to mitigating the IOPS issue? (Can anyone comment or set me straight on this?)

I then went on to find some more details around the costing and financial side of VDI implementations and found this article of Doug Brown’s regarding Cost Savings of VDI: Is It Possible? as the diagram below points out, it is complex, and as such it could be that the regular ROT/TCO calculations are not highlighting all the potential benefits and savings?


Changing the ROI/TCO Calculation?
So this then brings me to another update in my Twitter was from Tyler at LiquidwareLabs.com with a new addition to the ROI/TCO debate,
COP – Coefficient of Productivity in VDI the New Math
, in which he points out that you should also factor in any additional productivity gains and use this to dilute the true cost of the CAPEX.

Now I wonder how many VDI proposals are brave enough to start assigning values to what has traditionally been seen as "additional" benefits related to an architectural change? Don’t get me wrong, I’m not against it, but I can see the debate might start to get a bit heated in the boardroom? ;-) and in some respects maybe that’s exactly what’s needed – by assigning values and assumptions and then extrapolating them across an enterprise the resultant *potential* savings become too hard for the board to ignore?

Sure there will be debate, and rightly so, but in following this sort of process it should quickly become apparent that there is a technological shift under way where the concept of "My Desktop" still being something that can be pointed to in the Office sitting on a desk somewhere is rapidly becoming redundant?

Now having said all that – it’s interesting to note that Gartner’s Hype Cycle Report for 2009 seems not to have VDI listed? Maybe it’s hiding under something else here?

So will VDI make it to "mainstream"? AMD seems to think so, Taking Desktop Virtualization Mainstream, but also a lot of it will depend on a number of factors.

My main thoughts on this are:

To truly succeed VDI needs to be able to deliver a *High Fidelity* User experience equal to or better than what users have today – if what you are designing is unable to provide this then you should stop and re-evaluate now

If you can build a system that is able to provide this then users will want (no, Demand!) that they have it ASAP – you will know that you have succeeded when demand outstrips supply

To provide this *High Fidelity* User experience I’d suggest focusing on two key areas that may/or may not apply depending on your particular focus and what is considered in or out of scope for your project?

  • Graphics
    • Bottom line – Graphics performance in this design CANNOT be measured, it has no metrics, or Best Practices. One persons "that’s fine" is another persons "there is absolutely no way I’ll put up with that", etc. This will always be a completely subjective assessment based on whoever is rendering the judgement.
    • You may think that you don’t need to focus too much on Graphics because "there isn’t much need or requirement for Video from the business" but I can almost bet that you would be wrong – if you are operating on this assumption be sure that you double-check with Stakeholders and get their agreement in writing?
    • So much content on the Internet is done in Flash today, it’s not just watching a CEO or Analyst briefing via YouTube, a lot of Web Based Training modules use the same or similar technologies, so even if you currently block YouTube and Facebook today it’s still quite likely that you need to factor this in to your design
    • Is RDP good enough? Check this assumption by giving targeted business users a Thin Client and removing their desktop to the lab and forcing them to access via the Thin Client – this is a simple but effective test that validates whether they truly need/demand High Fidelity, or will OK Fidelity do?
    • Do you need Citrix HDX? PCoIP? RGS? While I’m making the point about Remote Protocols – don’t forget to gather details of connecting accessories? Do you need to support serial or parallel?
  • HDD or IOPS
    • Make sure the VM performs as well as possible while measuring the I/O on the VM itself? Check the disk queue, and ensure that it’s not the disk performance that’s holding this back – and make sure you have enough throughput to allow for Login’s etc.

So where does that leave us?
In the Graphics space
there is already a number of solutions for VDI that have been out for some time (Citrix’s HDX) or have recently been released (Teradici’s software version of PCoIP). In addition to that there are other approaches if you need to support High Fidelity Users at the top end with HDX-3D taking advantage of Nvidias GPU’s that almost bring TeraFLOP performance to the Desktop.

A good introduction to CUDA – Just how powerful can GPU’s be? Back in mid-2008 the GT200 was able to deliver 1 TeraFLOP performance, the G80 supports 768 Threads per Core – on 128 Cores… Want a Personal Supercomputer? What would you use it for?

So I suspect that we are not far away from seeing vGPU’s being available to VM’s that will be able to deliver the performance on demand – the main thing is that you make sure you are either using a Remote Protocol stack that can provide this functionality – OR make sure you are using a Broker that supports a shift to one?

At the HDD level I’d suspect that as SSD prices come down and SSD longevity increases, along with improved methods of utilising this in some kind of shared caching mechanism in a VDI deployment we may well see some serious advances that enable VDI to overcome this IOPS roadblock?

With the monotonous regularity of Moore’s Law driving CPU performance up it should be possible to provide way more CPU power to users than they really need to enable this – but again – it’s quite possible that the Disk I/O that has the potential to spoil this party?

Please feel free to correct any of my ramblings? ;-)

  • Share/Bookmark

written by dcaddick

Sep 01

So it’s hardly a surprise that Citrix have decided to “Announce” the long awaited HDX-3D “high quality user experience” protocol enhancement in the early hours prior to the start of the VMworld conference? Pardon me for being something of a sceptic, but it seems that this might have more to do with the fact that Teradici are expected to release the Software only version of their PCoIP protocol stack in Public Beta at this event?

I’ve had a look around the MyCitrix.com site and from what I can see there is no new components as the latest update was the addition of the Feature Release 1 update of the XenDesktop Agent component back on the 5/1/09? I’d be more than happy to be proved incorrect, but I’d imagine that if Citrix had actually posted this new component up on the web site as an available download then they’d be making sure it was “in your face”?

I’m pretty well acquainted with RGS and it’s capabilities, so I’m quite keen to get my hands on both of these new technologies and give them a going over to find their strengths and weakness’s.  One of the biggest issues/problems is that the visual quality that is delivered is subject to “peoples perceptions” and as such can be quite difficult to comparisons? One persons “great”, can be another persons “rubbish”?

I’ll be at Microsoft’s TechEd 09 on the Gold Coast here in Australia next week (on the HP Stand) and I’m hoping to be able to do live demonstrations and comparisons of RDP6, RDP7 and RGS so that Customers can see for themselves why implementing the correct Remote Protocol stack can be crucial depending on their particular circumstances.

Citrix Enhances Market Leading HDX Technology for High-Definition Virtual Desktops

Mon Aug 31, 2009 3:01am EDT

Breakthrough New HDX 3D Technology Enables Citrix XenDesktop to Deliver High-End
Professional Graphics in Real-Time over Any Network
SANTA CLARA, Calif.--(Business Wire)--
Today Citrix Systems, Inc. (NASDAQ: CTXS), announced a new breakthrough addition
to its market leading HDX Technology, ensuring a high-definition virtual desktop
experience even for the most demanding high-end professional graphics
environments. The new HDX 3D technology, available as a feature of Citrix
XenDesktop, delivers a rich, high-definition desktop experience applications to
users in any location, over any network. Combined with the full portfolio of HDX
technologies in XenDesktop, the new HDX 3D capability allows organizations to
leverage talent globally, while still housing data centrally, assisting in
real-time collaboration among disperse design teams while still ensuring the
security of critical intellectual property. HDX 3D is part of an extensive range
of HDX technologies from Citrix that go well beyond a single protocol to address
potential problems in all parts of the IT infrastructure, from the datacenter,
to the network, to the device. Combined with the company`s groundbreaking HDX
Adaptive Orchestration technology, XenDesktop can also dynamically adapt to
changes in the environment by applying the best technologies for each unique
user scenario (see http://hdx.citrix.com for more detail).

Citrix Enhances Market Leading HDX Technology for High-Definition Virtual Desktops | Reuters

  • Share/Bookmark

written by dcaddick

Aug 18

Some while back one of my US based colleagues passed on some advice about checking out some details on how to get the best out of Wireshark

For anyone interested in getting their feet wet using Wireshark the network protocol analyzer.  This knowledge can be useful to have when dealing with network anomalies.

www.chappellseminars.com/s-wireshark101.html

Download the latest 1.2.1

As you may or may not know this was originally called Ethereal, and then morphed in to Wireshark, and has recently had quite a number of improvements and has moved from ver. 0.9 to now 1.2 plus in the last few months. There has been quite a number of additions including the ability to graph throughput etc from within the tool, as well as it now supporting GeoIP DB’s so that you can carry out extensive mapping of where the packets are going to or coming from.

http://wiki.wireshark.org/HowToUseGeoIP

Running Windows 7?

If you are running Windows 7 – then do be aware that the WinPcap driver (the component that does the sniffing) will fail to install by default – but if you modify the executable to run in Vista SP1 compatability mode then all should be fine – as detailed below:

I’ve just downloaded WinPcap 4.1 beta5 from here: WinPcap, the Packet Capture and Network Monitoring Library for Windows Set the compatibility mode to Windows Vista (right click on the installer executable then select Properties; on the Compatibility tab, check "Run this program in compatibility mode for", select Windows Vista SP1 from the dropdown list, then finally click OK =)) and it will install as it should.
For me it worked flawlessly so far.

Further reading

I then followed this up a bit further and noted that after a recent Sharkfest event there were a number of presentations made by a chap called Ray Tompkins (CEO of Gearbit) and these are available at:

At Sharkfest 2009 gearbit presented 3 sessions::
Finding the Latency:
How Protocols Work:

Wireshark Charts & IO Graphs:
OSTU – Wireshark IO Graph for Response Time Analysis:
Understanding the Need for Protocol Analysis: HYPERLINK
OSTU – Wireshark Case Study: Benchmark Test
OSTU – Wireshark TCP Stream Graphs
OSTU – Wireshark Capture Filters
OSTU – Wireshark Display Filters
OSTU – Identifying Zero Window with Wireshark

If you do find that you have to dig in on a Customers Site to start doing some serious troubleshooting around Networks then I would seriously recommend the first two presentations in PDF format as they do appear to explain things in a very simple and matter of fact way.

Wireless Issues:

Now this should in no way be any sort of substitute for a proper Wireless Survey, but when you find that you are up against some issues then try using inSSIDer as a very good starting point? And it works on Windows 7 straight out of the box ;-)

image

  • Share/Bookmark

written by dcaddick

Jun 28

***UPDATE***

Please check the comments below – but it appears I’m incorrect on the statement regarding Teradici PCoIP needing a PCI card, there is a SW/SW version in development and I’d be guessing this might be coming out sometime around VMworld as Beta? Just a guess on my part?

So just as we headed in to the weekend I noted Brian Madden’s post – Microsoft’s RDP host-side rendering (Calista) plans include optional GPU offload hardware & custom chips. (Oh, and Hyper-V is required!) as well as Alessandro’s post Is Microsoft silently building a better VDI? and realised that this also tied in with some recent articles earlier in the week from the MSDN side of things around the new Remote Desktop Services (RDS now effectively replaces the old Terminal Services) on Windows 2008, namely getting Aero Glass Remoting in Windows Server 2008 R2 and Running WS08 Terminal Server as a virtualized guest under Windows Server 2008 R2 Hyper-V.

So now this started getting me thinking, there is an almost unbelievable amount of hype right now about VDI, where it’s getting to the point that as soon as we hear about either a Customer or a Partner telling us that they have installed a PoC/Pilot or deployed VDI we can almost guarantee the next thing they will be wanting to know is “how do I get decent Video/MultiMedia/USB performance, it’s not working correctly, and I want it fixed”.

Ideally what should happen when designing a Desktop Virtualization Strategy (in my opinion) is start with the Success Criteria that the design needs to meet as defined by the business units/stakeholders/users, and if they are unsure or don’t know, then the best way to think about this is to strive to provide a User Experience that is *indistinguishable* from a normal PC.

Which is why the first part of designing the solution should be deciding on the Remote Protocol that is needed to meet the Success Criteria?

  • If you only need to supply standard Office Apps then RDP will likely be fine
  • If it’s Office Apps plus some Video with some USB devices then you may find that you can get away with RDP plus one of the RDP enhancements like TCX, etc.
  • If it’s Office Apps plus some Video with some USB devices and you need to make this work over something less than a 10Mb LAN with a possibility of anything greater than 150 – 200ms latency then you may want to seriously consider ICA as the only viable choice.
  • Now if you have demanding users that require a true PC like experience regardless of how it’s done then there are a number of choices like RGS (from HP), PCoIP (from Teradici), SPICE (from Redhat?), etc. however as you start to examine the pro’s and con’s of each one you may find that each one of these may bring further limitations to yuor design?
    • The key takeaway from this is that you can have either a good 
      user experience *OR* low bandwidth – not neccessarily BOTH?

Desktop Virtualization today:

So my opinion is that currently there are two front runners out there today marketing, selling and deploying Virtual Desktop Solutions:

  • Citrix XenDesktop
  • VMware View

I know there are others of note, but generally these are the two main leaders today who are spending the most in Marketing trying to own the top spot in this area, and who both not only have a Solution and a Broker but also a Hypervisor that provides the grunt at the back end. However to a certain extent both of these solutions are fundamentally flawed in that neither of them *currently* have (or support) a Remote Protocol that delivers what I would refer to as a High Quality User Experience that is available today.

Side note on “High Quality User Experience”:

So far most of what I have seen in the field is examples of “Desktop Virtualization” that have been designed by IT for IT – not for the business or users. In some instances it has come as quite a shock to those designing the Solution that one or all of the various Stakeholders, Business or Users not only want the ability to view Training Video’s, but actually have the audacity to demand it as a Criteria of Success for the project? And USB redirection for their Blackberries too!

Bottom line, if your planned Desktop Virtualization solution does not include MultiMedia and USB as a measure of success then there is a good chance it’s fundamentally doomed? At the very least include some scope for a Phase 2 that will expand the project to include this and plan for it up front?

Yes VMware has announced a partnership with Teradici to develop PCoIP for VDI instances of View, (VMware does support HP’s RGS in View, but only to BladePC’s and BladeWS’s) but this appears that it will be based on a H/W PCIe card that will be capable of supporting 32 and 64 VDI instances from the hosted end, so there are some limitations like having to use Servers that can support PCIe cards, but the real kicker is that these cards won’t be available until mid-next year?

Yes Citrix has HDX, (re-Marketed term covering the ICA Protocol :) ) and although they have now started releasing the HDX-Flash add-on there is still no sight of the HDX-3D? For the life of me I can’t understand what is holding up Citrix on this, if it is really as good as it’s supposed to be then they should be releasing it now while they can beat VMware to the punch.

How to create a portable ver. of RDP7:

So with this in mind I looked a bit deeper at the RDP7 side of things to see what differences there were? First thing I noticed was that it’s still referred to as Ver. 6.1.7100.0 (this is Ver. 7, right?), and after some quick googling the next thing I came across is that it is relatively simple to take a copy of the MSTSC.EXE and save it to a folder on another machine and run this version (you will also need a copy of MSTSCAX.DLL and a folder beneath this named “en-US” and in this you’ll need a copy of MSTSC.EXE.MUI and MSTSCAX.DLL.MUI)

Once you have this accomplished you’ll be free to run the RDP7 Client to connect to either your Windows 7 or Server 2008 Host and see what differences there are. Essentially all of the differences are covered under the post:Aero Glass Remoting in Windows Server 2008 R2 but I must admit I was quite surprised at how well it was able to cope with a DiVX avi of Kung Fu Panda, the picture was really sharp and defined, the audio appeared to stay in synch and it didn’t appear to consume much more than 4 – 500Kb/s. When it came to Flash this again had good definition and sharpness – but there was lot’s of gaps in the playback – so on Flash it still scores a miss?

Putting this in context, here are some very rudimentary tests:

RDP6

RDP7

General inactivity

2 – 25 or 35Kb/s

2 – 20Kb/s

Open Outlook

200Kb/s – peak 500Kb/s

150Kb/s – peak 400Kb/s

Open local version of Kung Fu Panda this was using ¼ of screen at 1440 x 900 – no real change in bandwidth when changing to Full screen

2Mb/s – 5Mb/s

500Kb/s – 250Kb/s sometimes lower

Seek in Video

Spike to 15Mb/s

Spikes to 500Kb/s

Flash Video – YouTube

2Mb/s

2Mb/s

**NOTE** I did not test the Aero feature

clip_image002

Bandwidth was simply measured by using “Bandwidth Monitor” from www.bwmonitor.com
This also had the effect of causing 20 – 25Kb/s consumption just in the act of being displayed in the remote screen, so typically it was hidden until needed.

Conclusions:

  • RDP7 is definitely an improvement – and provides a much clearer and cleaner display
    • Gone are the old days of RDP presenting the screen with 6 or 7 horizontal bars from left to right and top to bottom
    • It does appear able to deliver quite reasonable Video today
    • It does not appear to have delivered much or any improvements in Flash
  • Has it reduced bandwidth needs?
    • Not much at the bottom end?
    • But for Video and high usage scenarios? A resounding Yes.
    • But I would still caution folks to do their own testing prior to jumping in? Bandwidth requirements for any Remote Protocol is both very bursty and it’s measurement is very subjective depending on what is happening on the screen?
  • Do you still need ICA?
    • ALL Remote Protocols will suffer as soon as latency increases above 200 – 250ms – *APART* from ICA, Citrix has a number of technologies that make it possible for the ICA Client to manage and cope with latencies of 1000ms and more.
    • HOWEVER, you then don’t have a “High Quality User Experience”, and I would very much doubt that even when the long-awaited HDX-3D does make it’s debut that it will be able to cope with much more than 200 – 250ms. If it can it will be *THE* Remote Protocol to beat.
    • Share/Bookmark

    written by dcaddick

    Jan 12

    A colleague of mine was after some info on a tool that I had mentioned before and while I was at it I had some extra tools and links and other handy stuff that I thought I’d share?

    The details regarding Fixed Mode below have proven to be very useful today in getting PXE working across multiple VLAN’s – very nice to see it burst in to life :-)

    Gui tool to check AD/LDAP context etc. (30 day free trial)

    Softerra LDAP Browser

    Altiris DS 6.8 Docs… (Very useful for additional info)

    chapter-1-preparing-your-server-ds68sp2-installation
    chapter-2-installing-deployment-solution-68sp2
    chapter-3-introducing-ds-console
    chapter-4-introduction-imaging-using-pxe
    chapter-5-ms-dos-a-pxe-automation-option
    chapter-6-winpe-pxe-automation-option
    chapter-7-hidden-bootworks

    Troublshooting PXE

    Altiris KB – PXE not crossing subnets or VLANs
    Altiris KB – Configuring PXE to work across VLANs
    Altiris KB – Error: PXE-E53: No boot filename received (very comprehensive list of 11 possible causes and 14 possible resolutions, and a reminder that when using Wireshark filter down to BOOTP packets to focus on PXE)
    Installing Altiris Deployment Server and Altiris PXE Server in a VLAN Environment this lead me to this link of a PDF (PXE Modes of Operation) regarding setting PXE in “Forced Mode” that has just worked a treat in a multi VLAN environment and essentially it means that you simply set Options 60, 66, 67 and 43 on the DHCP with the relevant details of where to find the PXE Server

    Now setting the Option 43 correctly is not quite that simple, so I have added my explanation here:

    All figures/values need to be in HEX
    Example: 06 01 0B 08 0F AA AA 03 C0 A8 02 C8 C0 A8 03 68 C0 A8 01 66

    06 01 0B 08 and AA AA are set by Altiris and should not be changed

    0F = Specifies the number of octets that exist to be read after this one (the fifth octet) in this particular case it’s 15

    03 = Number of PXE Servers, change as needed, in this example there are 3 PXE Servers

    C0 A8 02 C8 C0 A8 03 68 C0 A8 01 66 = the IP Address of each PXE Server, defined in HEX, octet by octet

    C0 A8 02 C8 = 192.168.02.200

    Once you have crafted the Value of Option 43 as described above you simply need to add a zero value of 00 to indicate the end of the Value and then add this to the DHCP Server, paying particular care not to add any spaces or CR’s?

    Alternatively I have also discovered a tool that can help in making this calculation at PXE Forced Mode Utility

    • Share/Bookmark

    written by dcaddick

    Dec 03

    Recently I found myself at a school down near Melbourne carrying out what should have been a relatively simple PoC (Proof of Concept) of 20 x BC2500 BladePC’s in an Enclosure with some T5730 Thin Clients, and initially all went pretty well and then I found that I had some issues with the SAM Registration service….. from the BladePC’s not communicating very well (or at all) with the SAM Server

    BTW – SAM = HP’s Broker, Session Allocation Manager

    So then I was thinking through some ideas about what the next steps might be to help isolate it?

    Initially I was getting a note in the Application Event Log for all the BladePC’s that they are unable to contact the SAM Server, and yet:

    • they can ping it successfully
    • I can connect to the blades using RGS fine
    • The Windows Firewall is disabled for now
    • There are exceptions in the firewall any way for the SAM Service

    So I then used WireShark to take a trace and isolated one conversation with a blade and the SAM Server and got it to translate the TCP conversation and the resultant HTTP reply from the SAM Server indicated an internal error – so I simply rebooted all the Blades and the SAM server and I finally got the devices registered – hurrah!! I thought I was out of the woods.

    Then as the day wore on it appeared that after a period of time the Blades would become “Offline” with a red circle and the diagonal slash indicating they were offline and they would no longer respond – a reboot will get them going again.

    Now if I was just troubleshooting this it would not have been too bad – but I was also seeing some other very curious issues like:

    • When I connected from the T5730 to the BladePC via RGS direct (no SAM) it is verrrrry slow (5 – 10min login)
    • When I connected from the T5730 via the SAM Client it is verrry slow just to start the Login process (even longer?)
    • Once connected the experience is not too bad (not real snappy, but just about useable)
    • Connecting a USB device to the T5730 to test Video causes the RGS session to drop – with no warning – before the Media Player comes up

    Now was I right to suspect the network at this point? I’m not a Network/Switch expert, but I know enough to be dangerous? ;-) but it was starting to look like there was more than just a hint of coincidence with what was going on?

    Curiously enough I’ve also found that while my laptop is connected to their network and using the proxy to gain access to the ‘net to get email that the connection to exchange keeps bouncing quite a bit and when I send an email it can sometimes hold in my Outbox for hours before finally getting under way….

    Outcome – all now sorted – it would appear that even though the SAM/Altiris Server AND the BladePC Enclosure was plugged in to the Core Switches there was a ProCurve 2650 that was throwing out FCS Errors and causing a ruckus – this also happened to be the unit that I was trying to go through with the T5730 Thin Clients……

    This morning a Network Guru from their integrator turned up and updated all the firmware on the Cores and 2650 alike, found the issues were related to the 2650 only, this was then swapped out for a temp unit – then it seemed like nothing was working at all with no connectivity to the Enclosure, then it was discovered that Spanning Tree was the cause of this, so that was hit on the head – and now everything has settled down nicely.

    Good news is that apparently HP’s ProCurve Switches come with a lifetime warranty – I learn something new every day? – so as it turns out it’s not too bad?

    So a day or two late, but finally back on track.

    Incidentally, when I was trying to update the Firmware in the Enclosure for the Switch and the IA (Integrated Administrator) via a simple TELNET I was getting timeouts and errors, after this wayward Switch was replaced it all worked a treat.

    Lesson:
    As with Citrix and VDI, as well as BladePC/BladeWS, it’s always worth remembering that this is typically the first time that a network will have been asked to provide “real time” network performance. Up to this point even things like email are effectively only “store and forward” process’s so do not be surprised when a relatively simple PoC (Proof of Concept) shines a Spotlight on any Network issues?

    I hope this helps others?

    • Share/Bookmark

    written by dcaddick

    Sep 25

    After a wet and wild weekend camping Dugie pulls out all the stops again…

    Thanks Dugie, I wasn’t aware I needed to install the 64bit version to make it work?

    URLs for Windows 2008 RC0 and WSv

    WSS, Virtualization September 25th, 2007

    If you want to download Windows 2008 RC0 with the Windows Server virtualization preview, the URLs are below:

    Remember to grab the 64 bit version (6001.16659.070916-1443_amd64fre_Server_en-us-KR1SXFRE_EN_DVD.iso) as it is the only version that will run WSv

    John Howard has some great install primers in his post , “How to install the Windows Server virtualization role in Windows Server 2008 RC0“, a snippet is below:

    more at source…

    • Share/Bookmark

    written by dcaddick

    Jun 16

    The Concept of DR (Disaster Recovery) is dead!

    Pretty provocative statement? What do I mean by this?

    Typical Scenario

    Large sized Enterprises of 1,000 users or more have looked around at what they implement in IT Technology and realized at some point that, although they have done their best to eliminate any potential "Single Points of Failure" during the design stage, they have now come to accept (especially post 9/11) that in the event of some kind of Natural Disaster or other major failure to the building or Data Centre they will not be able to continue to trade or stay in business.

    This has generally led to the devising of a “Disaster Recovery” plan. Normally in the first instance a site is decided upon based on a third party’s Hosting Centre, then due to cost constraints of the service and/or dedicated Hardware (that everyone expects never to use, BTW) this plan is somewhat stifled at birth and can sometimes remain moribund.

    Any of this sound vaguely familiar?

    This then possibly progress’s at some point (when the next yearly(?) review of the DR Plan comes around) to add a bit more hardware and possibly some dedicated Leased line of some kind to enable comms? Although this is still somewhat half-hearted, when questioned about why it takes so long to test someone will usually point out that the reason it took so long is that:

    • A – It wasn’t done properly by *the other guys/consultants/*
    • B – We should test every quarter – "It would be easier and quicker….."

    When the next review comes around, the board hit’s the roof when they’re told that in the event of a Disaster the IT dept. thinks it would realistically take between 2 – 5 days to get up and running. (This is of course depending on what the board are told, or indeed how much it cost to implement the DR in the first place?)

    Result

    By the time they’ve got around to doing quarterly tests, you can almost guarantee enough full time work for one Project Manager, as well as 3 – 4 IT staff being dedicated to this for 1 – 2 weeks every quarter, not forgetting the users from the Business to run the testing. On top of this you also have an array of Hardware that may/or may not fit your needs depending on whether you have purchased the kit outright or it’s *shared* from a service supplier - ALL of which doesn’t do much – BUT still needs to be replaced and/or upgraded at various stages.

    Is this the best use of budget and resources? I don’t think so – do you?

    Time to re-think the approach?

    If you are going to re-think or re-design your DR strategy the very first thing that you should think about is getting the appropriate buy-in from the rest of the company. What do the other dept. heads think is a valid recovery time? Explain to them that this time has a logarithmic effect on the cost.

    Zero down time costs sqillions, even reducing the down time of critical systems to less than 5 minutes can seem extortionate, and yet we all feel that a 4 – 6 hours recovery time for DR should be within our grasp at a reasonable cost? Shouldn’t it?

    Take a leaf out of the best practices as deployed by bigger organizations and see if this can’t help you deploy a better model of DR?

    We now live in a world with a reasonably healthy supply of comms and bandwidth (apart from our colleagues further away from the main centres, Sorry chaps.) **SO**, Instead of implementing a design based on *Live* – *DR*, look instead to implement a design based on *Live* – *Live*.

    My personal choice of design to mitigate a disaster is to design a system that incorporates the idea of 2 Data Centres from the start. If you can’t afford much, then just make the secondary site smaller, even if it means that you only incorporate 1 x Citrix, 1 x Database and 1 x AD/Exchange . Most companies at this level will have either AD or NDS (or some other Directory Services) and as such be aware that they have limitations around tombstoning of deleted objects in the tree that mean you CANNOT just copy a Domain Controller or Catalogue and leave it on tape at the DR Site for when you come running – any more than 60 days and you’re Toast! You really need to have something live at the Secondary site.

    This where the cheap bandwidth part comes in – think about how much time and effort it takes to keep on top of the Backup System? Wouldn’t this be so much easier if you just had some form of replication of your backend systems to the Secondary site? Don’t forget that in some cases there will be no alternative option except go out and buy the kit and place it on the secondary site – HP Superdome’s don’t just sit around in a warehouses, so if you’ve already bought it, you might as well be using it – rather than letting it sit around gathering dust?

    **Caveat**
    This will not suit all situations – in some organizations that simply cannot cost justify this level of continuity the numbers will never show it to be viable. But certainly, any organization that relies on a minute by minute continuity of service to keep in business could find this design approach highly effective.

    If this concept still stretches your budget too far, then think about at least deploying a Remote Access system from another site? Once you have a DC, a Secondary Exchange Server with OWA at least you can still get eMail? Got a bit more cash? Start replicating the more important files to the DC – expand to more storage and then add a Citrix Server. You can always accomplish this by Stealth if you have to?

    Why I think DR is dead

    By designing systems as *Live* – *Live* or Primary and Secondary from the start it means that whenever you initiate Change Control to make changes to the System (You do have Change Control don’t you? ;-) ) you are always doing it both sites regardless of location. It becomes more than a process – it becomes ingrained!

    This is what I mean by the provocative statement at the start, even if the second site is smaller and has less kit, can handle less users – SO WHAT! – when your main Data Centre has just been hit by the fact that you can use the Citrix servers in the secondary site from:

    • an Internet Cafe
    • Home
    • an Airport
    • a Laptop at a Wireless HotPoint in Starbucks
    • Any site you choose

    *INSTANTLY* with no cross over or manual intervention required

    The IT Director will appear to be some kind of Svengali who has magically managed to keep the Systems running despite the .

    The Secondary Site needs to be scalable

    The reality is that with all this going on around you, there is a very distinct possibility that there will be so many people trying to fix things back at HQ, or look for new Office space until the current HQ can be that there is likely to be only a requirement of 30 – 40% of users needing access to the system.

    The next thing to do then is shore this up and anticipate your needs, there is a distinct likelyhood that you will see capacity back up to 70 – 80% within days – BUT you now have days in which to get the hardware, re-image, recover tapes and be ready for it.

    And won’t you come out of that smelling of roses?

    Costs

    Now to cap off this little rant, let’s get back to costs shall we? Please believe me when I say that no 2 companies will have the same requirements so this is all pure guess work – BUT caveats read and understood J

    Once you get to ~ 1,000 users the resources required will pretty much scale the same (ish….) (BTW, as far as I know the ~ symbol means approx., OK?)

    1,000 users can mean about IT head count of ~20 – 50 depending on the propensity of IT to do in-house development and/or do things in Access, etc.

    A traditional approach to DR could see us with:

    A bunch of hardware sitting around that gets fired up once a quarter – or it’s left powered up all the time, even though it’s not used and it consumes Power?

    One full time Project Manger

    4 IT Staff x 10 days x 4 times a year = 160 Days

    6 Test Users x 4 days x 4 times a year = 100 Days

    This gives us the equivalent of ~2.5 full time employees @ cost to the company of ~50K/pa

    Manpower = 125K

    Hardware = 50K plus?

    Not forgetting any additional manpower needed when there have been big changes since the last test that need to be implemented as well as any failed tests need fixing and then testing again.

    **I have not covered initial design and implementation – just on-going commitment**

    Although this is only a rough guide, what I’m trying to show here is that it’s the labour and/or resources involved with continually testing and verifying the traditional approach that sucks the very lifeblood out of the whole concept. And if you aren’t doing regular testing then it’s probably taking even longer to get things back to a workable state.

    Powerful idea

    One beautiful point here is that simply by referring to this as a Secondary site as opposed to DR and incorporating the concept that this Is part or normal IT capacity, you are now changing the board’s perception of the capital and operational costs in to something that is a percentage of normal IT funding instead of Dead money for kit you hope you don’t use.

    Anyway – that’s my 2 cents on that………………. ;-)

    Additional 5 cents worth below…..

    As soon as you even mention the concept of DR to the board – what they are drawing in their head is a symbolic switch and a mirror.

    You have now done such a good job of selling it that they imagine that when the “proverbial hits the rotating blades” they believe that you will be able to through theswitch at a moments notice and this mirror now magically mimics the production system and all is 100% fantastic.

    Try and rethink the architecture so that servers, services, objects and the like DO actually rely on TCP/IP to traverse the environment – this way you can quite simply redirect or “load balance” traffic, or should it come to it, use DNS to act as your switch to redirect part of the load to Systems that are working.

    Taking this concept further, if you have QoS (Quality of Service) in use at the Cisco/Switch/Router level you can even start pre-defining a “cost” element to traffic negotiating across to the secondary site – this *could* allow you to design a system where you don’t even need to through a switch to cut over at all. Now how cool would that be?

    • Share/Bookmark

    written by dcaddick