DanT's GridBlog  
All | Chess | Cooking | Grid | Java | Misc | Travel
 
Monday August 10, 2009
Another Undocumented Feature

In reading the comments for Issue 409, I came across another undocumented feature I hadn't seen before. Apparently, if you pass a variable to your job through qsub or qrsh with the -v switch, and if that variable starts with SGE_COMPLEX_, the SGE_COMPLEX_ part will be stripped off, and the remainder will be treated as a resource request whose value will be placed in the job's environment.

An example will make it easier to explain. If your job is able to run on multiple architectures, but you always select on which architecture you're running it when you submit, you could add "-v SGE_COMPLEX_arch" to the qsub submission parameters, and the job's environment would then contain the value of arch that was requested as the -l arch=... resource request. In action, it would look like:

% qrsh -l arch=sol-amd64 -v SGE_COMPLEX_arch echo \$arch
sol-amd64

Nice, but why is it useful? Well, maybe your script is capable of operating in multiple environments, but it needs to know about how it was submitted. For example, maybe the script changes your application's startup parameters based on the memory limits. The script could use this feature to get the memory limits from the submission parameters and act accordingly. Of course, it could also get the memory limits from ulimit(1), so maybe not the best example. Licenses may be a better example. The OS is blissfully unaware of license assignments. The only way for your script to find out about how many licenses were requested for it would be to use this feature (or do some clever digging with qstat).

You might have noticed by now that you could get the same effect by just passing in the requested complex value as an environment variable, e.g. "qsub -l arch=sol-amd64 -v arch=sol-amd64 ..." The difference between using the SGE_COMPLEX_ feature and using an environment variable explicitly is that with the SGE_COMPLEX_ feature you don't have to know what the requested value was, i.e. you can add it to an sge_request(5) file or write it into your script. And now we come to the real value. If you have a job that needs to know about its submission parameters, you can embed submission directives to add the needed complexes' values to the environment. Pretty handy. Whenever you can, it's a good idea to make your jobs and scripts as self-contained as possible.

Permalink Comments [0] (2009-08-10 09:16:01.0/2009-08-10 09:16:01.0)
Trackback: http://blogs.sun.com/templedf/entry/another_undocumented_feature
 
Thursday July 30, 2009
Sun HPC Software Workshop '09 -- Early Bird's Almost Over!

Just wanted to remind everyone that the early bird registration for the Sun HPC Software Workshop '09, Sept 7-10 in Regensburg, Germany, ends tomorrow (31 July 2009). It's your last chance to sign up at the discounted rate. After tomorrow, you will still be able to register, but the cost of registration will be higher.

In a nutshell, the Sun HPC Software Workshop '09 is a combination of our annual Grid Engine Workshop, a European edition of the popular Lustre Users Group meeting, and a conference on developing applications and services for HPC and cloud environments. The Workshop lasts three days, with a presentation track representing each of these topics. One the day before the main Workshop starts, we're also holding deeper technology seminars: a Lustre Deep Dive, a Grid Engine admin training, and a class on parallel application development taught by Ruud van der Pas. The Workshop and the preceding seminars are an excellent opportunity to learn more about these technologies and connect with the product engineers, partners, and other community members.

There is an open Call for Presentations for the Workshop, but it also closes tomorrow. If you're interested in proposing a talk for the Workshop (and getting a discounted registration fee if it's accepted), send a title, duration, and brief summary to the email address listed on the Agenda page. But, hurry. We'll be making our final decisions and notifying the speakers soon.

I look forward to seeing you there!

Permalink Comments [0] (2009-07-30 15:00:17.0/2009-07-30 15:00:17.0)
Trackback: http://blogs.sun.com/templedf/entry/sun_hpc_software_workshop_091
 
Tuesday July 21, 2009
Lies, Damned Lies, & DRMs

Some of our competitors seem to be very fond of spreading the rumor that the Sun Grid Engine product team has been laid off and/or that the product has been discontinued. It would appear that since they can't claim to have a better, more scalable, or more cost-effective product, they're willing to go with lying through their teeth to make the sale. Since I keep getting asked this question, I figured it would be worthwhile to post an official response.

To plagiarize Mark Twain, the rumors of our death have been greatly exaggerated. We're still here and going strong. The team is now roughly four times the size it was when I joined six years ago. It spans six offices in five countries on three continents. The product has a road map that reaches out past 2012 (which is as far as we're willing to speculate). We have a massive (if not leading) share in both the open source and licensed DRM system markets, and we're not planning to go away any time soon.

Of course, with the deal with Larry pending, nothing is certain. The only comment I can make there is "no comment." That said, for now at least, it's business as usual. We're still writing code, preparing releases, doing trainings, holding our annual Workshop, etc. Look for the next update this quarter. Look for the next release next year. And look for a whole lot more good stuff coming from our team over the next several updates and releases. With the features that have been added in the 6.2, 6.2u2 and 6.2u3 releases, Sun Grid Engine is in a great position. With what's coming up, I'd resort to lying too, if I worked for one of our competitors.

Permalink Comments [1] (2009-07-21 13:37:51.0/2009-07-21 13:37:51.0)
Trackback: http://blogs.sun.com/templedf/entry/lies_damned_lies_drms
 
Monday July 20, 2009
European Students: Want a Free Laptop?

Are you a student in Europe*? Do you want a new Toshiba laptop? Willing to write some code to get it? Good. Read on.

The OpenSolaris HPC team is currently running a programming contest for European students that was launched at ISC in Hamburg last month. The contest is to write the most performant and scalable implementation of a distributed hash table. Submission can be from teams of up to three people. The top prize is a new Toshiba laptop for each member of the winning team.

For more information, check out the contest site. Better hurry, though, because the contest deadline is coming up quick!

* Contest participation is limited to legal residents of a specific list of European countries. See the contest site for details.


OFFICIAL RULES
NO PURCHASE NECESSARY

1. DESCRIPTION OF THE CONTEST: The Sun HPC Software Student Programming Challenge ISC 2009 ("Contest") is designed to promote the use of the Sun HPC Software, Developer Edition 1.0 for OpenSolaris among students by having them compete to design and implement the most scalable and best-performing implementation of a common parallel algorithm. Prizes will be awarded to those who submit the best entries as determined by the judges in accordance with these Official Rules.

2. ELIGIBILITY: This contest is open only to teams of 1 to 3 currently-enrolled, full- or part-time, undergraduate or graduate, university or college students, who are the legal age of majority in their country, province or state of legal residence and residents of Denmark, France, Germany, Italy, Poland, Russia, Spain, Sweden, Switzerland, and the United Kingdom. Void in Puerto Rico, Quebec and where prohibited by law. Persons in any of the following categories are not eligible to participate or win the prize(s) offered: (a) Employees or agents of Sun Microsystems, their parent companies, affiliates and subsidiaries, participating advertising and promotion agencies, application development partner companies, and prize suppliers; (b) immediate family members (defined as parents, children, siblings and spouse, regardless of where they reside) and/or those living in the same household as any person in (a) above; and (c) employees of any government entity. You must also have access to the Internet and a valid email address in order to enter or win.

3. HOW TO ENTER: This contest begins at 12:01 P.M. Pacific Time (PT) Zone in the United States (e.g. San Francisco time) which is 5:01 A.M. Greenwich Mean Time (GMT) on the 29th of June 2009 and ends at 11:59 P.M. (PT) which is 4:59 A.M. (GMT) on 10th of August 2009 ("Contest Period"). IMPORTANT NOTICE TO ENTRANTS: ENTRANTS ARE RESPONSIBLE FOR DETERMINING THE CORRESPONDING TIME ZONE IN THEIR RESPECTIVE JURISDICTIONS.

4. THE SUBMISSION: Create an implementation of a fault-tolerant distributed hash table as described at http://wikis.sun.com/display/HPCContest/Sun+HPC+Software+Student+Programming+Challenge+ISC+2009 The implementation must be written in C for the OpenSolaris 2009.06 operating environment using the Sun HPC ClusterTools 8.1 OpenMPI implementation and must be submitted as a Sun Studio 12 project. All Entries must include a valid and complete Sun Studio 12 project that builds without errors on an unmodified instance of the Sun HPC Software, Developer Edition 1.0 for OpenSolaris. Entries may be submitted either electronically or via mail. All Entries must be comprised of original work of the submitter(s). No participant may submit an Entry as a member of more than one team.

Electronic Entries must include a 1-3 page written summary of the implementation approach and the name(s) of the submitter(s). The electronic file must be a gzipped tar file that includes the Sun Studio 12 project directory, including all required files, and must be no larger than 5MB in size. If the electronic file is larger than 5MB in size, it must be submitted by mail in accordance with the instructions below. The electronic entry must be sent via email to hpccontest@sun.com and received no later than 11:59 PM (PDT) on August 10th, 2009 in the United States.

Mailed Entries must include a 1-3 page written summary of the implementation approach and the name(s) of the submitter(s), and a CD or DVD containing the project code as described above. All mailed Entries must be sent to Sun HPC Software Programming Challenge, c/o Sun Microsystems, Inc., 17 Network Circle, Menlo Park, CA 94025, MS-MPK17-207, and must be received no later than 11:59 PM (PDT) on August 10th, 2009 in the United States.

All Entries must be in English. Registration or Entries that are in any other language will not be considered. Entries that are lewd, obscene, pornographic, disparaging of the Sponsor or otherwise contain objectionable material may be disqualified in the Sponsor's sole and unfettered discretion.

5. JUDGING: All Entries will be judged by a panel of experts based on the following equally weighted judging criteria: data retrieval throughput for requests coming from a single node, data retrieval throughput for parallel requests coming from multiple nodes, ability to withstand processing node failure, and scalability with respect to number of processing nodes and number of data items. In the event of a tie, the person or team among the tied Entries with the highest score in scalability with respect to number of processing nodes and number of data items will be declared the winner. In the event that no entries are received, no prize will be awarded. Decisions of judges are final and binding. Winner will be notified by email.

6. PRIZES AND APPROXIMATE RETAIL VALUE: First prize: Toshiba OpenSolaris laptop valued at approximately $2,000. Second and third prizes: Apple iPod valued at approximately $150. Up to three Toshiba laptops and six Apple iPods may be awarded. Prize includes round-trip coach air transportation for one person from major airport nearest winner's residence and hotel accommodations for one person for four nights. Hotel accommodations at Sponsor's discretion. Certain black out dates apply. In the event the Sun HPC Software Workshop is cancelled or postponed for any reason, Sponsor reserves the right to award the remainder of the prize with no further obligation to the winner. All other expenses not specified herein are the responsibility of the winner. ALL TAXES AND ANY APPLICABLE WITHOLDING AND REPORTING REQUIREMENTS ARE THE SOLE RESPONSIBILITY OF THE WINNER. Cash prizes will be awarded in US Dollars. All costs associated with currency exchange are the sole responsibility of the winner.

7. CONDITIONS OF PARTICIPATION. Sponsor reserves the right to substitute a prize for an item of equal or greater value in the event all or part of a prize becomes unavailable. Prizes are awarded without warranty of any kind from Sponsor, express or implied, without limitation, except where this would be contrary to federal, state, provincial, or local laws or regulations. All federal, state, provincial and local laws and regulations apply. Submission of entry into this Contest deems that entrants agree to be bound by the terms of these Official Rules and by the decisions of Sponsor, which are final and binding on all matters pertaining to this Contest. Return of any prize/prize notification may result in disqualification and selection of an alternate winner. Any potential winner who cannot be contacted within 15 days of attempted first notification will forfeit his/her prize. Potential prize winner(s) may be required to sign and return an Affidavit or Declaration of Eligibility/Liability & Publicity Release within 30 days following the date of first attempted notification. Failure to comply within this time period may result in disqualification and selection of an alternate winner. Travel companion of winner must also execute an Affidavit of Eligibility/Liability & Publicity Release prior to ticketing and must possess required travel documents (e.g. valid photo I.D.) prior to departure. Once the travel schedule has been arranged, it cannot be altered and failure of winner to follow such schedule shall not obligate Sponsor in any way to provide the winner with alternate arrangements. The intellectual and industrial property rights to the contest submission, if any, will remain with the participants, except that these terms do not supersede any other assignment or grant of rights according to any other separate agreements between participants and other parties. As a condition of entry, participants agree that Sun shall have the right to use, copy, modify and make available the application or code in connection with the operation, conduct, administration, and advertising and promotion of the Contest via communication to the public, including, but not limited to the right to make screenshots, animations and video clips available to the public for promotional and publicity purposes. Notwithstanding the foregoing, ownership of and all intellectual and industrial property rights in and to the application and code shall remain with the participant. Acceptance of the prize constitutes permission for, and winners consent to, Sponsor and its agencies to use a winner's name and/or likeness and entry for advertising and promotional purposes without additional compensation, unless prohibited by law. To the extent permitted by law, entrants, agree to hold Sponsor, its parent, subsidiaries, agents, directors, officers, employees, representatives and assigns harmless from any injury or damage caused or claimed to be caused by participation in the Contest and/or use or acceptance of any prize won, except to the extent that any death or personal injury is caused by the negligence of the Sponsor. Sponsor is not responsible for any typographical or other error in the printing of the offer, administration of the Contest or in the announcement of the prize. A participant may be prohibited from participating in this Contest if, in the Sponsor's sole discretion, it reasonably believes that the participant has attempted to undermine the legitimate operation of this Contest by cheating, deception, or other unfair playing practices or annoys, abuses, threatens or harasses any other participants, the Sponsor or associated agencies. In the event a winner/potential winner's employer has a policy, which prohibits the awarding of a prize to an employee, the prize will be forfeited and an alternate winner will be selected.

8. NO RECOURSE TO JUDICIAL OR OTHER PROCEDURES: To the extent permitted by law, the rights to litigate, to seek injunctive relief or to make any other recourse to judicial or any other procedure in case of disputes or claims resulting from or in connection with this contest are hereby excluded, and any participant expressly waives any and all such rights.

Participants agree that these Official Rules are governed by the laws of California, USA.

9. DATA PRIVACY: Participants agree that personal data, especially name and address, may be processed, stored and otherwise used for the purposes and within the context of the contest and any other purposes outlined in these Official Rules. The data may also be used by the Sponsor in order to check participants' identity, their postal address and telephone number, or to otherwise verify their eligibility to participate in the Contest and to receive any prize. Participants have a right to access, review, rectify or cancel any personal data held by the Sponsor by writing to Sponsor (Attention: Daniel Templeton) at the address listed below. If participant's data is not provided or is canceled participants' Entries will be ineligible.

10. WARRANTY AND INDEMNITY: Entrants certify that their entry is original and that they are the sole and exclusive owner and right holder of the submitted entry and that they have the right to submit the Entry in the Contest. Each participant agrees not to submit any Entry that (1) infringes any 3rd party proprietary, intellectual property, industrial property, personal rights or other rights, including without limitation, copyright, trademark, patent, trade secret or confidentiality obligation; or (2) otherwise violates applicable law in any countries in the world. To the maximum extent permitted by law, each participant indemnifies and agrees to keep indemnified the Sponsor its parent, subsidiaries, agents, directors, officers, employees, representatives and assigns harmless at all times from and against any liability, claims, demands, losses, damages, costs and expenses resulting from any act, default or omission of the participant and/or a breach of any warranty set forth herein. To the maximum extent permitted by law, each participant indemnifies and agrees to keep indemnified the Sponsor, its parent, subsidiaries, agents, directors, officers, employees, representatives and assigns harmless at all times from and against any liability, actions, claims, demands, losses, damages, costs and expenses for or in respect of which the Sponsor will or may become liable by reason of or related or incidental to any act, default or omission by a participant under these Official Rules including without limitation resulting from or in relation to any breach, non-observance, act or omission whether negligent or otherwise, pursuant to these official rules by a participant.

11. ELIMINATION: Any false information provided within the context of the Contest by any participant concerning identity, postal address, telephone number, ownership of right or non-compliance with these rules or the like may result in the immediate elimination of the participant from the Contest. Sponsor further reserves the right to disqualify any Entry that it believes in its sole and unfettered discretion infringes upon or violates the rights of any third party or otherwise does not comply with these official rules.

12. INTERNET: Sponsor is not responsible for electronic transmission errors resulting in omission, interruption, deletion, defect, delay in operations or transmission. Sponsor is not responsible for theft or destruction or unauthorized access to or alterations of entry materials, or for technical, network, telephone equipment, electronic, computer, hardware or software malfunctions or limitations of any kind. Sponsor is not responsible for inaccurate transmissions of or failure to receive entry information by Sponsor on account of technical problems or traffic congestion on the Internet or at any Web site or any combination thereof, except to the extent that any death or personal injury is caused by the negligence of the Sponsor. If for any reason the Internet portion of the program is not capable of running as planned, including infection by computer virus, bugs, tampering, unauthorized intervention, fraud, technical failures, or any other causes which corrupt or affect the administration, security, fairness, integrity, or proper conduct of this Contest, Sponsor reserves the right at its sole discretion to cancel, terminate, modify or suspend the Contest. Sponsor reserves the right to select winners from eligible entries received as of the termination date. Sponsor further reserves the right to disqualify any individual who tampers with the entry process. Caution: Any attempt by a contestant to deliberately damage any Web site or undermine the legitimate operation of the game is a violation of criminal and civil laws and should such an attempt be made, Sponsor reserves the right to seek damages from any such contestant to the fullest extent of the law.

13. If any provision(s) of these Official Rules are held to be invalid or unenforceable, all remaining provisions hereof will remain in full force and effect.

14. WINNER'S LIST: For winner's name, log onto http://wikis.sun.com/display/HPCContest on or about August 14th, available for a period of up to 60 days.

15. SPONSOR: The Sponsor of this Contest is Sun Microsystems, Inc., 4220 Network Circle, Santa Clara, CA 95054.

Permalink Comments [0] (2009-07-20 13:37:24.0/2009-07-20 13:37:24.0)
Trackback: http://blogs.sun.com/templedf/entry/european_students_want_a_free
 
Sun HPC Software Workshop '09

Every year, usually in the autumn, we have a Grid Engine workshop, usually at the Grid Engine home base in Regensburg, Germany. (Last year was an exception in that we held the conference in the spring in Oakland. What were we thinking?) This year will be no exception. September 7-10 at the Best Western Premier in Regensburg, Germany, we'll be holding the next Grid Engine workshop. What is exceptional about this year, though, is that we're expanding the scope to be about all of Sun's HPC software offerings.

This year, the workshop will offer three separate tracks. One track will be essentially the Grid Engine workshop that we all know and love. The second track will be focused on Open Storage technologies, like Lustre, SAM-QFS, ZFS, etc. The last track will be about development tools and technologies for HPC and the cloud, including Sun's HPC developer tools, Hadoop, Fortress, the Sun Cloud, etc.

If you're interested in any of these technologies, especially Grid Engine and/or Lustre, this is a conference you won't want to miss. And as an added incentive, the conference falls squarely in the middle of the Regensburger Herbstdult, which is the city's autumn festival. In US terms, it's a lot like a county fair with beer tents. In general, think mini-Oktoberfest. Monday (Sept. 7th) night, we'll take a delegation of folks from the conference over to the Dult for an evening of socializing over a few liters of beer. (I have empirically proven my limit to be 2.5L in a sitting.)

The Call For Presentations for the conference is open until the end of July. If you're doing something interesting with one, some, or all of these technologies, we'd love to hear from you. We have presentation slots open that are 15, 25, and 55 minutes long. In addition, if your talk is selected for the Workshop, you will get a discounted registration fee. For details, click on the Call for Presentations tab on the Workshop site.

And as if all that wasn't tempting enough, Monday, September 7th, the first day of the Workshop, will be devoted to deep-dive seminars. These will include a full-day Grid Engine administration training, a Lustre internals deep dive, and a parallel programming class. These is an additional fee for attending the seminars, and there are a limited number of seats. If you're interested, sign up now!

I hope to see you there! (Look for updates on Twitter via the #sunhpc09 hash tag.)

Permalink Comments [0] (2009-07-20 13:17:55.0/2009-07-20 07:54:42.0)
Trackback: http://blogs.sun.com/templedf/entry/sun_hpc_software_workshop_09
 
Friday July 17, 2009
Shameless

So, you may have noticed that I haven't been blogging much lately. That's partially because I'm completely swamped and partially because I've started tweeting a lot of the things that I would have normally blogged. I'm finding that for posting links or sending out tips and tricks, Twitter is lower overhead than blogging.

One of the things I've been doing on Twitter is a Grid Engine Tip of the Day. Most of the time it's something that I just answered on the mailing list. In general, though, it's intended to be little things that you might not have realized or known about.

Something I really like about Twitter is that it's more conversational. Yes, you can leave comments on my blog posts, and yes it technically fills the same purpose, but it just seems so much more natural to just ask a question on Twitter. Of course, before asking a question is helpful, I have to build a following. To that end, I've started doing something else on Twitter. When people respond to the questions I ask, I've been sending the first ones "thank-you gifts", which thus far has been 4GB USB memory sticks with the OpenSolaris logo. Think of it as positive reinforcement*.

I would love to hear your opinions here or on Twitter about the use of Twitter for conveying the kind of information that I'm prone to try to covey

*: I reserve the right to be completely arbitrarily about when and to whom I send something and what I send them, if I send them anything. Proper positive reinforcement demands randomness.

Permalink Comments [0] (2009-07-17 10:48:18.0/2009-07-17 10:48:18.0)
Trackback: http://blogs.sun.com/templedf/entry/shameless
 
Wednesday April 01, 2009
Strange Times

I just saw the news that Rackable Systems just bought sgi. Oh, how the mighty have fallen. Reminds me of when Wizards of the Coast bought TSR. I hope that's the last acquisition news we here for a while...

By the way, I've started tweeting links like this one, rather than blogging them. If you don't want to miss out on any of the fun, follow me there. (I've also started tweeting a Grid Engine tip of the day.)

Permalink Comments [0] (2009-04-01 06:43:35.0/2009-04-01 06:43:35.0)
Trackback: http://blogs.sun.com/templedf/entry/strange_times
 
Friday March 20, 2009
Rube Goldberg Gone Wild

I just can't not post this. Assuming they're not cheating by editing the film, this is easily the largest Rube Goldberg machine I've ever seen, and they're really creative about the elements they used. They do lose points for not using live animals, though.

(Anyone have a better link for this video? I'm sure it's on YouTube or Google Video somewhere, but at the moment, I can't get it to play again, so I don't have details with which to search for it.)

Permalink Comments [0] (2009-03-20 05:48:54.0/2009-03-20 05:48:54.0)
Trackback: http://blogs.sun.com/templedf/entry/rube_goldberg_gone_wild
 
Thursday March 19, 2009
Podcast: New Installer in Sun Grid Engine 6.2 Update 2

I just posted a new podcast on the new installer in Sun Grid Engine 6.2u2. Check it out.

Permalink Comments [0] (2009-03-19 14:05:27.0/2009-03-19 14:05:27.0)
Trackback: http://blogs.sun.com/templedf/entry/podcast_new_installer_in_sun
 
Monday March 16, 2009
New Installer in Sun Grid Engine 6.2 Update 2

In my previous post, I talked about the new installer that is included with Sun Grid Engine 6.2u2. Lubos, one of our core team (as opposed to Service Domain Manager or QA) engineers in Prague, has just posted a couple of videos of the new installer. The first one shows how to make sure the new installer can be used with the machines you're planning to use for your cluster. Because the new installer can install an entire cluster at once, it has to be able to contact all the machines destined for the cluster, and that's where the setup comes in. The second one actually shows off the new installer. Lubos also has some screenshots of the new installer posted.

Permalink Comments [0] (2009-03-16 07:24:35.0/2009-03-16 07:24:35.0)
Trackback: http://blogs.sun.com/templedf/entry/new_install_in_sun_grid
 
Thursday March 05, 2009
Sun Grid Engine 6.2 Update 2 Is Out!

Sun Grid Engine 6.2u2 is now available. If you're not excited, you should be. First off, don't let the name fool you. 6.2u2 is not just bug fixes. It's a full feature release, and contains some great features. What features? Glad you asked.

First and foremost, job submission verifiers (JSVs). It's a feature we added specifically for TACC, but it's one that will be useful for almost everyone. In fact, I suspect that we'll discover it's the answer to some of the classic Sun Grid Engine problems. What is it? Before 6.2u2, there was no way to prevent a job from being submitted. It was (and still is) possible to choose not to schedule a job after it's been submitted, but before 6.2u2, that's all you could do. With 6.2u2 and JSV, you now have the option to insert a step between submission and acceptance. With that step, you can choose to accept or reject the job submission, but you can also choose to modify the job before accepting it, and that's where the magic comes in.

The verification step is handled through scripts or binaries. There's a new submission option, -jsv, that adds a JSV to the submission. That means you can pick up JSVs from anywhere that you can stash a submission option: most notably the global sge_request file, your user sge_request file, and the directory's sge_request file, but also DRMAA native specification, DRMAA job category, the enigmatic -@ switch, and, of course, the command line itself. The -jsv switch is cumulative, so if you have one in several of those places, several JSVs will be run for your submission. It's worth noting that all of the above listed JSV sources are controlled by the user, except the global sge_request file, and even that can be overridden with the -clear switch.

So far, we've only talked about the client side. JSVs can also come in on the server side. In the global host configuration an administrator can configure a single JSV. Unlike on the client side where every JSV is started from scratch with every job submission, on the server side the JSV is started once and queried repeatedly. The reason is that on the client side, performance isn't a big issue, but on the server side, the cost of forking and execing the JSV for every job submission can have a huge impact. By keeping the JSV running, we save that cost. The big advantage of the server-side JSV is that users can't circumvent it. If you really need to enforce a policy with a JSV, the server side is that place to do it.

Now, if you're thinking fast, you might question the point of the server-side JSV when users can change everything about the job using qalter after it's submitted. Well, so did we. When you configure a server-side JSV, users are no longer allowed to modify jobs after submission unless you specifically grant the ability to do so, and even then it's limited to the job attributes that you allow them to modify.

JSV is a huge topic, and I could probably go on for days about it. Instead I'll save it for a white paper and move on.

The next big feature in 6.2u2 is the new installer. You now have the option of using the old interactive text-based installer or a new graphical installer. The graphical installer has several important advantages. First, it lets you install an entire cluster at once. It actually sits on top of the auto-installer and reuses that same functionality to install remote nodes. The graphical installer, however, will first verify that all the nodes are reachable before the installation starts, so the installation won't quietly hang on an unreachable node. It also accepts wildcarded host name and IP address ranges, which makes installing a huge cluster much simpler.

The third major feature is that we've added support for Microsoft Windows Vista (Ultimate and Enterprise) and Server 2003R2 and 2008. Both 32-bit and 64-bit version are available. Harald (who you should encourage to start blogging!) worked really hard on ironing out the issues with the changes in the OS. We still rely on SFU for the Windows execution daemons, except that it's now called SUA.

The fourth big feature is job-level parallel job resource requests. Before 6.2u2, whenever a parallel job requested a resource, SGE would implicitly multiply that resource request by the number of assigned slaves (because each slave requests the resource on the host where it runs). That makes sense with, say, memory, where requesting 4GB really means that every slave should have 4GB. It doesn't make any sense for other things, like some software licenses. Now with 6.2u2, the administrator can flag a resource as job level, meaning that it is not multiplied by the number of assigned slaves when requested by a parallel job. In most cases, a resource that shouldn't be multiplied in for one job, shouldn't be multiplied for any job. There may be exceptions to the rule, but I doubt there will be many. I'd love to hear your feedback, though.

The last two new features aren't so much features as improvements. Starting with 6.2u2, the 64-bit Linux binaries use the jemalloc library instead of the default Linux malloc. The performance and memory footprint impact is significant, in some cases as much as 20% improvement. Also, starting with 6.2u2, the Linux binaries use poll() instead of select() in the commlib. For some flavors of Linux, the use of select() made it difficult to scale past a couple thousand hosts. With the commlib now using poll(), I've seen SGE scale well over 6000 Linux nodes.

And on top of all that, there is the usual pile of bug fixes. A handful of qmaster and scheduler issues cropped up recently in 6.2 and 6.2u1, but with 6.2u2 those should all now be resolved.

I highly recommend giving 6.2u2 a try, if for no reason other than JSV. Let me know what you think!

Permalink Comments [2] (2009-03-05 09:46:14.0/2009-03-05 09:46:14.0)
Trackback: http://blogs.sun.com/templedf/entry/sun_grid_engine_6_21
 
Sunday February 22, 2009
I Like This Guy

At the Omniture Summit '09 last week I listened to a keynote presentation by George Colony, head honcho over at Forrester. I found his style very entertaining and his points mostly on target. I recently took a look at his blog, and I think I really like this guy. His blog is definitely worth a read. (I also love the irony that in the brave new world of social media, I, Joe Nobody, can announce with a straight face to my faceless readership that I approve of the founder of Forrester. I also approve of Peter Gabriel and Scott McNealy, by the way.)

Permalink Comments [0] (2009-02-22 10:00:00.0/2009-02-22 10:00:00.0)
Trackback: http://blogs.sun.com/templedf/entry/i_like_this_guy
 
Saturday February 21, 2009
The Future Is Now

Friday I heard about the Esquire cover for the first time. Wow. It is apparently the most expensive cover spread in history. It was only a matter of time. It's scary to think where this is going. Someday your cereal box will talk to you, even without the help of hallucinogens.

Permalink Comments [0] (2009-02-21 08:00:00.0/2009-02-21 08:00:00.0)
Trackback: http://blogs.sun.com/templedf/entry/the_future_is_now
 
Friday February 20, 2009
Executable Web Deja Vu

George Colony spoke in yesterday morning's Omniture Summit 09 keynote, and one of his eight big messages for CEOs was that the web is moving to what he calls "the executable web," meaning that instead of your browser being just a file viewer, it will become an application platform for applications downloaded off the web. Gee. That sounds kinda familiar... like from about a decade ago! Applets anyone? It really ticks me off that "Java™ is dead" while at the same time, everyone is busily trying to recreate Java in JavaScript. Hey, I know! Why not just fragging use Java? JavaScript is ugly and painful. AJAX is an order of magnitude worse. (Don't tell me about how much better Python and Ruby are than Java. That's a different topic.) I know that Sun is notorious for being way too early to the market with its technologies. Yes, applets did suck ten years ago. (Although, ten years ago there wasn't an alternative for RIAs, so at the same time it was revolutionary, cutting-edge technology.) Have you looked at applets lately? Check this out. Let's see you port that to JavaScript. I think it's just silly that with the current state of Java technology people are still thinking about the platform based on impressions from last millennium. Wake up and smell the coffee.

Permalink Comments [3] (2009-02-20 08:00:00.0/2009-02-20 08:00:00.0)
Trackback: http://blogs.sun.com/templedf/entry/executable_web_deja_vu
 
Thursday February 19, 2009
Omniture Summit '09

I'm out at the Ominture Summit '09 in Salt Lake City this week, and I'm very pleased to report that it doesn't suck. I was very worried that I was paying three grand for a two-day sales pitch, but I took a leap of faith and registered anyway. Turns out that feedback from previous years' conferences have inspired Omniture to dial back the product pitches and user-oriented content. This year, there has been a wealth of useful information on Internet marketing in general and very few product sales pitches.

And as an added bonus, as I'm writing this post, I just won a Corsair vintage radio from Vintage Tub & Bath for being quick to raise my hand.

The big take-aways for me have been:

  • I think I finally get Twitter. I singed up for a Twitter account, just to play with it, but I hadn't quite comprehended how Twitter is useful for a company or organization. Twitter has been a big theme at this conference. Everyone is trying to figure out what to do with this untested new marketing channel. Look for more from me there in the near future.
  • An important point that I missed before is that to be successful in the brave new world of social networking, you have to be a full participant. It's not enough to just broadcast. The communication has to be 2-way.
  • Building a community is a lot like creating viral media. There's no formula. You can't just create a community. All you can do is seed the ground and hope something grows. You can, however, do a lot to encourage the right things to grow and to help things along. In the end, it really does still come down to content.
  • Brand has to be pervasive. Martin Lindstrom calls it "smashable brand," meaning that the brand should be recognizable by even the smallest fragment. Is your website obviously your if you take away the logos and products? His new book, Buy-ology, looks pretty interesting. We all got free copies, so I'll let you know if it was after I've had a chance to read it.
  • Mobile is the next marketing frontier. Makes me glad I don't have a data plan.

Permalink Comments [0] (2009-02-19 15:20:18.0/2009-02-19 15:20:18.0)
Trackback: http://blogs.sun.com/templedf/entry/omniture_summit_09
 
 
Calendar
« November 2009
SunMonTueWedThuFriSat
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
     
       
Today
Blog::Navigation
Bookmarks::Grid Engine
Bookmarks::Blogroll
Bookmarks::News
Link to DanT's GridBlog

Link to DanTs GridBlog

Site notes

This page validates as XHTML 1.0, and will look much better in a browser that supports web standards, but it is accessible to any browser or Internet device. It was created using techniques detailed at glish.com/css/.
Powered by Roller Weblogger.

XML