{"id":127,"date":"2011-04-28T19:26:30","date_gmt":"2011-04-29T01:26:30","guid":{"rendered":"http:\/\/mattdturner.com\/wordpress\/?p=127"},"modified":"2011-05-02T10:21:18","modified_gmt":"2011-05-02T16:21:18","slug":"rocks-linux-node-install-error-boots-to-grub","status":"publish","type":"post","link":"http:\/\/mattdturner.com\/wordpress\/2011\/04\/rocks-linux-node-install-error-boots-to-grub\/","title":{"rendered":"Rocks Linux Node Install Error, Boots to Grub"},"content":{"rendered":"<p><a href=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/rocks-logo.png\"><\/a><a href=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/pyramids.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-138\" title=\"pyramids\" src=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/pyramids.png\" alt=\"People have built great things with rocks\" width=\"576\" height=\"288\" srcset=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/pyramids.png 576w, http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/pyramids-300x150.png 300w\" sizes=\"(max-width: 576px) 100vw, 576px\" \/><\/a><br \/>\nWe had a problem recently with one of our nodes that required replacing the motherboard. \u00a0After replacing the motherboard, I was unable to install the node on our cluster again. \u00a0I would run &#8220;insert-ethers&#8221; on the head node and then turn on the compute node. \u00a0Insert-ethers would detect a new device, however the node would never request a kickstart file.<\/p>\n<p>After many different attempts to remedy the problem, I was finally able to get the node to instal. \u00a0This post will describe the steps I took in order to bring life back to the node.<\/p>\n<p><!--more--><\/p>\n<p><strong>Ensure the node is set to PXE boot<\/strong><\/p>\n<ol>\n<li>The first thing you should do is enter the BIOS of the troublesome node.<\/li>\n<li>In the BIOS, make sure that an onboard LAN port is enabled and ensure that the Onboard LAN option is set to PXE.<\/li>\n<li>In our BIOS these options were both under the &#8220;Advanced&#8221; tab. \u00a0Within this tab, enter the &#8220;Advanced PCI\/PnP Setting&#8221; (See image below). \u00a0I apologize for the poor image quality.<br \/>\n<a href=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-130\" title=\"bios1\" src=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios1.jpg\" alt=\"Enable onboard LAN and PXE in BIOS\" width=\"546\" height=\"444\" srcset=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios1.jpg 682w, http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios1-300x244.jpg 300w\" sizes=\"(max-width: 546px) 100vw, 546px\" \/><\/a><\/li>\n<li>Now that we know LAN is enabled and set to use PXE, we need to make sure that the network (LAN) is set to be the first boot device.<\/li>\n<li>Still in the BIOS, move to the BOOT tab (might be different in different BIOS&#8217;s) and enter the Boot Device Priority option.<\/li>\n<li>Make sure that 1st boot device is set to your Network option (see image below). \u00a0 Again, I apologize for the poor image quality.<a href=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-133\" title=\"bios2\" src=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios2.jpg\" alt=\"Select boot order in BIOS\" width=\"553\" height=\"465\" srcset=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios2.jpg 691w, http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/bios2-300x252.jpg 300w\" sizes=\"(max-width: 553px) 100vw, 553px\" \/><\/a><\/li>\n<li>One all of these changes have been made, press F10 to save and exit. \u00a0Now just watch what happens when the node boots. \u00a0If you are lucky, this will solve your problems.<\/li>\n<\/ol>\n<p>If the node still boots to the GNU GRUB command prompt, this most likely means that there is a problem in the local OS. \u00a0By default, Rocks tries to run some code locally and then switch to PXE boot. \u00a0The best option is to wipe the local hard drive on the node using something like badblocks.<\/p>\n<p><strong>Wipe the local hard drive<\/strong><\/p>\n<ol>\n<li>Remove the hard drive from the node, and insert it as a secondary hard drive on a separate node (see image below). \u00a0On our cluster, removing the hard drive was simple. \u00a0All that was required was to push a tab on the front of the node and pull the hard drive out. \u00a0The quality of this picture is even worse than the others. \u00a0The area was not well lit and i was using my phone.<a href=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/hard-drive.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignright size-full wp-image-135\" title=\"hard drive\" src=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/hard-drive.png\" alt=\"Remove the hard drive from the node\" width=\"286\" height=\"232\" srcset=\"http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/hard-drive.png 477w, http:\/\/mattdturner.com\/wordpress\/wp-content\/uploads\/2011\/04\/hard-drive-300x242.png 300w\" sizes=\"(max-width: 286px) 100vw, 286px\" \/><\/a><\/li>\n<li>After placing the hard drive in a new node, access the node either remotely or locally log into the system.<\/li>\n<li>Type &#8220;fdisk -l&#8221; to make sure that the node was detected. \u00a0On our cluster, each node&#8217;s main hard drive was &#8216;\/dev\/sda&#8217; and the secondary hard drive is &#8216;\/dev\/sdb&#8217;.<\/li>\n<li>If you see the secondary node, the next step is to wipe the drive. \u00a0Do this by using something like badblocks.\n<pre class=\"brush: plain; light: true; title: ; notranslate\" title=\"\">badblocks -wvf \/dev\/sdb<\/pre>\n<\/li>\n<li>This only needs to be run for a few minutes in order to wipe the beginning of the drive.<\/li>\n<li>After you have wiped the drive, replace it into the troublesome node and run insert-ethers again.<\/li>\n<\/ol>\n<p>After wiping the hard drive, i now got a new error message when trying to boot the node. \u00a0The error message was:<\/p>\n<pre class=\"brush: plain; light: true; title: ; notranslate\" title=\"\">Reboot and Select proper Boot device or Insert Boot Media in selected Boot device and press a key<\/pre>\n<p>When I was attempting to get the node to boot, making the changes mentioned previously about the BIOS was the last step I took. \u00a0The message about selecting a proper boot device was caused by the fact that I was not performing a PXE boot. \u00a0Switching the BIOS to PXE boot first solved the problem for me.<\/p>\n<p><strong>Force PXE Boot<\/strong><\/p>\n<p>If you still are unable to PXE boot, the only other possible solution I can think of is to force a PXE boot. \u00a0When the node is booting up, and you first get information posted to the monitor, press F12 to force a PXE boot.<\/p>\n<p>Hopefully one of these solutions works!<\/p>\n<div class=\"zemanta-pixie\" style=\"margin-top: 10px; height: 15px;\"><a class=\"zemanta-pixie-a\" title=\"Enhanced by Zemanta\" href=\"http:\/\/www.zemanta.com\/\"><img decoding=\"async\" class=\"zemanta-pixie-img\" style=\"border: none; float: right;\" src=\"http:\/\/img.zemanta.com\/zemified_e.png?x-id=44f57f5e-082a-49e1-bc4c-44c0d119b161\" alt=\"Enhanced by Zemanta\" \/><\/a><\/div>\n","protected":false},"excerpt":{"rendered":"<p>We had a problem recently with one of our nodes that required replacing the motherboard. \u00a0After replacing the motherboard, I was unable to install the node on our cluster again. \u00a0I would run &#8220;insert-ethers&#8221; on the head node and then turn on the compute node. \u00a0Insert-ethers would detect a new device, however the node would [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[3,64,88],"tags":[116,113,118,114,117,50,84,115,119,121,120,23],"class_list":["post-127","post","type-post","status-publish","format-standard","hentry","category-command-line","category-linux-2","category-system-admin","tag-bios","tag-booting","tag-error-message","tag-gnu-grub","tag-hard-disk-drive","tag-linux","tag-operating-systems","tag-preboot-execution-environment","tag-pxe","tag-rocks","tag-rocks-linux","tag-source-code","no-thumb"],"_links":{"self":[{"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/posts\/127"}],"collection":[{"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/comments?post=127"}],"version-history":[{"count":11,"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/posts\/127\/revisions"}],"predecessor-version":[{"id":197,"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/posts\/127\/revisions\/197"}],"wp:attachment":[{"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/media?parent=127"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/categories?post=127"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/mattdturner.com\/wordpress\/wp-json\/wp\/v2\/tags?post=127"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}