CN104850473A - Starting up test method of server in low temperature environment - Google Patents

Starting up test method of server in low temperature environment

Info

Publication number
CN104850473A
Authority
CN
China
Prior art keywords
test
server
equipment
bios
starting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510238030.0A
Other languages
Chinese (zh)
Other versions
CN104850473B (en)
Inventor
吴仲品
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tong Yi Yi Information Technology Co., Ltd.
Original Assignee
Shenzhen Guo Xinheng Space Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Guo Xinheng Space Technology Co Ltd filed Critical Shenzhen Guo Xinheng Space Technology Co Ltd
Priority to CN201510238030.0A priority Critical patent/CN104850473B/en
Publication of CN104850473A publication Critical patent/CN104850473A/en
Application granted granted Critical
Publication of CN104850473B publication Critical patent/CN104850473B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Test And Diagnosis Of Digital Computers (AREA)

Abstract

The invention relates to the field of servers, and in particular to a method for testing server startup in a low-temperature environment. The method comprises the following steps: A, modifying the BIOS; B, modifying the Linux GRUB configuration; C, connecting an external test host through the serial port to monitor the state of the BIOS during startup; D, modifying the system configuration file so that the system shuts down automatically after booting and records each boot time; E, setting the test chamber and the AC Source so that the server is started only when it is completely cold; F, collecting the test logs and ending the test. A script lets the equipment shut down normally after it boots, which excludes the influence of abnormal shutdown on the equipment; the AC Source settings and the script ensure that the equipment and its components are started only after they have cooled completely, so residual heat cannot affect the test result; and the test equipment works together to record the results comprehensively (hardware, BIOS and operating system) and to play back failures.

Description

A method for testing server startup at low ambient temperature
Technical field
The invention relates to the field of servers, and in particular to a method for testing server startup at low ambient temperature.
Background
The term "big data" is mentioned more and more often. People use it to describe and define the massive volume of data produced in the age of the information explosion, and to name the related technical developments and innovations. Data is expanding rapidly and will shape the future of enterprises; although an enterprise may not yet recognize the problems hidden in explosive data growth, the importance of data to the enterprise will become increasingly clear over time. The big data era poses new challenges to people's ability to manage data, and it also offers unprecedented room and potential for gaining deeper and more comprehensive insight. Among big data applications, demand from smart devices is growing fastest: smartphones, smart homes and smart wearables all need big data behind them. Portable smart devices, especially smartphones and wearables, are developing most rapidly and have made life much more convenient, but they also bring a greater challenge to the equipment that provides the big data, namely the server. Portability means that wherever there are people there are smart devices, wherever there are smart devices there must be big data to support them, and wherever there is big data there are servers.
As a result, the requirements on the server's operating environment are becoming increasingly harsh, and traditional servers that can only work in a machine room no longer meet the demand. Military applications also place extremely harsh demands on the server's operating environment. We must therefore raise the testing standard for servers and develop servers that are more reliable and better able to adapt to harsh environments.
In existing low-temperature tests of servers, the low-temperature environment is generally used to accelerate the exposure of equipment problems. The usual procedure is: place the equipment in the low-temperature environment and connect it to the AC Source; configure the equipment to boot as soon as power is applied, and program the AC Source power cycle, for example four minutes on and one minute off; power-on, boot, power-off and shutdown then repeat continuously for the number of cycles set on the AC Source; after the test, check whether the equipment has restarted the theoretically expected number of times.
In this conventional boot test, cycling the power does verify whether the server can boot. However, components such as the CPU heat up during boot, and the server is started again immediately after shutdown, before it has cooled. The shutdown is also abnormal, because power is simply cut. The damage that abnormal shutdown causes to the equipment therefore cannot be ruled out as a factor, and it is difficult to obtain detailed logs of failures during the test.
Summary of the invention
The invention provides a method for testing server startup at low ambient temperature. The system shuts itself down automatically, which solves the prior-art technical problem that damage caused by abnormal shutdown affects the test result. The method also cools the server first and then raises the temperature, ensuring that the server temperature is 0 °C during the test and avoiding any influence from the heat the server generates while working.
To solve the above technical problem, the server startup test method designed by the invention comprises the following steps:
A. Modify the BIOS so that the server powers on automatically when AC power is applied;
B. Modify the Linux GRUB settings to ensure that the server enters the system smoothly after power-on;
C. Connect an external test host through the serial port to monitor the state of the BIOS during startup;
D. Modify the system configuration file so that the system shuts down automatically after booting and records each boot time;
E. Set the test chamber and the AC Source to ensure that the server starts only when completely cold;
E1. Cold-soak the server at -5 °C for 2 hours;
E2. Raise the chamber to 0 °C, at which point the AC Source starts supplying power to the server;
E3. The server shuts down automatically under system control;
E4. After the equipment is powered off, lower the chamber temperature from 0 °C to -5 °C and hold at -5 °C for 5 minutes;
E5. Repeat steps E2 to E4 until the server test is complete;
F. Collect the test logs; the test ends.
The scheme designed by the invention has the following advantages. A low-temperature environment increases the probability of exposing latent problems and helps R&D resolve potential threats. The test checks whether the equipment and all of its components work normally at low temperature and whether every component can start normally after being thoroughly cooled, so it covers both the stability and low-temperature defects of the whole machine and the low-temperature defects of individual components. For independently developed, non-distribution Linux systems, it tests system stability and compatibility. It tests the stability of the BIOS in booting the system. It tests the stability of the power module and of the equipment under different AC conditions. A script lets the equipment shut down normally after booting, which excludes the influence of abnormal shutdown on the equipment. The AC Source settings and the script ensure that the equipment and its components start only after all of them are cold, so residual heat cannot affect the test result. The test equipment works together to record the results comprehensively (hardware, BIOS, operating system) and to play back failures. For branded machines sold in fixed configurations, it can test the hardware compatibility and stability of different configurations (CPU, memory and hard disk combinations of different capacities and vendors). The environment is simple to set up, the test is fully automatic, and the results are easy to analyze.
Brief description of the drawings
Fig. 1 is a diagram of the state changes of the test chamber and the AC Source in an embodiment of the invention.
Detailed description
Explanation of terms:
AC Source: a programmable AC power supply.
GRUB: GNU GRUB (GRand Unified Bootloader, "GRUB" for short) is a boot loader from the GNU project that can start multiple operating systems. GRUB is an implementation of the Multiboot Specification; it lets a user keep several operating systems on one computer and choose which one to run when the computer starts. GRUB can select among different kernels on an operating system partition and can also pass boot parameters to those kernels.
Specific embodiments of the invention are described in detail below.
This embodiment is tested on Ubuntu 14.04 LTS (GNU/Linux 3.13.0-24-generic x86_64). Test equipment: one server, an AC Source, one test host, one DV (digital video camera) and one temperature test chamber.
1. Modify the BIOS: set the server to power on automatically when AC power is applied.
2. Modify the Linux GRUB settings: after an unexpected shutdown the system tends to stop at the GRUB boot menu and needs Enter to be pressed before it continues. To avoid this, edit the configuration file with vim /boot/grub/grub.cfg, find the section below, and change timeout=-1 to timeout=2 (any number greater than zero; the unit is seconds).
fi
terminal_output gfxterm
if [ "${recordfail}" = 1 ] ; then
  set timeout=-1    # change -1 on this line to 2
else
  if [ x$feature_timeout_style = xy ] ; then
    set timeout_style=menu
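The manual edit in step 2 can also be scripted. The following is a minimal sketch, not part of the patent: the function name set_recordfail_timeout is hypothetical, and it assumes the target line has exactly the form "set timeout=-1". Note that grub.cfg is regenerated whenever update-grub runs, so an edit of this kind does not survive regeneration.

```shell
# Hypothetical helper: rewrite "set timeout=-1" to a positive value in a GRUB config.
# $1 = path to the config file, $2 = new timeout in seconds.
set_recordfail_timeout() {
    cfg="$1"
    seconds="$2"
    tmp="${cfg}.tmp.$$"
    # Only lines of the form "set timeout=-1" are changed; other timeouts are kept.
    sed "s/^\([[:space:]]*set timeout=\)-1[[:space:]]*$/\1${seconds}/" "$cfg" > "$tmp" &&
        cat "$tmp" > "$cfg" &&   # write back in place so file permissions are kept
        rm -f "$tmp"
}
```

On the server under test this would be called as root, for example set_recordfail_timeout /boot/grub/grub.cfg 2, after backing up the original file.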
3. Connect an external test host through the serial port and run the OCTOPUS software to monitor the state of the BIOS during startup (other BIOS monitoring equipment, including more advanced tools such as Intel ITP, can also be connected here). Record the display connected to the server with the DV, so that when a problem is found the situation at that moment can be replayed according to the time in the log.
4. Modify the system configuration file so that the system shuts down automatically after booting and records each boot time:
cp /etc/rc.local /etc/rcbackup    # back up the original rc.local
touch /root/log    # create a file for storing the log
vim /etc/rc.local    # edit rc.local and add the code below (two alternative schemes)
(1) The script determines how many times the server must boot successfully (that is, the number of times the system is actually entered).
(2) The AC Source determines how many times the server is powered on (power-on does not necessarily mean the system is entered):
date >> /root/log    # record each boot time in the log file
sleep 10    # wait ten seconds
poweroff    # shut down
After the number of cycles set on the AC Source has completed, boot into single-user mode and run cat /etc/rcbackup > /etc/rc.local to restore the original file.
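The text gives code only for scheme (2). For scheme (1), where the script itself counts successful boots, one possible shape is sketched below. This is an illustrative assumption rather than the patent's own code: the function name record_boot and the target count are hypothetical, and the function only reports a decision instead of powering off, so it can be tried safely.

```shell
# Hypothetical helper for scheme (1): log this boot and report whether
# another power cycle is still needed.
# $1 = log file, $2 = target number of successful boots.
# Prints "poweroff" while more boots are needed, "finished" once the target is reached.
record_boot() {
    log="$1"
    target="$2"
    date >> "$log"                       # one line per successful boot
    count=$(wc -l < "$log" | tr -d ' ')  # boots recorded so far
    if [ "$count" -lt "$target" ]; then
        echo poweroff
    else
        echo finished
    fi
}

# In /etc/rc.local one might then write (not executed here):
#   [ "$(record_boot /root/log 50)" = poweroff ] && { sleep 10; poweroff; }
```

With this shape the server keeps shutting itself down cleanly until the target number of successful boots is logged, and is left running afterwards.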
5. Set the test chamber and the AC Source.
To let the server cool completely, the server is powered for 5 minutes and then left unpowered to cold-soak for 15 minutes; the chamber and the AC Source must be synchronized to achieve this. First the server is cold-soaked at -5 °C for 2 hours. Once the equipment is completely cold, the temperature is raised to 0 °C and the server is powered on; it shuts down ten seconds after booting (the maximum time from power-on to automatic shutdown is 5 minutes, and exceeding it counts as a failure). To make sure the server cools down again, after the equipment is powered off the temperature is lowered from 0 °C to -5 °C, held at -5 °C for 5 minutes, and then returned to 0 °C, after which the server is powered on again. Power-on and shutdown repeat in this way, and after the test the server must rest at room temperature for two hours.
This process is shown in Fig. 1.
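As a rough planning aid, the timings in step 5 can be turned into an estimate of the total test duration. The sketch below uses only the figures stated above; the function name is hypothetical, and it assumes every cycle, including the last, takes the full 5 + 15 minutes.

```shell
# Sketch only: phase lengths (minutes) are taken from the text;
# the function name and the phase model are illustrative assumptions.
test_duration_minutes() {
    cycles="$1"
    initial_soak=120   # 2 h at -5 °C before the first power-on
    power_on=5         # AC Source output on, chamber at 0 °C
    power_off=15       # output off: ramp to -5 °C, 5 min dwell, ramp back to 0 °C
    final_rest=120     # 2 h at room temperature after the last cycle
    echo $(( initial_soak + cycles * (power_on + power_off) + final_rest ))
}

test_duration_minutes 50   # prints 1240
```

Under these assumptions a 50-cycle run takes 1240 minutes, a little under 21 hours.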
If the equipment is to be sold worldwide, the AC Source can also be set so that every mainstream mains voltage type in the world is tested 50 times each; that is, the amplitude and frequency of the voltage applied during each five-minute power-on period must cover the types in the following table, with 50 tests per type:
6. Collect the test logs:
a. The boot time log, /root/log. The command wc -l /root/log shows the total number of boots. For detailed analysis, the interval between consecutive boots can be calculated in Excel; any interval longer than 25 minutes indicates a boot failure.
b. The BIOS monitoring log. When a boot fails, the corresponding BIOS boot record can be inspected.
c. The Linux /var/log/syslog. The boot time log and the BIOS log only show whether booting was normal; syslog shows whether shutdown was abnormal and also records cases in which the BIOS booted normally but the system failed to start.
d. The DV recording, which helps reproduce exactly what happened when a boot or shutdown was abnormal.
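The interval check described for the boot time log can also be done on the command line instead of in Excel. The sketch below is an assumption-laden illustration: the function name find_boot_gaps is hypothetical, and it expects one epoch-seconds timestamp per line, which would require logging date +%s (or converting the existing timestamps first) rather than the plain date output used in step 4.

```shell
# Hypothetical helper: read one epoch-seconds timestamp per line on stdin and
# print every gap between consecutive boots that exceeds the limit (in minutes).
# $1 = limit in minutes (defaults to 25, the threshold given in the text).
find_boot_gaps() {
    limit_min="${1:-25}"
    awk -v limit="$limit_min" '
        NR > 1 && ($1 - prev) > limit * 60 {
            printf "gap after boot %d: %d min\n", NR - 1, ($1 - prev) / 60
        }
        { prev = $1 }
    '
}
```

For example, find_boot_gaps 25 < /root/log.epoch would list each place where a power cycle did not produce a logged boot; the file name /root/log.epoch is itself illustrative.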
The invention discloses a method for testing the low-temperature startup of a server. The AC Source settings and the script ensure that the equipment and its components start only after all of them are cold, so residual heat cannot affect the test result. The test covers not only booting but also shutdown, and it eliminates the influence of abnormal shutdown on the test. It tests hardware and components comprehensively, and it also tests the stability of non-distribution Linux systems and BIOSes that some enterprises develop in-house. It also tests the AC power module and voltage compatibility. It provides very detailed logs, which greatly help in reproducing a problem after it is found. The test can run under Windows as well as Linux, although the Linux-specific syslog under /var will then be missing. For servers whose CPU, memory and HDD come in fixed combinations, the test can be run on each configuration to complete hardware compatibility testing. The test parameters, such as the power-on time, the cold-soak time and the chamber temperature, can be changed to suit the user. The test examines the server comprehensively, from components to system, in order to expose as many problems as possible.
The above is a further detailed description of the invention in connection with specific preferred embodiments, and the specific implementation of the invention should not be regarded as limited to these descriptions. A person of ordinary skill in the art to which the invention belongs may make simple deductions or substitutions without departing from the inventive concept, and all of these should be regarded as falling within the protection scope of the invention.

Claims (4)

1. A method for testing server startup at low ambient temperature, characterized in that the method comprises the following steps: A. modifying the BIOS so that the server powers on automatically when AC power is applied; B. modifying the Linux GRUB settings to ensure that the server enters the system smoothly after power-on; C. connecting an external test host through the serial port to monitor the state of the BIOS during startup; D. modifying the system configuration file so that the system shuts down automatically after booting and records each boot time; E. setting the test chamber and the AC Source to ensure that the server starts only when completely cold; F. collecting the test logs and ending the test.
2. The startup test method according to claim 1, characterized in that step E, ensuring that the server starts only when completely cold, further comprises the following sub-steps: E1. cold-soaking the server at -5 °C for 2 hours; E2. raising the chamber to 0 °C, at which point the AC Source starts supplying power to the server.
3. The startup test method according to claim 1, characterized in that step E, ensuring that the server starts only when completely cold, further comprises the following sub-steps: E3. the server shuts down automatically under system control; E4. after the equipment is powered off, the chamber temperature is lowered from 0 °C to -5 °C and held at -5 °C for 5 minutes; E5. steps E2 to E4 are repeated until the server test is complete.
4. The startup test method according to claim 1, characterized in that step E, ensuring that the server starts only when completely cold, further comprises the following sub-step: E6. after the test is complete, the server rests at room temperature for two hours.
CN201510238030.0A 2015-05-11 2015-05-11 A kind of startup test method of server at low ambient temperatures Active CN104850473B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510238030.0A CN104850473B (en) 2015-05-11 2015-05-11 A kind of startup test method of server at low ambient temperatures

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510238030.0A CN104850473B (en) 2015-05-11 2015-05-11 A kind of startup test method of server at low ambient temperatures

Publications (2)

Publication Number Publication Date
CN104850473A true CN104850473A (en) 2015-08-19
CN104850473B CN104850473B (en) 2018-10-12

Family

ID=53850128

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510238030.0A Active CN104850473B (en) 2015-05-11 2015-05-11 A kind of startup test method of server at low ambient temperatures

Country Status (1)

Country Link
CN (1) CN104850473B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090204839A1 (en) * 2008-02-07 2009-08-13 Sun Microsystems, Inc. Method and apparatus for controlling the temperature of a power supply in a computer system
US20100070745A1 (en) * 2008-09-18 2010-03-18 Mitac Technology Corp. Computer system and method for booting under low temperature state
CN102262430A (en) * 2010-05-28 2011-11-30 微软公司 Automatically starting servers at low temperatures
CN102262607A (en) * 2011-05-04 2011-11-30 山东超越数控电子有限公司 Method for acquiring temperature information in personal computer (PC) by using I²C bus
CN102520370A (en) * 2011-12-31 2012-06-27 曙光信息产业股份有限公司 Server switching power supply testing method and server switching power supply testing device
CN104375910A (en) * 2014-11-24 2015-02-25 浪潮电子信息产业股份有限公司 Automatic power-on and power-off test method

Also Published As

Publication number Publication date
CN104850473B (en) 2018-10-12

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20170512

Address after: 518000 Guangdong city of Shenzhen province Nanshan District Guangdong streets Park seven Shenzhen Digital Technology Park B1 building 3 floor C District

Applicant after: Shenzhen Tong Yi Yi Information Technology Co., Ltd.

Address before: 518000 Guangdong province Shenzhen city Nanshan District high tech park, seven Shenzhen City Digital Technology Park B1 building 3 floor C District

Applicant before: Shenzhen Guo Xinheng space Technology Co., Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant