職位描述
We are seeking a highly skilled and motivated Senior Data Center Technician to join our team. In this role, you will be responsible for overseeing the operations, maintenance, and troubleshooting of our enterprise network, power and cooling solutions in our company labs and offices. You will ensure the smooth and efficient operation of hardware, networking, and server systems, while supporting customers and stakeholders to maintain high levels of uptime and performance.
Key Responsibilities:
● Oversee day-to-day operations within the offices, NERs (Network Equipment Room), and labs, ensuring all hardware and systems are functioning properly.
● Perform troubleshooting and diagnostics for hardware and network-related issues.
● Manage the installation, configuration, and maintenance of servers, storage devices, and network equipment.
● Conduct preventive maintenance and regular inspections of equipment to minimize downtime.
● Regularly audit IT environments to identify changes that have occurred, cleaning up rooms and cabling while there
● Assist with the planning, implementation, management of system monitoring tools and respond to systerts, escalations, and incidents in a timely manner.
● Collaborate with other cross-functional teams to support various business initiatives and projects.
● Develop and maintain documentation related to hardware configurations, troubleshooting procedures, tribal knowledge, and data center processes.
● Facilitate regional meetings independently and lead team to solve problems and incidents.
Additional Responsibilities:
● Define locations for equipment within network rooms (security/BAS panels, rack positions for network gear, service provided wall fields)
● Produce construction documents and specifications for low voltage cabling contractors
● Track capacity of available ports, fiber plant, and PDU outlets within network rooms
● Configure and connect smart PDU/UPS, working with monitoring team to provide alerting at correct thresholds
● Regularly walk construction projects and provide site reports with Nvidia feedback on work
Qualifications:
● Bachelor’s degree in Information Technology, Computer Science, or a related field (or equivalent experience).
● 5+ years of experience in IT operations, server administration, or similar technical support roles.
● Strong understanding of data center infrastructure, including servers, storage systems, networking equipment, and power/cooling systems.
● Strong experience with server hardware troubleshooting and maintenance (e.g., Dell, HPE, Lenovo, SuperMicro).
● Knowledge of network protocols and concepts, including TCP/IP, DNS, DHCP, VPNs, and firewalls.
● Familiarity with monitoring, automation, and inventory tools (e.g., MAAS, Nautobot) and ticketing systems (e.g., ServiceNow, Jira).
● Ability to be proactive and work with minimal direction. Strong communication skills and the ability to work well within a team environment.
● Ability to work in a fast-paced, high-pressure environment and manage multiple priorities. Availability to work across different time zones to support global regions
● Ability to go onsite a few days a week. Travel when needed (once a few months). Physically lift with the assistance of equipment, install, and inspect components in a rack
● Experience in managing vendors, provide working instructions and SOW (Statement of work)
● Speak English Fluently. Ability to use English in meetings and writings
地點:中國上海
我們正在尋找一名高技能和積極進(jìn)取的高級數(shù)據(jù)中心技術(shù)人員加入我們的團(tuán)隊。在這個職位上,您將負(fù)責(zé)監(jiān)督我們的企業(yè)網(wǎng)絡(luò)、電源和冷卻解決方案在我們的公司實驗室和辦公室的操作、維護(hù)和故障排除。您將確保硬件、網(wǎng)絡(luò)和服務(wù)器系統(tǒng)的平穩(wěn)高效運行,同時支持客戶和利益相關(guān)者保持高水平的正常運行時間和性能。
IT Infrastructure Senior Technician
主要職責(zé):
? 監(jiān)督辦公室、網(wǎng)絡(luò)設(shè)備室和實驗室的日常運作,確保所有硬件和系統(tǒng)正常運行。
? 對硬件和網(wǎng)絡(luò)相關(guān)問題進(jìn)行故障排除和診斷。
? 管理服務(wù)器、存儲設(shè)備和網(wǎng)絡(luò)設(shè)備的安裝、配置和維護(hù)。
? 對設(shè)備進(jìn)行預(yù)防性維護(hù)和定期檢查,盡量減少停機(jī)時間。
? 定期審核IT環(huán)境,以確定已經(jīng)發(fā)生的變化,清理房間和布線
? 協(xié)助系統(tǒng)監(jiān)控工具的規(guī)劃、實施和管理,及時響應(yīng)系統(tǒng)警報、升級和事件。
? 與其他跨職能團(tuán)隊合作,支持各種業(yè)務(wù)計劃和項目。
? 開發(fā)和維護(hù)與硬件配置、故障排除程序、部落知識和數(shù)據(jù)中心流程相關(guān)的文檔。
? 獨立主持區(qū)域會議,帶領(lǐng)團(tuán)隊解決問題和突發(fā)事件。
額外的職責(zé):
? 定義網(wǎng)絡(luò)機(jī)房內(nèi)設(shè)備的位置(安全/BAS面板,網(wǎng)絡(luò)設(shè)備的機(jī)架位置,提供服務(wù)的墻壁區(qū)域
? 為低壓電纜承包商制作施工文件和規(guī)范
? 跟蹤網(wǎng)絡(luò)機(jī)房內(nèi)可用端口、光纖設(shè)備、PDU插座的容量
? 配置并連接智能PDU/UPS,配合監(jiān)控團(tuán)隊提供正確閾值報警
? 定期巡視施工項目,提供包含Nvidia工作反饋的現(xiàn)場報告
資格:
? 信息技術(shù),計算機(jī)科學(xué)或相關(guān)領(lǐng)域的學(xué)士學(xué)位(或同等經(jīng)驗)。
? 5年以上IT運營、服務(wù)器管理或類似技術(shù)支持工作經(jīng)驗。
? 熟悉數(shù)據(jù)中心基礎(chǔ)設(shè)施,包括服務(wù)器、存儲系統(tǒng)、網(wǎng)絡(luò)設(shè)備和電源/冷卻系統(tǒng)。
? 有豐富的服務(wù)器硬件故障排除和維護(hù)經(jīng)驗(如戴爾,惠普,聯(lián)想,超微)。
? 了解網(wǎng)絡(luò)協(xié)議和概念,包括TCP/IP, DNS, DHCP, vpn和防火墻。
? 熟悉監(jiān)控、自動化和庫存工具(如MAAS、Nautobot)和票務(wù)系統(tǒng)(如ServiceNow、Jira)。
? 工作積極主動,在有限的指導(dǎo)下工作。良好的溝通技巧和團(tuán)隊合作能力。
? 能夠在快節(jié)奏、高壓力的環(huán)境中工作,并能處理多個優(yōu)先事項??绮煌瑫r區(qū)工作以支持全球區(qū)域的可用性
? 夠每周到現(xiàn)場工作幾天。必要時旅行(幾個月一次)。在設(shè)備的幫助下進(jìn)行物理提升,安裝和檢查機(jī)架上的組件
? 有供應(yīng)商管理經(jīng)驗,提供工作指導(dǎo)書和SOW(工作說明書)
? 流利地說英語。能夠在會議和寫作中使用英語