Roles & Responsibilities
1. Responsible for the operation and maintenance of the proprietary cloud platform and related cloud products, solve difficult technical problems related to the platform and products, repair platform technical risks, ensure the stable operation of the cloud platform and cloud products in various customer scenarios, and guarantee cloud service SLA
2. In-depth understanding of the architecture design and technical principles of the cloud platform and cloud products, promote the optimization and upgrade of the platform architecture and cloud products to a more advanced direction
3. Improve the overall work efficiency of the operation and maintenance team by developing automated, intelligent, and data-based tools or system platforms, avoid inefficient and repetitive work, and reduce operation and maintenance costs
4. Be responsible for the planning of cloud platform construction, upgrades, expansions, high availability, etc. and complete them on time and with quality assurance, be able to put forward constructive suggestions on cost reduction and efficiency improvement, and be able to implement ideas.
Qualifications
1.Bachelor degree or above, major in computer science or related majors
2.Proficient in Linux system, familiar with TCP/IP protocol, familiar with database principles, and have in-depth research on the underlying OS/network/DB
3.Proficient in at least one general programming language (language is not limited, experience in large-scale system design is a plus), and able to write operation and maintenance related tools or systems to improve the efficiency of operation and maintenance work
4.Have in-depth technical understanding in at least one field in the fields of proprietary cloud platform system operation and maintenance, containers, computing, networks, application operation and maintenance, middleware, big data, databases, security, etc., have skills such as system debugging and performance debugging, and have strong analysis and troubleshooting capabilities for difficult problems
5.Be familiar with the cloud technology system and new technologies popular in the industry, have your own insights on solving complex problems in platform operation and maintenance, and be able to continuously improve operation and maintenance related systems to maximize the automation capabilities of operation and maintenance
6.Good document output ability, timely precipitate technical documents and operation and maintenance solutions, and be good at summarizing and summarizing, and be able to share and publicize them internally and externally
7. Be careful in work, be able to perform operation and maintenance operations strictly in accordance with the operation and maintenance process system, have a strong sense of ownership, customer service and teamwork, be good at active thinking and self-driving, have a keen sense of risk and good risk identification ability, and be able to plan operation and maintenance work with customer business as the center
8. Have an in-depth understanding of distributed distributed systems, be familiar with commonly used open source basic components of the Internet (nginx, redis, kafka, mysql, hbase, zookeeper, hadoop, etc.), and those with experience in ultra-large-scale cluster management are preferred
9. Have a strong sense of responsibility, good communication and coordination skills, and have experience in large projects. Management experience is preferred
10. Able to accept certain short-term travel requirements.