Staff Engineer, SRE (macys.com)

San Francisco - California

Date Posted: Jan. 23, 2018

Requisition ID: MAC08169

Job Overview:
 
A Staff Software Engineer, SRE is an integral member of the core team that provides application support for our website and mobile services platform. This is a fantastic opportunity to collaborate closely with our software engineering, architecture and operations teams at Macy’s Technology. The Staff Software Engineer, Site Reliability is responsible for ensuring Macy’s services are highly available, reliable, secure and scalable. The ideal candidate is fluent in systems programming and/or automation and can leverage their experience to solve complex problems associated with running production environments at massive scale in multi-tenant environments. This position involves spending 70-80% of the time on production support activities and the remainder on development activities. The candidate should have a strong application support mindset and a development background. Activities include application design reviews, code reviews, writing code for enhancements and championing the operational readiness of mission-critical applications before production launch. Perform other duties as assigned.
 
Essential Functions:
 
• Provide production support to desktop and mobile application services platform.
• Ensure services are designed with 24/7 availability and operational readiness.
• Implement proactive monitoring, alerting, trend analysis and self-healing systems.
• Work with product development / engineering as needed to ensure high availability of services.
• Work closely with core platform and stability & performance teams to influence framework or architectural changes based on learnings from production.
• Work closely with the infrastructure and operations team on operational readiness.
• Review the topology, capacity, caching, stability and reliability of services.
• Conduct root cause analysis and drive prevention of recurrence through to closure, including resolution of product/service defects, design changes, infrastructure changes, or operational changes.
• Be the team’s representative on production calls for the application development team to track, manage, troubleshoot and fix production issues, both short term and long term.
• Monitor the production issue queue on a rotation basis and work with the business team to prioritize, analyze and manage issues to closure.
• Review system support documents and update production application service run books where needed.
• Apply deep troubleshooting and scripting skills to improve the availability, performance, and security of services.
• Consistently demonstrate regular, dependable attendance & punctuality.
 
Qualifications:
 
Education/Experience:
 
• Bachelor's degree in Computer Science or a related field and 10+ years of experience or the equivalent combination of education and experience.
• 6+ years of experience in full life cycle development of J2EE projects.
• 5+ years of experience with monitoring tools such as DynaTrace, Splunk, or KeyNote is a strong plus.
• 5+ years of experience in application profiling (Java core, thread dumps, etc.).
• 5+ years of systems engineering in 24x7 production services environments.
• 2+ years of experience with at least one current-generation scripting language used by DevOps professionals (Python, Perl, PHP), plus Java development.
• Excellent troubleshooter with a systematic problem-solving approach spanning code, systems, and network theory & protocols (TCP/IP, UDP, ICMP), including the ability to read a packet capture/tcpdump.
• Demonstrated experience in designing, analyzing, and diagnosing large-scale distributed systems, as well as Linux systems internals (system libraries, file systems, client-server protocols).
• Experience in massive-scale web operations.
• Experience in offshore and onsite coordination to maximize team impact.
 
Communication Skills:
 
• Ability to effectively communicate technical issues and recommend solutions to all levels of the business, sometimes during high-stress situations.
 
Reasoning Ability:
 
• Strong analytical and problem-solving skills are a must.
 
Other Skills:
 
• Ability and desire to thrive in a proactive, high-pressure, customer-facing environment.
 
Work Hours:
 
• Ability to work a flexible schedule based on department and company needs.
 
Company Profile:
 
As the fastest-growing part of the Macy's, Inc. business, macys.com is achieving record sales and broadening our workforce. With offices in New York and San Francisco, macys.com is the best of all worlds. The entrepreneurial thinking of a Web business complements the stability and support of a national brand. Creativity and ingenuity partner with business acumen and tech savvy to build a unique business poised for continued growth. Employees at macys.com have long-term opportunities and are encouraged to work with their supervisors and Human Resources on cross-functional moves that further their careers. At macys.com we are committed to giving back to the community by partnering with local charitable organizations. By skillfully combining the power of the Internet with the best in retailing, macys.com is reaching new heights.
 
This job overview is not all-inclusive. In addition, Macy’s, Inc. reserves the right to amend this job overview at any time. Macy’s is an Equal Opportunity Employer, committed to a diverse and inclusive work environment. Macy’s, Inc. – including Macy’s and Bloomingdale’s – will consider for employment qualified applicants with criminal convictions in a manner consistent with SFPC Art. 49 and LA MC ch.XVIII Art. 9.