Critical thinking SRE with extensive understanding of high availability architecture and concepts. Purpose-driven professional with capacity to be strong team player plus work effectively independently.
Design and implement observability/monitoring solutions,
define and establish SLOs based on this and also align them with the business needs.
At fiskars i was tasked with deploying Azure Cloud Infrastucture using AzDO pipelines, design implement and showcase PoC for products like Dynatrace, Contrast, Mend, Datadog etc.
In charge of both maintaining and supporting the AKS clusters, in which our Sitecore(.NET framework based app) app resides, and the complementary services which it uses(Subnets, Application Gateways, Storage Accounts, Traffic Managers, VMSS, SQL Server DBs etc).
Re-designing the existing build and deploy pipelines to minimize the execution time.
Resolve topics raised by the Azure Security Advisors to further increase our platforms security.
This was a part time project in which i was tasked with migrating the application from the legacy VMs (linux and tomcat) to an orchestrated cloud environment (GCP).
Onboarded the project to the Google Cloud platform using git hub actions and terraform enterprise.
Design new CI/CD flows using github actions for our new Kubernetes clusters hosted in GCP.
Maintained metrics visibility using Datadog and Prometheus/Grafana to create useful dashboards and monitors.
As a DevOps engineer in ING Tech i have to leverage AzureDevOps to create pipelines for scaling pods for our OpenShift Kubernetes objects
Implement OpenShift cluster monitoring using kube-state-metrics and Prometheus, displaying the results in Grafana.
Improve system reliability by measuring downtime of the middleware that we are using, ELKaas/DBaas/Kafka and other in house developed software.
Propose solutions for recurring incidents for our middleware, aswell as optimizing delivery pipelines inside AzDO.
Implement nightly builds so we can have a better idea of where the build downtime lies.
Manage and maintain all OpenShift clusters for different environments ACC/TST/DEV/PRD
Ensure that our platforms, both in-house and on cloud, deliver the expected quality of service and meet our customers SLAs.
Deploy and manage our in-house developed stack (based on salt-stack) both on-premises and on cloud(AWS).
Create CI/CD pipelines using Jenkins to automatically provision different AWS Objects(EKS, EC2, EFS, VPC, ELB etc) based on terraform files hosted on various GIT repositories.
Using infrastructure management tools like Salt, Ansible, Puppet to further develop our growing platform.
Making use of GIT Scm and Apache SVN to tweak, configure and update our applications and also infrastructure open-source apps used by the platforms like Nginx, Haproxy, Keepalive-d, Ceph-csi, etc. As Git scm is the core component in dev/devops teams i have experience in deploying and working with tools such as BitBucket, Gitlab, Gitea, Gogs, Git-hub etc.
Working in a containerized environment orchestrated by Kubernetes or DockerSwarm i am a proficient user of Kubernetes and Docker. I know all the pros and cons, the little things that make docker containers great or sometimes terrible.
Managing run/operations production issues using scripting languages such as Python/Bash.
Proficient user of main linux distros used in Enterprise like Ubuntu,CentOS, Solaris, RedHat etc.
Deploying and using databases like: MariaDB/MySQL, CassandraDB, Postgres, MongoDB.
Support projects for customers like Discover, SamsungPay, ApplePay.
Work on kuberntes orchestrated platforms to ensure operability of customer service.
Work together with customers like Santander, Sabadell, Redsys and ServiceProviders like SamsungPay to further improve the service offered to end-users.
Using scripting languages such as python or bash to query DBs(CasandraDB/MariaDB) in order to create custom reports/statistics for customers.
Identifying bugs reported by customers and working together with Dev Team in order to patch said bugs.
Automation of certain toilsome tasks using a scripting language(customer reports/failed notification sending/ etc) to reduce toil for my team as well for other teams.
Implementing Prometheus+Grafana custom monitoring for certain application modules.
First point of contact with incidents happening on the platform.
Ensure 24/7 functionality of the platform by monitoring it and quickly responding to incidents.
Managing incidents from start to finish by:
Performing actual upgrade operations for required products to fix/patch existing bugs and/or to add new features to the product.
Creation of customer reports based on customer requirements
DevOps proficiency
undefinedCCNA 1
CCNA 4
CCNA 3
CCNA 2
CCNA 1