We are seeking a skilled Microsoft SharePoint / M365 & Power Platform Admin / Operations & Support Engineer to monitor, maintain, and support two closely related Microsoft platform domains within a managed services engagement. This is a unified operations role, not a development role, centred on platform observability, incident response, service continuity, and proactive health management across SharePoint Online, Microsoft 365 collaboration services, and Power Platform workloads including Power Apps, Power Automate, and Dataverse. The candidate will work closely with the AMS Service Manager, platform developers, and solution architects to ensure all services remain reliable, performant, secure, and aligned with SLA commitments.
Key Responsibilities
Core monitoring & observability responsibilities
The following six responsibilities apply across both platform domains — SharePoint / M365 and Power Platform — and must be performed consistently in all environments:
- Ticket monitoring in ITSM tools — Track, triage, and manage all incoming incidents and service requests across both SharePoint / M365 and Power Platform through ITSM platforms (e.g., ServiceNow, Jira Service Management), ensuring timely assignment, prioritization, and resolution in line with SLA commitments.
- Application availability monitoring — Continuously monitor the availability and uptime of SharePoint Online, Microsoft 365 services (Teams, OneDrive, Exchange Online), Power Apps applications (Canvas and Model-Driven), and Power Automate flows across production and non-production environments; promptly identify, escalate, and act on outages or service degradations to minimize business impact.
- Platform health and usage monitoring — Maintain ongoing visibility into the health and usage patterns of both Microsoft 365 (via M365 Admin Center and service health dashboards) and Power Platform (via Power Platform Admin Center); track site collection utilization, SharePoint storage quotas, Power Platform environment capacity, API usage, connector health, and licensing consumption across all environments.
- Performance and capacity tracking — Monitor performance KPIs and capacity thresholds across SharePoint Online storage, M365 service response times, Power Apps load performance, Power Automate flow run durations, and Dataverse capacity; proactively report trends and recommend scaling, optimization, or governance actions before issues impact end users.
- Log review and alert management — Regularly review M365 audit logs, SharePoint diagnostic logs, Power Automate run history, and Power Platform telemetry; manage and tune alerting configurations within M365 Admin Center and Power Platform Admin Center to maintain signal quality; investigate triggered alerts and differentiate operational noise from genuine risks requiring escalation or remediation.
- Proactive anomaly detection — Apply monitoring tools, operational insights, and historical usage patterns to detect early signs of failure — including unusual SharePoint storage growth, abnormal Power Automate failure spikes, configuration drift, connector deprecation risks, or DLP policy violations — and initiate preventive action before incidents escalate to affect business operations.
SharePoint / M365 operations
- Monitor application availability, performance, and health of SharePoint Online and Microsoft 365 services using native M365 admin tools, service health dashboards, and monitoring platforms.
- Administer site collections including creation, configuration, storage management, and lifecycle governance.
- Manage permissions and access controls including SharePoint groups, sharing policies, external access, and sensitivity labels.
- Resolve incidents and service requests related to SharePoint, OneDrive, Teams, and other M365 services in line with SLA commitments.
- Perform adaptive maintenance activities to accommodate Microsoft 365 platform updates, feature rollouts, and deprecations.
- Support and administer Microsoft 365 compliance features including retention policies, eDiscovery, and DLP configurations.
- Manage SharePoint search configuration, crawl schedules, and content source administration.
- Assist with user onboarding and offboarding activities related to M365 services and SharePoint access.
- Conduct regular audits of site permissions, sharing configurations, and guest access to ensure compliance with governance policies.
- Collaborate with the SharePoint development team on deployments, hotfixes, and platform configuration changes.
Power Platform operations
- Monitor Power Apps applications (Canvas and Model-Driven) for availability, performance issues, and errors across production and non-production environments.
- Troubleshoot and resolve Power Automate flow failures, errors, and performance degradations in a timely manner, including analysis of run history and trigger/action failures.
- Manage the connector lifecycle including reviewing connector usage, handling connector deprecations, and updating connections to maintain service continuity.
- Investigate and resolve functional incidents related to Power Platform solutions, coordinating with business users and development teams as required.
- Administer Power Platform environments including environment management, DLP policy enforcement, and capacity monitoring.
- Manage access, security roles, and permissions within Dataverse and Power Platform environments.
- Apply adaptive maintenance to address Microsoft platform updates, feature changes, and API version upgrades affecting Power Apps and Power Automate solutions.
- Monitor and manage Power Platform capacity, API usage, and licensing consumption across all environments.
- Collaborate with Power Platform developers to support solution deployments, change validations, and post-release health checks.
Cross-platform operational responsibilities
- Execute minor configuration changes, parameter updates, and service requests across both SharePoint / M365 and Power Platform within defined change management processes, with appropriate documentation and approvals.
- Conduct root cause analysis (RCA) for recurring or critical incidents across both platforms; contribute findings to the problem management log and continuous improvement backlog.
- Maintain and update operational runbooks, troubleshooting playbooks, and known error databases covering SharePoint / M365 and Power Platform services.
- Produce regular operational reports covering service health, incident volumes, SLA adherence, and platform usage metrics for service review meetings.
- Contribute to the continuous improvement backlog by identifying recurring issues and recommending proactive solutions across both platform domains.
- Collaborate with the AMS Service Manager to ensure seamless service delivery, SLA governance, and consistent service quality across both platform areas.
Required Qualifications & Skills
- 7-10 years of experience in Microsoft 365 / SharePoint administration and/or Power Platform operations or support roles.
- Strong working knowledge of SharePoint Online, Teams, OneDrive, and M365 Admin Center for administration, governance, and incident support.
- Solid hands-on knowledge of Power Apps (Canvas and Model-Driven), Power Automate, and Dataverse for operational monitoring and incident resolution.
- Experience with Power Platform Admin Center for environment administration, DLP policy management, and capacity monitoring.
- Proficiency in troubleshooting Power Automate flows — including run history analysis, diagnosing trigger/action failures, and optimizing flow performance.
- Understanding of connector types (standard, premium, custom) and connector lifecycle management within Power Platform.
- Experience with site collection administration, SharePoint permissions management, and M365 governance frameworks.
- Familiarity with Microsoft 365 compliance tools: Purview, retention policies, DLP, and eDiscovery.
- Proficiency in PowerShell, PnP PowerShell, and Power Platform CLI (PAC CLI) for administration and operational tasks.
- Understanding of Microsoft Entra ID (formerly Azure Active Directory), conditional access, RBAC, and identity governance across M365 and Power Platform.
- Familiarity with Power Platform ALM and solution deployment processes.
- Experience with ITSM tools (ServiceNow, Jira Service Management) for incident, problem, and change management.
- Strong analytical, communication, and documentation skills; ability to work effectively under pressure during live incidents.
- Microsoft certifications such as MS-102 and MS-700 (Microsoft 365 track) and PL-200, PL-400, or PL-900 (Power Platform track) are highly desirable.
Work Mode
Experience Required