09:10 – 10:10 Keynote 1: Phil Tee (Moogsoft)
In commercial applications of network management, a recurring challenge is the need to correlate system and network events to determine root cause. The techniques applied have ranged from rules based expert systems to more modern approaches involving the full battery of information and data science. I personally have been involved with this journey for nearly 30 years, and a curious feature of the space is the repeated motif of graph theory as a common thread in all approaches. In this talk, I will survey some of the manifestations of G(V,E) across very different algorithms, and identify that the convenient equivalence between graphs and matrices as potentially being a driving factor in the “unreasonable effectiveness of graphs” for fault localization.
Phil Tee is the CEO and co-founder of Moogsoft, the pioneer and leading provider of artificial intelligence for IT operations (AIOps). Phil is a serial entrepreneur, inventor, and expert in IT service assurance — a category he was instrumental in shaping through his invention of Netcool (acquired by IBM) nearly 25 years ago.
His curiosity in the IT space began after graduating from the University of Sussex in 1990, but that academic experience was what led to the development of Moogsoft’s initial technology. Two years after graduation, he co-founded Omnibus Transport Technologies Limited (OTT) to build and market Netcool/Omnibus. Since then, Phil has led numerous companies to successful exits, including RiverSoft (IPO) and Njini (acquired by Riverbed).
Phil’s career came full circle when he went back to his academic roots and co-founded Moogsoft in 2011. Now, with over 120 customers worldwide, including SAP SuccessFactors, American Airlines, Intuit, GoDaddy, Yahoo!, and HCL Technologies, Phil’s legacy has helped IT teams work faster and smarter. With patented AI analyzing billions of events daily across the world’s most complex IT environments, the Moogsoft AIOps platform helps the world’s top enterprises avoid outages, automate service assurance, and accelerate digital transformation initiatives. Customers are what drives Phil to improve and learn more each day, and his pursuit of rigorous scientific discovery and innovation of technology is at the heart of Moogsoft.
Challenges and Opportunities: Resource Management for Fog Computing/Cloud Infrastructure
Cloud computing has worked very well for many applications. However, there are limitations that make it difficult to fully support certain types of applications. For example, an Internet of Things (IoT) application may require large amounts of data generated by sensors to be analyzed quickly in order to trigger actions in a timely fashion (e.g., traffic control, emergency response). Transferring data over the Internet to a cloud data center may incur significant delays that impact timely decision making. Furthermore, the transfer of large amounts of data over the network potentially could impact the costs associated with the network infrastructure needed to support the data transfer. To tackle these issues a new computing and networking paradigm, Fog Computing, has emerged. The main goal of fog computing is to distribute computing, storage, control, and networking services by bringing services to the network edge, where data is generated and where decisions will often need to be made in order to address stringent real-time requirements of applications. The result is a complex ecosystem of technologies.
The task of managing system health to ensure that applications are reliable and have the resources to satisfy their runtime requirements becomes more challenging due to the heterogeneity and fractured ownership of the technology ecosystem and the dynamicity of applications. Resource management challenges and opportunities will be presented in this talk.
Hanan Lutfiyya is a Professor and Chair of the Department of Computer Science at the University of Western Ontario. Her research interests include Internet of Things, software engineering, self-adaptive systems, autonomic computing, monitoring and diagnostics, mobile systems, policies, and clouds. Her research group in collaboration with industrial and government partners investigates different aspects of reliable software and systems. She is currently collaborating with Tillsonburg Hydro on smart grids. She has received funding from Ontario Research Fund (ORF), NSERC, IBM, Samsung, Fujitsu and Canada’s Communications Research Centre (CRC). Professor Lutfiyya received the UWO Faculty Scholar Award in 2006. Professor Lutfiyya is currently an associate editor for the IEEE Transactions on Network and Service Management, and has recently served as program co-chair of IEEE/IFIP Network Operations and Management Symposium and the IEEE International Conference on Network and Service Management (CNSM).