Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SYSTEM AND METHOD FOR SOFTWARE ARCHITECTURE REDESIGN
Document Type and Number:
WIPO Patent Application WO/2021/064058
Kind Code:
A1
Abstract:
The present invention provides a system and method for automation of the creation of a software application as a combination of microservices by extraction of discrete elements of software functionality from a monolithic architectures using dynamic and static analysis of code and non-code-related artefacts; forming them into microservices such that the software application is recreated as serverless hardware infrastructure while also validating the transformed code. In many cases, serverless microservices will significantly reduce the hardware requirement for software systems.

Inventors:
CLARKE PAUL (IE)
MCCARREN ANDREW (IE)
Application Number:
PCT/EP2020/077429
Publication Date:
April 08, 2021
Filing Date:
September 30, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV DUBLIN CITY (IE)
International Classes:
G06F8/72
Foreign References:
US20190108067A12019-04-11
US20190250912A12019-08-15
EP3411785A12018-12-12
US20190108067A12019-04-11
Other References:
KAZANAVICIUS JUSTAS ET AL: "Migrating Legacy Software to Microservices Architecture", 2019 OPEN CONFERENCE OF ELECTRICAL, ELECTRONIC AND INFORMATION SCIENCES (ESTREAM), IEEE, 25 April 2019 (2019-04-25), pages 1 - 5, XP033557302, DOI: 10.1109/ESTREAM.2019.8732170
Attorney, Agent or Firm:
PURDYLUCEY INTELLECTUAL PROPERTY (IE)
Download PDF:
Claims:
Claims

1 . A system for creating a software application which operates to perform a software function, the system comprising: an analysis module for analyzing a version of the software application in a monolithic software system in which the software function is operational; an extraction module comprising dynamic and static analysis of code and non-code-related artefacts for extracting discrete functional elements from the monolithic software system; an assembly module for forming the extracted discrete functional element into a microservice wherein the software function is recreated from one or more of said microservices.

2. The system of claim 1 wherein, the analysis module determines which data components of the monolithic software system are suitable for extraction.

3. The system of any preceding claim wherein, the analysis module analyses business logic of the monolithic software system.

4. The system of any preceding claim wherein, the analysis module uses static code analysis focused on architectural representation.

5. The system of claim 4 wherein, the static code analysis comprises automated examination of efferent and/or afferent coupling metrics.

6. The system of claim 4 wherein, the static code analysis comprises instability index determination.

7. The system of any preceding claim wherein, the analysis module uses dynamic code analysis under typical payloads to understand where code is more frequently executed.

8. The system of any preceding claim wherein, the analysis module uses dynamic code analysis to inform resources quotas in the microservices implementation. 9. The system of any preceding claim wherein, the analysis module uses testing asset leverage.

10. The system of claim 9 wherein, testing asset leverage comprises a pre existing test assets targeted for transformation

11 . The system of any preceding claim wherein, the analysis module uses code volatility analysis.

12. The system of any preceding claim wherein, the analysis module uses Al techniques to discover which mechanisms are the best candidates for extraction to form microservices.

13. The system as claimed in claim 12 wherein, the Al technique is Support Vector Machines.

14. The system as claimed in claim 12 wherein, the Al technique uses neural networks.

15. The system as claimed in claim 12 wherein the Al technique is statistical cluster analysis.

16. The system of any preceding claim wherein, the analysis module uses code analysis to determine information exchange within the monolith. 17. The system as claimed in claim 16 wherein, the code analysis forms a footprint for event/messaging.

18. The system of claim 16 wherein, code analysis examines parameter lists for methods and functions, and/or control flow and/or interaction between methods 19. The system of any preceding claim wherein, the analysis module uses interface communications analysis determine what traffic arises in interfaces within the monolith

20. The system of any preceding claim wherein, the analysis module uses interface communications analysis determine what traffic arises between the monolith and the database.

21. The system of any preceding claim wherein, the analysis module uses interface communications analysis determine what traffic arises between the monolith and the Ul.

22. The system of any preceding claim wherein, the analysis module uses automated monolith unit test case discovery in order to isolate units of interest that can be considered candidates for microservice transformation

23. The system of any preceding claim wherein, the automated monolith Ul test case discovery is a means to form test data for verification of the auto generated microservices infrastructure. 24. The system of any preceding claim wherein, the analysis module examines object oriented (OO) structures of monolithic architectures.

25. The system of any preceding claim wherein, the analysis module examines continuous integration monolith based technology to auto-generate robust validation routines for the destination microservices based infrastructure.

26. The system of any preceding claim wherein, the analysis module uses automated analysis of source code commit logs to determine the level of volatility in each area of the code base as a means to identifying prime candidates for full microservice implementation.

27. The system of any preceding claim wherein, where analysis has determined the existence of large and stable features implemented in the monolithic system, the features are such instances in a form of microservice.

28. The system of any preceding claim wherein the extraction module comprises a parallelised Al module for service identification configured with at least one unsupervised machine learning technique for extracting service candidates from extracted discrete functional elements wherein candidate microservices are recreated from one or more of said service candidates.

29. A method for creating a software application which operates to perform a software function, the method comprising the steps of: analyzing a version of the software application in a monolithic software system in which the software function is operational; extracting discrete functional elements from the monolithic software system using dynamic and static analysis of code and non-code-related artefacts; forming the extracted discrete functional element into a microservice; and recreating the software function from one or more of said microservices.

30. The method of claim 29 comprising configuring a parallelised Al module for service identification using at least one unsupervised machine learning technique for extracting service candidates from extracted discrete functional elements wherein candidate microservices are recreated from one or more of said service candidates.

31. A computer program having program instructions for operating the method of claim 29.

Description:
Title

System and Method for Software Architecture Redesign

Field The present disclosure relates to a system and method for software architecture redesign. In particular the disclosure relates to a system for software architecture redesign which replaces monolithic systems with microservices (and higher-level granularity services, e.g. individual functions in a Function as a Service environment).

Background

There is an acceptance by many software firms of the need to modernize their software architectures so as to lower operating costs and increase speed to market. In the past, software modernization had tended to focus on a change of programming language, for example, from C / C++ to Java to take advantage of the Java Virtual Machine capability and its operational attractiveness. Such a change could be achieved, but required a level of system architectural change.

Figure 1 is a schematic representation of a monolithic architecture 1 which comprises an application user interface (Ul) 3, tightly bound business logic 5 and a database 7. As depicted in Figure 1, the monolithic architecture 1 has all of the programmatic business logic 5 associated with a software system housed within a single large monolithic architecture. One disadvantage of this monolithic architecture is that it is unsuited to rapid feature delivery of services. Therefore, the current wave of technological advancement involves moving from monolithic software structures to cloud-based microservices.

Microservices are a software development technique (a variant of the service- oriented architecture (SOA)) that structures an application as a collection of loosely coupled services. In a microservices architecture, services are fine grained. The benefit of decomposing an application into different smaller services is that it improves modularity. Flowever, microservices function as service-based systems and require an entire architectural redesign. Redesign requires an elicitation of all of the intricate accumulated dependencies within the monolithic architecture, a decomposition of the core functionality into microservices and the implementation of an entirely new event-driven communications mechanism. Therefore, redesign is expensive and time consuming.

Microservices architectures are considered to be superior because they allow for many independent pieces of business logic to be completely decoupled, meaning that new business logic/features can be added to an existing system without the need to replace other already-operational business logic.

Moving to microservices requires: eliciting the accumulated dependencies within the monolithic systems; separating core functionalities into micro-services; and creating event-driven communication systems.

Current processes for creating microservices from monolithic architectures are manual with much human intervention and decision making. Consequently, they are time-consuming, costly and error prone.

In addition, the implementation of microservices introduces several challenges, for example, the complexity that was once bundled tightly into the monolith is now essentially distributed across various services with intricate inter-service messaging demands. A further challenge is that, whereas in the monolith all of the data is typically stored in one large database, with microservices, only one service should manage any single item of data, yet clearly where complex business functions are implemented it might be necessary to engage multiple microservices in combination to deliver some higher order business feature.

US Patent publication number US2019/0108067, assigned to IBM, discloses a method for transforming a monolithic application into microservices. The IBM method involves significant end-user interaction via a console to inform decisions, and completely overlooks runtime/dynamic data collection and analysis. In addition reducing noise in the data is a large problem, especially where there are large volumes of code (either executing or static), which the IBM publication does not address.

Therefore, it is difficult to structure the data on a per-service exclusivity basis and it is computationally inefficient in terms of accessing the data for higher order functions. There are also potentially increased challenges in terms of validation and deployment of microservices when compared with monolithic architectures.

Summary

In accordance with a first aspect of the invention there is provided, as set out in the appended claims, a system for creating a software application which operates to perform a function, the system comprising: an analysis module for analyzing a version of the software application in a monolithic software system in which the software function is operational; an extraction module comprising dynamic analysis and static analysis of code and non-code-related artefacts for extracting discrete functional elements from the monolithic software system; an assembly module for forming the extracted discrete functional element into a microservice wherein the software function is recreated from one or more of said microservices.

The invention provides an assembly module for forming the extracted discrete functional element into a microservice wherein the software function is recreated from one or more of said microservices. Dynamic analysis involves attaching to programs as they execute and analysing the various interactions between program components, including analysis of data layer concerns which can includes database call. Static analysis involves analysing source code when a program is not running and can be used to establish theoretical relationships and constrains that exist in the code, such as may not be readily identified through dynamic analysis alone. Non-code-related artefacts includes data from supporting systems and sources, including source code repository analysis which can provide various temporal and meta information such as rate of change in artefacts and change origin(s) / originator(s) as well a change size. In one embodiment the the extraction module comprises a parallelised Al module for service identification configured with at least one unsupervised machine learning technique for extracting service candidates from extracted discrete functional elements wherein candidate microservices are recreated from one or more of said service candidates.

In one embodiment, the analysis module determines which data components of the monolithic software system are suitable for extraction.

In one embodiment, the analysis module analyses business logic of the monolithic software system.

In one embodiment, the analysis module uses static code analysis focused on architectural representation.

In one embodiment, the static code analysis comprises automated examination of efferent and/or afferent coupling metrics.

In one embodiment, the static code analysis comprises instability index determination.

In one embodiment, the analysis module uses dynamic code analysis under typical payloads to understand where code is more frequently executed.

In one embodiment, the analysis module uses dynamic code analysis to inform resources quotas in the microservices implementation.

In one embodiment, the analysis module uses testing asset leverage.

In one embodiment, testing asset leverage comprises pre-existing test assets targeted for transformation In one embodiment, the analysis module uses code volatility analysis.

In one embodiment, the analysis module uses Al techniques to discover which mechanisms are the best candidates for extraction to form microservices (or individual functions, which can be considered to be very small microservices as utilised in Function as a Service environments).

In one embodiment, the Al technique is Support Vector Machines.

In one embodiment, the Al technique uses neural networks.

In one embodiment, the Al technique is statistical cluster analysis (including dimensionality reduction techniques).

In one embodiment, the analysis module uses code analysis to determine information exchange within the monolith.

In one embodiment, the code analysis forms a footprint for event/messaging.

In one embodiment, code analysis examines parameter lists for methods and functions, and/or control flow and/or interaction between methods

In one embodiment, the analysis module uses interface communications analysis to determine what traffic arises in interfaces within the monolith

In one embodiment, the analysis module uses interface communications analysis to determine what traffic arises between the monolith and the database.

In one embodiment, the analysis module uses interface communications analysis to determine what traffic arises between the monolith and the Ul. In one embodiment, the analysis module uses automated monolith unit test case discovery in order to isolate units of interest that can be considered candidates for microservice transformation

In one embodiment, the automated monolith Ul test case discovery is a means to form test data for verification of the auto-generated microservices infrastructure (this includes Function as a Service).

In one embodiment, the analysis module examines object oriented (OO) structures of monolithic architectures.

In one embodiment, the analysis module examines continuous integration monolith based technology to auto-generate robust validation routines for the destination microservices based infrastructure.

In one embodiment, the analysis module uses automated analysis of source code commit logs to determine the level of volatility in each area of the code base as a means to identifying prime candidates for full microservice (or Function as a Service) implementation.

In one example, where analysis has determined the existence of large and stable features implemented in the monolithic system, the features are such instances in a form of microservice.

In accordance with a second aspect of the invention there is provided a method for creating a software application which operates to perform a function, the method comprising the steps of: analyzing a version of the software application in a monolithic software system in which the software function is operational; extracting discrete functional elements using dynamic and static analysis of code and non-code-related artefacts from the monolithic software system; forming the extracted discrete functional element into a microservice; and recreating the software function from one or more of said microservices (or finer grained functions in the form of a Function as a Service target environment).

In one embodiment, the step of analyzing the monolithic software system comprises determining which data components of the monolithic software system are suitable for extraction.

In one embodiment, the step of analyzing the monolithic software system comprises analyzing business logic of the monolithic software system.

In one embodiment, the step of analyzing the monolithic software system comprises, static code analysis focused on architectural representation.

In one embodiment, the static code analysis comprises automated examination of efferent and/or afferent coupling metrics.

In one embodiment, the static code analysis comprises instability index determination.

In one embodiment, the step of analyzing the monolithic software system comprises dynamic code analysis under typical payloads (or system testing payloads) to understand execution paths and dependencies, and where code is more frequently executed.

In one embodiment, the step of analyzing the monolithic software system comprises dynamic code analysis to inform resources quotas in the microservices implementation.

In one embodiment, the step of analyzing the monolithic software system comprises, testing asset leverage. In one embodiment, testing asset leverage comprises a pre-existing test assets targeted for transformation.

In one embodiment, the step of analyzing the monolithic software system comprises code volatility analysis.

In one embodiment, the step of analyzing the monolithic software system comprises using Al techniques to discover which mechanisms are the best candidates for extraction to form microservices.

In one embodiment, the Al technique is Support Vector Machines.

In one embodiment, the Al technique uses neural networks.

In one embodiment, the Al technique is statistical cluster analysis.

In one embodiment, the step of analyzing the monolithic software system comprises code analysis to determine information exchange within the monolith.

In one embodiment, the code analysis forms a footprint for event/messaging.

In one embodiment, code analysis examines parameter lists for methods and functions, and/or control flow and/or interaction between methods

In one embodiment, the step of analyzing the monolithic software system comprises interface communications analysis determine what traffic arises in interfaces within the monolith

In one embodiment, the step of analyzing the monolithic software system comprises interface communications analysis determine what traffic arises between the monolith and the database. In one embodiment, the step of analyzing the monolithic software system comprises interface communications analysis determine what traffic arises between the monolith and the Ul.

In one embodiment, the step of analyzing the monolithic software system comprises automated monolith unit test case discovery in order to isolate units of interest that can be considered candidates for microservice transformation

In one embodiment, the automated monolith Ul test case discovery is a means to form test data for verification of the auto-generated microservices infrastructure.

In one embodiment, the step of analyzing the monolithic software system comprises examining object oriented (OO) structures of monolithic architectures.

In one embodiment, the step of analyzing the monolithic software system comprises examining continuous integration monolith based technology to auto- generate robust validation routines for the destination microservices based infrastructure.

In one embodiment, the step of analyzing the monolithic software system comprises automated analysis of source code commit logs to determine the level of volatility in each area of the code base as a means to identifying prime candidates for full microservice implementation.

In one example, where analysis has determined the existence of large and stable features implemented in the monolithic system, the features are such instances in a form of microservice.

In accordance with a third aspect of the invention there is provided a computer program having program instructions for operating the method of the second aspect of the invention. Brief Description of the Drawings

The invention will be more clearly understood from the following description of an embodiment thereof, given by way of example only, with reference to the accompanying drawings, in which:-

Figure 1 is a schematic representation of a monolithic architecture;

Figure 2 is a flow diagram which illustrates an example of the method of the present invention;

Figure 3 is a schematic representation of an example of a system in accordance with the present invention; and

Figure 4 illustrates a detailed architecture of a fully automated system for creating a software application which operates to perform a software function.

Detailed Description of the Drawings

The present invention provides a system and method for automation of the creation of a software application as a combination of microservices by extraction of discrete elements of software functionality from a monolithic architecture, forming them into microservices such that the software application is recreated as serverless hardware infrastructure while also validating the transformed code. In many cases, serverless microservices will significantly reduce the hardware requirement for software systems.

Figure 2 shows an example of a method 11 for creating a software application which operates to perform a software function. The method analyses 13 a tightly bound monolithic software system in which the software function is operational. Examples of the means of analysis are provided below. The functional elements are extracted 15 from the monolithic software system and then formed 17 into a microservice. In this example, a number of microservices are created and used to recreate 19 the software function. The use of the method of the present invention is also described with reference to figure 3 below which shows the system, referred to as the microservice engine 49. Figure 3 is a schematic representation 21 which shows a known monolithic architecture for a software application 23, a software application created using microservices 25 and a microservice engine 49 in accordance with the present invention.

The monolithic architecture 23 comprises an application user interface (Ul) 27, tightly bound business logic module 29 with business logic elements 31 and a database 33. I/O communication 35 between the application Ul 27 and the business logic module 29 and I/O communication 37 between the database 33 and the business logic module 29 are shown. The microservice engine comprises a bundle of software analysis tools described in more detail below which analyse the tightly bound monolithic software system 23. It also extracts elements from the monolithic software system and then forms them into one or more microservice to recreate the software function as a loosely bound set of microservices.

Analysis of the monolithic architecture includes analysis of:

I/O communication 35 between the application Ul 27 and the business logic module 29;

I/O communication 37 between the database 33 and the business logic module 29; business logic elements 31 ;

Interaction between business logic elements; and the database. Once the analysis has been completed, the microservice engine assembles the extracted elements into one or more microservices (or smaller functions such as may be used in a Function as a Service environment) which recreates the software function of the software of the monolithic architecture. This is achieved as a series of outputs 51 , 53, 55, 57 which define various microstructures as loosely bound business logic elements, loosely bound database elements 59, I/O communication 65 between the application Ul 63 and the business logic elements 61 and I/O communication 67 between the database elements 59 and the business logic elements 61 .

In order to achieve this, a variety of analysis tools are used to analyse different features and functions of the monolithic software application.

The architecture is analysed using static code analysis. Dynamic code analysis under typical payloads (or system test payloads) is used to understand code hot spots (parts that are executing more often) and also to inform resources quotas in the microservices implementation. Resource quotas are concerned with the allocation of hardware resources to microservices, for example determining or allowing for a certain capacity of CPU and RAM to be allocated to a microservice. Dynamic analysis involves attaching to programs as they execute and analysing the various interactions between program components, including analysis of data layer concerns which can includes database call. Static analysis involves analysing source code when a program is not running and can be used to establish theoretical relationships and constrains that exist in the code, such as may not be readily identified through dynamic analysis alone. Non-code-related artefacts includes data from supporting systems and sources, including source code repository analysis which can provide various temporal and meta information such as rate of change in artefacts and change origin(s) / originator(s) as well a change size.

The interface communications 31 between the application Ul 27 and the business logic 29 and between communications 37 between the database 33 and the business logic 29 are analysed to determine what traffic arises in interfaces within the monolith.

Advanced code analysis is used to determine information exchange within the monolithic system 23 so as to form a footprint for event/messaging. For example, a very basic form of this analysis would examine parameter lists for methods and functions, as well as control flow and interaction between methods.

Static analysis is used to enable automated examination of general coupling and other object-oriented infrastructure (e.g. efferent and afferent coupling metrics) along with instability index determination. Automated unit test case and test case suite discovery are created for the monolithic software to isolate units of interest that can be considered candidates for microservice transformation. Automated monolith test case discovery is used with the application Ul 27 as a means to form test data for verification of the auto-generated microservices infrastructure. For example, test cases developed using technology such as Selenium could be discovered and ported to the microservices based implementation.

In addition, the object oriented (00) structures of monolithic architectures is examined. The structure of an OO implementation can be highly informative in relation to service (and microservice) discovery. DevOps based development and deployment chains may be created automatically.

Examples of the present invention may also use interface communications discovery to determine exactly what traffic arises in interfaces between the monolithic software 29 and the Ul 27. Similarly, interface communications discovery is used to determine exactly what traffic arises in interfaces between the monolithic software 29 and the database 33.

The present invention provides: a software-based system that will automatically determine a monolithic architecture through the implementation of Al techniques; a software-based system that will enable the creation of a microservices based infrastructure; a software-based system that will enable the establishment of appropriate event/messaging infrastructure; and a software-based system that will enable access to a serverless deployment environment.

In operation, Figure 4 illustrates a detailed architecture of a fully automated system similar to Figure 3 for creating a software application which operates to perform a software function, according to one embodiment of the invention. A number of separate phases are required in order for the system and method to operate. In a first phase the analysis module 100 is configured with a dynamic data collection engine 101 for analysing the dynamic behaviour at runtime of a version of the software application in a monolithic software system in which the software function is operational. The first phase can also include static data collection engine 102 configured for analysing the static behaviour of a version of the software application in a monolithic software system in which the software function is operational. A repository collection engine 103 is configured for profiling source code repositories and commit history cluster analysis of the software application in a monolithic software system. A dynamic data preparation engine 104 is provided and comprises a semantic domain analysis module for extracting domain entities of the software application in which the software functions are built around the entities. A multi-form filtering engine is configured for extracting and filtering the runtime interaction of the monolith code using sophisticated pruning heuristics and automated fitness sensitization using packages, classes, methods, URLs and database Queries of the software application in a monolithic software system in which the software function is operational.

An extraction module 105 comprises a parallelised Al module for service identification using unsupervised machine learning techniques (for example agglomerative clustering, spectral clustering and/or FP tree analysis) for extracting service candidates wherein candidate microservices are recreated from one or more of said service candidates.

An assembly module 106 comprises a Multivariate Service Optimisation and Selection module using Constraint-based Optimisation Algorithmic Techniques and configured to identify desirable microservice features including inter and intra, structural and semantic similarities within and among candidate microservices. If the microservice is not deemed optimal the result can be fed back into the parallelised Al module for further processing.

In one embodiment the dynamic data collection engine 101 is configured to collect runtime data from the monolithic application and extracts the execution traces of the software with information including chains, URLs, classes, methods and database methods to construct a call graph. The static data collection engine 102 collects static data from the monolithic application, extracts the execution traces of the software with information including chains, URLs, classes, methods and database methods, analyse the control flow and the data flow. The repository collection engine 103 is configured to profile the source code repository of the monolithic application and clusters the application components based on the commit history of the monolithic application. A commit history cluster analysis module clusters the monolith code into clusters with similar behaviour.

In the analysis module the dynamic data preparation engine 104 performs semantic domain analysis to automatically extract domain-specific entities of the monolithic application using the output of the parallelised Al Module. The dynamic data preparation engine performs a multi-form filtering engine that can utilise sophisticated pruning heuristics and automated fitness sensitizing using Packages, classes, Methods, URIs, Database Methods. In one embodiment the parallelised Al module automatically and concurrently extracts initial service candidates using unsupervised machine learning techniques including agglomerative clustering, spectral clustering and FP tree analysis. The parallelised Al module also composes candidate microservices from initial service candidates using constraint-based optimisation techniques.

In the assembly module 106, the multivariate service optimisation and selection using constraint-based optimisation algorithm uses fitness functions generated based on the outputs provided from the analysis module and the extraction module to select the service composition deemed optimal. The module 106 can be configured to use optimisation techniques to compose optimal service candidates that demonstrate the desirable microservice feature, wherein the selected service candidates become the recommended microservices.

As used herein, the term software application means a software function implemented in a given architecture/platform. Software function refers to the functionality of the software regardless of the architecture/platform.

The description of the invention including that which describes examples of the invention with reference to the drawings may comprise a computer apparatus and/or processes performed in a computer apparatus. However, the invention also extends to computer programs, particularly computer programs stored on or in a carrier adapted to bring the invention into practice. The program may be in the form of source code, object code, or a code intermediate source and object code, such as in partially compiled form or in any other form suitable for use in the implementation of the method according to the invention. The carrier may comprise a storage medium such as ROM, e.g. CD ROM, or magnetic recording medium, e.g. a memory stick or hard disk. The carrier may be an electrical or optical signal which may be transmitted via an electrical or an optical cable or by radio or other means.

The invention is not limited to the embodiments hereinbefore described but may be varied in both construction and detail.