A SYSTEM AND METHOD FOR GENERATING DOCUMENTS

Title:

A SYSTEM AND METHOD FOR GENERATING DOCUMENTS

Document Type and Number:

WIPO Patent Application WO/2019/191817

Kind Code:

Abstract:

A System and Method for Generating Documents The present invention relates to a system and method for generating documents. The preparation of advice documentation by professional service providers requires much manual input. Although some parts of the document may be "boiler plate" much will be bespoke and require preparation by a professional. Embodiments of the present invention provide a system and method for generating documents which comprises a document content generator having a database storing many document segments. A content prediction engine, which may be in the form of a neural network, is arranged to provide content parameters which are used to select from the plurality of document segments to build a document.

Inventors:

NOURI AHMADI GOURAB SEYED MOHSEN (AU)
ROBBIE JOEL (AU)
ROBBIE STEPHANIE (AU)
SHERIDAN TIMOTHY (AU)

Application Number:

PCT/AU2019/050305

Publication Date:

October 10, 2019

Filing Date:

April 05, 2019

Export Citation:

Click for automatic bibliography generation Help

Assignee:

NODAPP PTY LTD (AU)

International Classes:

G06F17/21; G06F16/93; G06Q40/00; G06Q50/10

Foreign References:

US20180032497A1	2018-02-01
US20180018741A1	2018-01-18
US8121915B1	2012-02-21
US20170220545A1	2017-08-03

Attorney, Agent or Firm:

GRIFFITH HACK (AU)

Download PDF:

View/Download PDF PDF Help

Claims:

Claims

1. A system for generating documents that have dynamic content, comprising a document content generator

comprising a database storing a plurality of document segments, a selection engine arranged to select document segments and build a document from the selected segments; and a content prediction engine arranged to receive input attributes to process the input prediction

attributes to provide content parameters and to use the content parameters to affect the selection engine to select document segments and generate a document.

2. A system according to claim 1, wherein the content prediction engine comprises a machine learning

arrangement .

3. A system in accordance with claim 2, wherein the machine learning arrangement comprises a neural network.

4. A system in accordance with claim 2 or claim 3, wherein the machine learning arrangement is arranged to implement a solution to a multi-class classification problem.

5. A system in accordance with any one of claims 2 to 4, wherein the content parameters comprise strategic guidance elements, and the machine learning arrangement is arranged to output the strategic guidance elements.

6. A system in accordance with any one of claims 2 to 5, wherein the machine learning arrangement is trained based on input historical prediction attributes.

7. A system in accordance with claim 6, wherein the input historical prediction attributes are obtained from historical document corpus of the same type of document as the document being generated.

8. A system in accordance with any one of the preceding claims, wherein the document content generator comprises a dynamic parser arranged to produce the plurality of document segments by analysis and segmentation of

historical documents of the same type as the document being generated.

9. A system in accordance with claim 8, wherein the dynamic parser is arranged to parse the historical set of documents to identify static portions of the documents (portions with content generally remaining the same) and dynamic portions of the document (portions with variable content) .

10. A system in accordance with any one of the preceding claims, wherein the document is a statement of advice (SOA) for financial advice.

11. A method of generating documents that have dynamic content, comprising the steps of: processing input prediction attributes by a content prediction process, to provide content parameters; using the content parameters to affect selection from a plurality of stored document segments; and generating a document from the selected document segments .

12. A method in accordance with claim 11, comprising implementing the content prediction process by way of a machine learning process.

13. A method in accordance with claim 11 wherein the machine learning process comprises a neural network.

14. A method in accordance with claim 12 or claim 13, comprising the step of training the machine learning process based on input historical prediction attributes.

15. A method in accordance with claim 14, wherein the input historical prediction attributes are obtained from a historical document corpus of the same type of document as the document being generated.

16. A method of instructing a system for generating documents that have dynamic content, the method comprising the steps of: parsing a plurality of historical documents of the same type as the document to be generated, by analysing and segmenting the historical documents, to prepare a database of a plurality of document segments; training a machine learning arrangement based on input historical prediction attributes, obtained from historical document corpus of the same type of document as the document being generated, to prepare the machine learning arrangement to output strategic guidance elements for affecting the selection of a plurality of document segments from the available document segments, based on input current prediction attributes.

17. A computer program, comprising instructions for controlling a computer to implement a system in accordance with any one of claims 1 to 10.

18. A non-volatile computer readable medium, providing a computer program in accordance with claim 17.

19. A data signal, comprising a computer program in accordance with claim 17.

20. An apparatus for preparing a system for generating documents that have dynamic content, the apparatus comprising : a parsing arrangement, arranged to parse a plurality of historical documents of the same type as the document to be generated, by analysing and segmenting the

historical documents, to prepare a database of a plurality of document segments, and a machine learning arrangement which has been trained based on input historical prediction attributes, obtained from a historical document corpus of the same type of document as the document being generated, so as to be arranged to output strategic guidance elements for

affecting the selection of a plurality of document

segments from the available documents segments, based on input current prediction attributes.

21. A computer program, comprising instructions for controlling a computer to implement a method in accordance with claim 16.

22. A non-volatile computer readable medium, providing a computer program in accordance with claim 21.

23. A data signal, comprising a computer program in accordance with claim 21.

Description:

A System and Method for Generating Documents

Field of the Invention The present invention relates to a system and method for generating documents, and, particularly, but not

exclusively, to a system and method of generating

documents that have dynamic content. Background of the Invention

The preparation of advice documentation by professional service providers is ubiquitous. Contracts, letters of advice, agreements between parties, licences, financial advice and many other professional services deliver their "product" in the form of documentation which realises the client's requirements. All such documentation requires expert input and can be costly.

An example of the type of documentation that a

professional adviser may prepare is a Statement of Advice SOA) provided by a financial adviser.

A financial adviser is a professional who suggests and renders financial services to clients based on their financial situation. They have to complete specific training and hold a license to provide advice.

During the first meeting with client, the adviser

discusses the advice needs and then gives an idea of what they can do to help. The adviser will also be able to say how much the advice will cost so the client can decide whether to proceed any further. If they decide to continue with the adviser, he will prepare a statement of

advice (SOA) that will formally document the advice, the strategies and any financial products recommended. In a nutshell, a SOA is a document that clearly outlines the recommendations that have been made by the industry expert and also explains why these recommendations have been made. In Australia, the SOA must be provided in accordance with the Financial Services Reform Act (2002) . It should not only provide details of what has been recommended and why, but also details of how the

recommendations intended to benefit the client.

A range of information is provided by a properly drawn-up SOA. It may include the following information, as well as other information: · The name, address and contact details of the adviser or the company that has offered the advice;

• Details about the advice that has been given; · The date that the advice was given;

• The adviser's or company's Financial Services License Number;

• The type of product that the advice relates to;

• The recommendations that have been made and why they have been made;

• Details of the benefits that the products/services recommended should provide;

• The name of the insurer; and

• Details of the client's instructions to the adviser,

The adviser or company must provide a client with a SOA in order to be compliant with regulations and, in order to ensure that a client is dealing with an adviser or company that is compliant, there are a number of items that should be checked. These include:

• Checking the document is official;

• Looking for justification of recommendations;

• Ensuring there are details of products/services

recommended;

• Checking on commission disclosure; and

• Ensuring there is an official license number.

The SOA is important for many reasons. First off, it provides full and important details of the services and products that have been recommended, so the client has a written summary of everything that has been discussed, as well as the reasons for recommendations, commission information and other important details. Also, a proper SOA with the license number included helps to ensure that the client is dealing with a compliant firm.

It is important to carefully read the SOA to ensure you fully understand how the adviser is remunerated and whether they have any relationships or associations that may influence their advice. The SOA will also outline any potential conflicts of interests in the provision of recommendations .

SOAs may provide limited advice on a particular product (e.g. insurance) or may provide more general advice on, for example, a financial plan going forward for the client who wishes to invest in superannuation and/or other investment portfolios. The SOA is therefore a complex document. It may include "static content". This may include "boiler plate" passages such as statements regarding "cooling off" periods for investments. Such static content can be generated by, for example, current systems that store templates incorporating the static content .

Much of the SOA, however, will be complex "dynamic

content" that must be prepared by a professional adviser. This dynamic content may include details of strategies, products to be provided in accordance with the strategies, and other expert knowledge.

For generating an SOA an adviser may currently have some support systems at hand. For example, they may have a CRM (customer relation management system) which is not only for administrative tasks but also helps putting in

information through tools and forms to generate at least some of the SOA. This will provide some static content. Advisers need to go through different menus and wizards to put together an SOA by these applications, and finally the generated SOA needs to be revised by a paraplanner for further adjustment. This is a very time consuming and expensive process that wastes a lot of resources of an advisory firm. The final generated SOA's are usually not consistent across a firm and the price and time of

delivery can lead to a bad customer experience.

As discussed, much of the dynamic content must be bespoke prepared by the adviser and separately input to the document. This requires significant professional time and expertise. It is very costly. The problems relating to preparation of an SOA in the financial advisory industry can be extrapolated to the preparation of any

documentation in any professional services that requires expert input (law, finance, medicine, and any other profession) . Currently, there are no or limited

alternatives to requiring significant amounts of expert time in preparation of advice and documentation.

Supporting systems are currently limited. While they are useful for providing templates which contain static content, they are not particularly useful for determining and incorporating dynamic content in a document.

Summary of the Invention

In accordance with a first aspect, the present invention provides a system for generating documents that have dynamic content, comprising a document content generator comprising a database storing a plurality of document segments, a selection engine arranged to select document segments and build a document from the selected segments, and a content prediction engine arranged to receive input prediction attributes, and to process the input prediction attributes to provide content parameters, and to use the content parameters to affect the selection engine to select document segments and generate a

document .

In an embodiment, document segments are prepared from a corpus of historical documents of the same type of

document that is to be generated. For example, if the system is arranged to generate a Statement of Advice (SOA) for financial advice, then the historical corpus will be of many previous SOAs .

In an embodiment, the document content generator comprises a dynamic parser arranged to produce the plurality of document segments by analysis and segmentation of the historical documents. This provides a "library" (in the database) of content which the selection engine can select from in order to generate the document. This embodiment requires the segments to be selected and the document to be built. In an embodiment, the content prediction engine facilitates the selection of the

documentation based on the input prediction attributes.

In an embodiment, the prediction engine comprises a machine learning arrangement. In an embodiment, this comprises a neural network. In an embodiment, the machine learning arrangement is trained based on input historical prediction attributes. These may be "facts" parsed from historical documents of the same type as the document to be generated. These historical prediction attributes are used to train the machine learning arrangement. In an embodiment, the output of the machine learning arrangement are strategic guidance elements which affect the selection engine to select appropriate document segments in

accordance with the required content.

For example, in the case of an SOA for financial advice, the prediction attribute may comprise facts such as age, demographics of clients, appetite for risk, and the like.

Advantageously, the system in accordance with embodiments of the invention outputs a document which includes static portions and also dynamic portions, which have been selected via the machine learning arrangement. The machine learning arrangement acts as an artificial intelligence expert, governing the selection of dynamic portions of the document. In embodiments, this vastly reduces the workload of the professional adviser, as the strategic advice will automatically be prepared. The professional adviser can review the document prepared by the system, make changes and finalise it. This will take much less time than preparing a document from scratch. Further, the advice is likely to be more consistent and better presented across an organisation which utilises the system. In an embodiment, documents produced by the system can be fed back into the historical document corpus for the content generator and also for the content prediction engine, so that the content prediction engine continues to learn and improve.

In accordance with a second aspect, the present invention provides a method of generating documents that have dynamic content, comprising the steps of: processing input prediction attributes by a content prediction process, to provide content parameters; using the content parameters to affect selection from a plurality of stored document segments; and generating a document from the selected document segments. In an embodiment, the method comprises the step of

implementing the content prediction process by way of a machine learning process. In an embodiment, the machine learning process comprises a neural network. In accordance with a third aspect, the present invention provides a method of constructing a system for generating documents that have dynamic content, the method comprising the steps of: parsing a plurality of historical documents of the same type as the document to be generated, analysing and segmenting the historical documents, to prepare a database of a plurality of document segments; training a machine learning arrangement based on input historical prediction attributes, obtained from historical document corpus of the same type of document as the document being generated, to prepare the machine learning arrangement to output strategic guidance elements for affecting the selection of a plurality of document

segments from the available document segments, based on input current prediction attributes.

In accordance with a fourth aspect, the present invention provides a computer programme, comprising instructions for controlling a computer to implement a system in accordance with the first aspect of the invention.

In accordance with a fifth aspect, the present invention provides a computer readable medium, providing a computer programme in accordance with the fourth aspect of the invention.

In accordance with a sixth aspect, the present invention provides a media signal, comprising a computer programme in accordance with the fourth aspect of the invention.

Brief Description of the Figures

Features and advantages of the present invention will become apparent from the following description of

embodiments thereof, by way of example only, with

reference to the accompanying drawings, in which:

Figure 1: is a schematic diagram of a system in accordance with an embodiment of the invention;

Figure 2: is a schematic flow diagram illustrating overall operation of the system of Figure 1;

Figure 3: is a block diagram of a computing apparatus which may be used to implement the system of Figure 1; Figure 4: is a more detailed view of one part of the flow diagram of Figure 2;

Figure 5: is a more detailed view of another part of the flow diagram of Figure 2;

Figure 6: is a flow diagram illustrating operation of a dynamic parser implemented in accordance with an

embodiment of the present invention;

Figure 7 : is a flow diagram illustrating further operation of the dynamic parser;

Figure 8: is a flow diagram illustrating further operation of the dynamic parser;

Figures 9 to 12: are illustrations of example document segments which may be produced by a system in accordance with an embodiment of the present invention;

Figure 13: is a flow diagram illustrating operation of a content prediction engine of the system of claim 1;

Figure 14: is a representation of a display showing a portion of the output of a dynamic parser, in accordance with an embodiment of the present invention;

Figure 15 is a alternative flow diagram explaining overall operation of the system of Figure 1, and

Figure 16 is a diagram illustrating an "end to end" architecture of a system in accordance with an embodiment of the invention. Detailed Description of Embodiments

Figure 1 illustrates a system in accordance with an embodiment of the present invention for generating dynamic documents. The system comprises a computing system 1, in this example comprising a server computing system hosted in the cloud (although it may comprise any other computer architecture) . The system 1 comprises and or more

processors, memory and also a database 2.

System 1 implements a document content generator arranged to generate documents from segments of documents stored in the database 2. A selection engine 4 implemented by the system 1 is arranged to select the document segments and a content prediction engine 5 implemented by the system 1, is arranged to receive input prediction attributes and process the input prediction attributes to provide content parameters arranged to affect the selection engine 4 to select the document segments and generate a document.

The document content generator 3, selection engine 4, and content prediction engine 5 may be implemented by any combination of software and/or hardware architecture. In this embodiment, the system may be accessed by devices 6, 7 which may be remote and connected to the system 1 over a network. Devices 6, 7 may be any type of computing device, tablet device, smart phone or any other processing device arranged to communicate with system 1. These devices 6, 7 may be used by administrators to administrate the system (devices 6 for example) or by users who wish to use the system to generate documents (e.g. devices 7). Users may include professional service providers of an organisation who wish to generate documents for their clients using the expert system 1. Figure 2 shows a flow diagram illustrating operation of the system 1. The operation can be considered to comprise a number of parts. In this example, the document content generator comprises a dynamic parser 10, implemented by appropriate software and hardware, which is arranged to produce the plurality of document segments by analysis and segmentation of historical documents 15 of the same type as the document being generated.

Another part of the system operation is the artificial intelligence (AI) engine 11. The AI engine 11 comprises a machine learning arrangement 12, in this example being a deep neural network. This will be implemented by

appropriate hardware and software of system 1. The neural network 12 is trained from input of historical prediction attributes 16. The prediction attributes may comprise factual circumstances and other information and data that an expert would require in order to provide the advice that would be included in a document. These attributes may be obtained from historical document corpus 15 and/or other sources (e.g. via a web API 17 obtaining information over a network) .

Some examples of Facts are given in the tables below: It will be appreciated that there may be many other facts:

You have no expenses or not provided details

Lrtestxda assets

A further part of the AI engine 11 is a selection engine 20 which is arranged to receive new prediction attributes 21 and outputs from the neural network 12 following input of the new prediction attributes to the neural network, to select document segments prepared by the dynamic parser to output a document 25. The document 25 may be finalised and checked by an expert.

An example of a computing apparatus which may be used to implement the system 1, will now be given with reference to Figure 3. Figure 3 shows a schematic diagram of components of a computer system (900) which may implement the system 1. Computer system 900 may be a high-performance machine, such as a super computer, a desktop work station or a personal computer, or may be a distributed computing array or a computer cluster or a networked cluster of computers. In this example, the server architecture and database architecture is implemented by hardware and software supported in the "Cloud". The system 1 may be provided as software/hardware as a service, or may be owned by the organisation .

The computer system 900 comprises a suitable operating system and appropriate software for implementation of the various processes operated by the system 1.

The computing apparatus 900 comprises one or more data processing units (CPUs) 902; memory 904, which may include volatile or non-volatile memory, such as various types of RAM memories, magnetic disks, optical disks and solid- state memories; a user interface 906 which may comprise a monitor, keyboard, mouse and/or touch-screen display, may enable access by an administrator of the system 3. A network communication interface 908 for communicating with other computers and devices (e.g. 6 and 7) is also

provided, and one or more communication buses 910 for interconnecting the different parts of the system 900. The computer system 900 may access data stored in a database 914 via network interface 908 (the database 914 may correspond to the database 2 shown in Figure 1) .

Database 914 may be a distributed database. A computing apparatus for implementing embodiments of the invention is not limited to the computer apparatus described above. Any computer system architecture may be utilised, such as standalone computers, networked

computers, dedicated computing devices, or any device capable of processing information in accordance with embodiments of the present invention. The architecture may comprise client/server architecture, or any other

architecture .

The computing system is provided with an operating system and various computer processes to implement functionality. The computer processes may be implemented as separate modules, which may share common foundations such as routines and sub-routines. The computer processes may be implemented in any suitable way and are not limited to separate modules. Any software/hardware architecture that implements the functionality may be utilised.

Figure 4 shows a detail on the flow diagram of Figure 2, showing the dynamic parser 10 side of the system. The dynamic parser 10 is arranged to receive as input many historical documents 15 of a historical document corpus.

In this example, the document corpus is of Statements of Advice (SOA) for financial advice, which may have been prepared by financial advisers, for example. Dynamic parser 10 is arranged to break the documents 15 down into segments which can be stored in the database 2. The segments are labelled so that they can be used for

processing by the selection engine in order to generate documents .

Figures 9 to 12 illustrate portions of documents shown in segmented form. In the examples shown in Figures 9 to 12, there are a number of labels which identify various portions of the documents. These labels are applied by the dynamic parser 10 analysing the documents. The labels include "heading". A heading, relates to a heading of the document. In Figure 9 example, the heading relates to the "the scope of our advice" heading of this particular section of the document heading.

The label "static", relates to content of the document that is considered to be consistently appearing in the documents across the corpus. This type of content is considered to be static type content i.e. content that doesn't change from document to document. Static content would include, for example, "boiler plate" clauses and the like .

The label "sub-heading" relates to parts of the document that have been identified by the parser 10 as being sub headings .

"Dynamic" content relates to parts of the document which have been identified as varying from one document to the next .

Figures 9 to 12 give various examples of headings, static content, dynamic content and sub-headings. It will be appreciated that there will be many more than these types in any historical document corpus.

Referring again to Figure 4, the dynamic parser

lOdetermines headings by checking the style of the text in the documents so that the documents can be segmented later and the same segments can be compared with each other (step 100) . At 101 the sub-headings are determined under each headline in the documents. At 102 sections with similar function in the documents are matched. The

sentences in matched sections are compared to find similar and same sections and then they are labelled with numbers (step 103) . The portions are then labelled dynamic or static based on the frequency of their appearances. Those sentences that consistently appear in documents will be labelled as static (step 104) .

At step 105, the dynamic portions in particular are checked against fact find data to find any correlation between the dynamic parts and fact find information. Fact find may include personal information of the client seeking financial advice, their financial aims, risk profile etc. This may be obtained from the historical document corpus. Also, the combinations of

Goals/Products/Strategies from the historical document corpus are examined for pattern recognition.

The dynamic parts that appear on special occasions and related either to combinations of features or individual personal situations are therefore understood and labelled. These labelled segments of documents are then available 106 for decision making in preparation of documents (107) .

Figures 6, 7 and 8 illustrate in more detail the process of document segmentation implemented by the dynamic parser 10.

Figure 6 illustrates how the headings and sub-headings are determined. The historical SOAs are input into the system (200) . The SOAs are parsed into text objects which can be processed by the system (201) . Each SOA is then separately processed to determine headings and sub-headings. At step 202 an SOA is selected. Headings for the SOA are

determined (203) and also sub-headings (204) . A logical tree structure is constructed for the SOA in the system (205) .

Labels are then assigned to the different headings and sub-headings. Headings that are the same as each other are provided with the same label as are sub-headings that are the same as each other.

The process is to, firstly, (step 206) determine if there is a label assigned to the heading. If there is not a label assigned to this heading, then label the heading with a new label (step 207) . If a label already exists for the heading (that is, if that heading has already been determined) then the heading is labelled with the same label (208) as existing.

A similar process is carried out for sub-headings. At step 209, it is determined whether there is already a label assigned to the particular sub-heading. If not, a new label is applied to the sub-heading (210) . If a label already exists for that sub-heading, then it is labelled with the same label (211) .

The process is repeated (step 212) until it is determined that all the headings and sub-headings are labelled for all the SOAs (213) .

The next stage in the dynamic parser 10 process is

illustrated in Figure 7. This part of the process relates to comparing sections of documents with each other and labelling the same or similar ones with the same label. At 300, all the SOAs with labelled heading and sub-heading are input into this section of the process. Headings are selected (step 303) and sub-headings under the selected heading are picked (304) and then any unlabelled sentences under the sub-heading are selected (306) . The sentence is then compared with text under the same sub-heading from other SOAs (307) . The selected sentence is labelled with the same label as similar or the same sentences already labelled (308) . A similar process is applied to further sentences under other sub-headings and headings until all the SOAs are processed (steps 301, 302, 305 and 309) .

The next stage of the parsing process is to label or "tag" the labelled text with associations to Products (e.g.

financial products), Goals (e.g. financial goals) and Strategies (e.g. financial strategies) and any other attributes that the document features may require. Goals, Strategies and Products are obtained from parsing the historical documents.

Referring to Figure 8, all the labelled portions of documents are input to the process (400) . A dataset is created starting with the SOA file name as the first column (for test and validation) (401) . For each label text a new column is added (402) . A new column is added for the sub-heading label (403) . Then a new column is added for each Goal (404), Strategy (405) and Product (406) .

A process is then implemented to tag the labelled portions of texts as Goals, Strategies or Products. Text is

selected (step 409) and a Spearman correlation is tested between each labelled text and other columns in the dataset. A Spearman correlation matrix is built (410) and then for each labelled text in the dataset (step 411) a determination is made if the text is 100% correlation with Goals (412), Strategy (414) or Products (415). Depending on these determinations, the section of text is tagged with Goals, Strategies or Products (steps 413, 416 and 417) .

This process is repeated for each sub-heading (steps 407 and 408) until all of the content is appropriately tagged The labelled texts are then available for document

generation . To summarise the above process, this state of the art technique is used to parse and extract all the dynamic part of the SOAs and map them across all SOAs, so that varieties of the values for different segments of an SOA can be identified. The segments are flagged automatically based on differences and similarities. The same ones will get the same ID that can be used as a class variable for classification .

The historical corpus of the SOAs also has the Fact Find in the documentation. The dynamic parser searches and extracts keywords and pairs with corresponding values for each key (e.g. Date of Birth: 54) . The extracted data is restructured to form a training set suitable to be used as the input of the AI engine (neural network 12) . The dynamic parser also checks the labelled text portions against the Fact Find data to find any correlations between dynamic parse and the Fact Find provided. The combination of Goals/Products/Strategies are examined for pattern recognition.

Figure 5 is a diagram illustrating the content prediction engine. In this embodiment, this is a machine learning arrangement 150, in the form of a neural network. The neural network 12 is a deep neural network with 25% dropout, RELU activation function, ADAM optimisation algorithm with four fully connected layers (100 neurons in the first layer, 50 in the second layer, 25 in the third layer, and 10 in the final layer) . The neural network 12 is trained based on input historical prediction attributes which are obtained by parsing the historical documents 15 to find the "Fact Finds"

(attributes) . Other facts (attributes) can be obtained in many other ways e.g. manual entry, automatically obtaining information from networks e.g. the Web (via a Web API 17) and in any other way.

The historical Fact Finds are obtained from the historical document corpus 15 by parsing the documents to extract information and by pairing the keys and values in the Fact Find (step 150) . All the attributes obtained from the parsing of the historical documents and by other means are then used to make a training set which is fed into the deep neural network 12 in order for the network to

understand the relationship between personal information (and other attributes) and Strategies/Products/Goals (step

151) .

The trained model 12 will then be used to predict the Strategy, Product and Goals for the next client (step

152) . This will be based on the new Fact Find that will be input to the trained model.

The neural networks essentially output strategic guidance elements (153) that are input to the selection engine 21 with the new Fact Find information, so the selection engine 21 can select from the available sections of document (from database 2) to output a document which includes text relevant to the predicted, Strategies, Products and Fact Find 154, to generate the final SOA 25. The SOA may be reviewed by an expert to finalise it.

Figure 13 is a flow diagram illustrating the operation of the neural network 12 in more detail.

At step 500 the historical prediction attributes are input. For every field in the Fact Find a new column is made in the table (501) . A record is added for every Fact Find in the historical SOAs 502. The categorical columns are binarised (503) and all the numerical columns are binned (504) . Strategies are then processed. The

Strategies picked (506) and used as a class table (507) . The class table is hot coded (508) and a training set is made (509) .

Parameters are adjusted (510) and hyper-parameters are adjusted (511) . The deep learning neural model is trained (512) and the hyper-parameters are optimised (513) and the trained model is saved (514) . This process is repeated for all Strategies (step 505) .

Next the Products are processed (step 516), in a similar manner to the Strategies (step 517, 518, 519, 520, 521,

522 and 523) . The trained model is saved (524) . This process is repeated for all Products (step 515) until the machine learning arrangement 12 is trained.

In an example, the training set for SOAs has been used to train the model and after 2000 epochs the model reached maximum accuracy of 91% on the training set.

New client Fact Finds and Goals are used as the prediction set (step 525) . For each part of the SOA the model is run to predict the relevant segment that needs to be assembled in a new document. The new Fact Finds are processed by making a new column in a table for every field in the Fact Find (526) . The categorical columns are binarised (527) and all the numerical columns are binned (528) .

The table is fed into the saved trained model to output prediction elements for predicting the relevant document segment that needs to be assembled in the new document. Figure 14 shows a display of part of an output of a dynamic parser. Document segments are shown on the left (reference numeral 500) and corresponding strategic guidance elements shown on the right (reference numeral 501) .

The output of the neural network selects the strategic guidance elements and from this the corresponding document segments are selected.

The generated SOA is revised by an expert for further modification and improvement so it is accurate and ready to be delivered to the client. Further, the final versions will be used for improvement of the training set and more ways will be assigned to the newer data points, so the model will be more influenced by changes.

Figure 15 is an alternative view to that of Figures 1, 4 and 5, showing operation of a system 1 in accordance with an embodiment of the invention.

The historical document corpus 15 is operated on by the dynamic parser to determine Headings and Subheadings (step 601) . Sections with similar functions in the documents are then matched (602) . Sentences are matched and

labelled (603) . At step 604 dynamic and static text is identified, depending on recency and frequency. At step 605 the labelled text is then checked against the Fact Find, strategy/advice/product to label the text (dynamic portion; of the documents in particular) . At step 606, Named Entity Recognition algorithms are used to identify place holders in the content. Place holders are places in the content where values (e.g. text or numerical) are injected into the text. At step 607 labelled text

segments are made available for decision making. In this diagram the trained AI engine (which may include the neural network and selection engine) is indicated by reference numeral 608. New client's Fact Find is fed in (609) to the model and a new document is assembled based on labelled text that matches (610) . Place holders are filled in with relevant clients information (611) to produce the final document (612) which may be reviewed by a professional.

Figure 16 illustrates an overall architecture for

implementation of a system in accordance with an

embodiment of the invention. As discussed above, the system may be implemented by a computer hardware/software which may be Cloud based (or may be provided in any other way) . The system (generally designated by reference numeral 700) may be accessed by customers 701 via

telecommunications networks, to enable the customers 701 to use the system to generate documents.

Reference numeral 702 designates the Artificial

Intelligence (AI) system, an embodiment of which is described above with reference to the earlier figures (in particular Figures 4 and 5) . AI System 702 includes the Dynamic Parser and the AI engine (which may be implemented by neural network) .

A "Domain Model Repository" 703 is provided storing data on the "architecture" of the types of documents to be generated, data on document segments and labels (tags) and the relationships between them. In one embodiment, the Domain may be Statements of Advice documentation for financial advice. The Domain Model Repository stores the data architecture for the document domain which may be applied across all customers. The tags and labels used are common across the entire system shown in Figure 16 (i.e. same tags/labels and document architecture used throughout) . The Domain Model Repository 703 is therefore the "source of truth" for the system.

A customer provides access to their historical document corpus (704), which is dealt with by the AI system 702, as discussed above, in an order to provide tagged (labelled) content which is stored in a Tagged Content Repository 705. Note that the content is labelled based on the Model which is provided by the rules on relationships between the data within the given Domain stored in the Domain Model Repository 703.

The Tagged Content Repository is a database storing a collection of document segments with associated tags, which acts as the source of content for the generation of documents .

For a given customer, the Template Engine 706 is arranged to query the Tagged Content Repository 705 to receive an ordered list of content segments for that customer, along with the associated tags for each segment. The Template Engine 706 is arranged, for each content segment, to evaluate the data provided by the Fact Collector 705 against the tags stored against each content segment. The template engine 706 is essentially the "selection engine" referred to above. For example, imagine the following scenario :

• Fact Collector data provided:

o Client Income equals 100,000

• Segment Tags provided:

o Client income <50,000 AND client income >200,000

In this scenario, the associated segment would evaluate as True, and therefore be rendered in the SOA document. The output from the Template Engine 706 is an SOA document 708 which is provided to the customer. Concurrently, the SOA document is fed, along with its associated Fact

Collector Scenario, back in to the AI System 702 for further learning.

In the above embodiment, the system is arranged for preparation of SOAs for financial advice. It will be appreciated that the invention is not limited to the domain of SOA' s for financial advice. It may be applied in many other domains where document preparation is part of the task.

For example, much documentation is required for medicine. The documentation may include Treatment Plans, Clinical Assessment Documentation, and other medical documentation. Embodiments of the invention may be used for preparation of this documentation, using the appropriate historical document corpus and applying the system in a similar way to discussed above for the SOA embodiment.

Similarly for documentation required in the legal

profession e.g. letters of advice. Documentation that may be required for construction, such as project reports for property development. Insurance policy documents are another area where the invention may be applied.

Embodiments of the invention may be applied in many domains .

In the above embodiment, the AI engine is implemented by a deep neural network. The AI engine is not limited to a deep neural network. It may be implemented by any other types of neural network, or by any other types of machine learning process and system. It will be understood to persons skilled in the art of the invention that many modifications may be made without departing from the spirit and scope of the invention.

Previous Patent: MULTI SURFACE ACOUSTIC NEBULISER

Next Patent: MEDICAL DEVICE, ACCESSORIES FOR USE THEREWITH, AND METHODS OF USE