Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
A SYSTEM AND METHOD TO GENERATE AN INTERACTIVE VIDEO ON THE FLY
Document Type and Number:
WIPO Patent Application WO/2016/189535
Kind Code:
A1
Abstract:
The present invention provides a method for real-time generation and streaming of at least one interactive context based video according to video template, raw context related data and real time event data. The method comprising the steps of: receiving input data related to an identified template and context related data, receiving real time input of event data, while the subsequent portion of video is still in generation, choosing, updating and generating audible and visual materials, according to the predefined logic of the identified template in relation to real time data event, generating and encoding video frames for pre-defined portion based polling the generated/updated visual materials, generating continuous audio streams/frames for predefined portion of the video based on polling generated or updated audible materials and providing the generated streamable data of the predefined video portion at the external entity side, while the subsequent portion of video is still in generation.

Inventors:
KALISH DANNY (IL)
SHENBERG IDAN (IL)
Application Number:
PCT/IL2016/050548
Publication Date:
December 01, 2016
Filing Date:
May 26, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
IDOMOO LTD (IL)
International Classes:
G06F15/16; G11B27/031; H04N9/802
Foreign References:
US20130275557A12013-10-17
US20120254456A12012-10-04
US20090094632A12009-04-09
US20110093784A12011-04-21
US20110296048A12011-12-01
US20130007201A12013-01-03
US9032465B22015-05-12
US8666226B12014-03-04
US20150340067A12015-11-26
Attorney, Agent or Firm:
ZER, Yoram et al. (P.O Box 94, 02 Rehovot, IL)
Download PDF:
Claims:
CLAIMS

1. A method for real-time generation and streaming of at least one interactive context based video according to video template, raw context related data and real time event data, said method comprising the steps of: receiving input data related to at least one identified template and context related data;

receiving real time input of event data, while the subsequent portion of video is still in generation;

choosing, updating and generating audible and visual materials, according to the predefined logic of the identified template in relation to real time data event;

generating and encoding video frames for pre-defined portion based polling the generated/updated visual materials;

generating continuous audio streams or frames for predefined portion of the video based on polling generated or updated audible materials; and providing the generated streamable data of the predefined video portion at the external entity side, while the subsequent portion of video is still in generation;

wherein the receiving, generating audible and visual materials, generating continuous audio streams/frames and providing streamable data are performed by at least one processor.

2. The method of claim 1 wherein each interactive event type is identified by a matching interactive handling mechanism.

3. The method of claim 2 wherein each event is associated with a specific video generation process or video stream process.

4. The method of claim 1 wherein the event data is received from external sources.

5. The method of claim 1 wherein the event data is based on actions

performed by at least one user while viewing the streamed video.

6. The method of claim 4 wherein the user's actions are reactions to triggers appearing in real-time in the streamed video, wherein the reaction include at least one of : user selected choice from given option, user

pointing/touching at predefined location in the movie frame or use entering data by voice or text.

7. The method of claim 4 wherein the video streaming is paused while

waiting for user action.

8. The method of claim 3 wherein while waiting for user action, is streamed predefined video scene.

9. The method of claim 1 wherein the event data is based on identifying and analyzing behavior of at least one user while viewing the streamed video.

10. The Method of claim 1 wherein the updating and generating audible and visual materials is initiated at predefined points based on template definition.

11. The Method of claim 1 wherein the updating and generating audible and visual materials is initiated based on received events data.

12. A system including at least one processor for real-time generation and streaming of at least one interactive context based video according to video template, raw context related data and real time event data, using said system comprising of:

video Generation API module which receives input data related to at least one identified template and context related data;

interactive video processing module which polls real time input of event data, while the subsequent portion of video is still in generation;

visual and audio material preparation modules which chooses, updated and generate audible and visual materials, according to the predefined logic of the identified template in relation to real time data event;

encoding module for generating and encoding video frames for predefined portion based polling the generated/updated visual materials; and interactive Video Stream Management module for providing streamable data of the predefined video portion at the external entity side, while the subsequent portion of video is still in generation;

wherein the modules are implemented on at least one processor device.

13. The system of claim 12 wherein each interactive type is identified by a matching interactive handling mechanism.

14. The system of claim 12 wherein the event data is received from external sources including remote communication server or remote sensors..

15. The system of claim 12 further comprising an interface module

implemented on user computer device which interface device which receives user interaction data while viewing the video, wherein the event data is based on interaction data.

16. The system of claim 15 wherein the user's interaction data are reactions to triggers appearing in real-time in the streamed video, wherein the reaction include at least one of : user selected choice from given option, user pointing/touching at predefined location in the movie frame or user voice or text input.

17. The system of claim 15 interface module identifies and analyzes behavior of at least one user while viewing the streamed video.

18. The system of claim 12 wherein the update is initiated at predefined points based on template definition.

19. The system of claim 12 wherein the update is initiated based on received events data.

20. The system of claim 12 further comprising a video Generation

Management module which receives data of a new video generation request and data polled from interactive video module and coordinates the launching or operating of the video encoding module, Audio Generation module , the Video Frame GPU rendering module and preparation modules.

21. The system of claim 12 wherein the encoding module is kicked of once both an audio input stream is fed by the audio generation module and a video frame sequence is fed by the video frame GPU rendering module and wherein the Audio Generation module is launched when polling new audible material input created by the audible material preparation module and the Video Frame GPU rendering module is launched once polling new visual material input created the visual material preparation module.

22. The system of claim 20 wherein the processed events are formalized to canonical form that is consumed by the video generation management module as part of its polling cycle.

Description:
A SYSTEM AND METHOD TO GENERATE AN INTERACTIVE VIDEO ON

THE FLY

BACKGROUND

1. TECHNICAL FIELD

The present invention relates generally to generation of context based video files based on predefined templates. More particularly, the present invention relates to on the fly generation of context based video.

BRIEF SUMMARY

The present invention provides a method for real-time generation and streaming of interactive context based video according to video template, raw context related data and real time event data. The method comprising the steps of: receiving input data related to at least one identified template and context related data, polling real time input of event data, while the subsequent portion of video is still in generation, choosing, updating and generating audible and visual materials, according to the predefined logic of the identified template and real time data event, generating and encoding video frames for pre-defined portion based polling the generated/updated visual materials, generating continuous audio streams/frames for predefined portion of the video based on polling generated or updated audible materials and providing streamable data of the predefined video portion at the external entity side, while the subsequent portion of video is still in generation.

According to some embodiments of the present invention each interactive event is identified by a unique event type and processed by a matching interactive handling mechanism.

According to some embodiments of the present invention the processed events are formalized to canonical form that is consumed by the video generation management instance as part of its polling cycle.

According to some embodiments of the present invention the event data is received from external sources.

According to some embodiments of the present invention the event data is based on actions performed by at least one user while viewing the streamed video. According to some embodiments of the present invention the user's actions are reactions to triggers appearing in real-time in the streamed video, wherein the reaction include at least one of : user selected choice from given option, user pointing/touching at predefined location in the movie frame.

According to some embodiments of the present invention the video streaming is paused while waiting for user action.

According to some embodiments of the present invention while waiting for user action, is streamed predefined video scene.

According to some embodiments of the present invention the event data is based on identifying and analyzing behavior of at least one user while viewing the streamed video.

According to some embodiments of the present invention the update is initiated at predefined points based on template definition.

According to some embodiments of the present invention the update is initiated based on received events data.

The present invention provides a system for real-time generation and streaming of interactive context based video according to video template, raw context related data and real time event data. The system comprising of: Video Generation API module for receiving input data related to at least one identified template and context related data, Interactive video module for polling real time input of event data, while the subsequent portion of video is still in generation, Visual and audio material preparation modules choosing, updating and generating audible and visual materials, according to the predefined logic of the identified template and real time data event, Encoding module for generating and encoding video frames for pre-defined portion based polling the generated/updated visual materials, generating continuous audio streams/frames for predefined portion of the video based on polling generated or updated audible materials and Interactive Video Stream Management module for providing streamable data of the predefined video portion at the external entity side, while the subsequent portion of video is still in generation.

[0001] These, additional, and/or other aspects and/or advantages of the present invention are: set forth in the detailed description which follows; possibly inferable from the detailed description; and/or learnable by practice of the present invention.

BRIEF DESCRIPTION OF THE SCHEMATICS [0002] The present invention will be more readily understood from the detailed description of embodiments thereof made in conjunction with the accompanying drawings of which:

Figure 1 is a block diagram of real-time video generation process, according to some embodiments of the invention;

Figure 2 is a flowchart diagram of Video Generation API module, according to some embodiments of the invention;

Figure 3 is a flowchart diagram of Video Generation Management module, according to some embodiments of the invention;

Figure 4 is a flowchart diagram of Visual materials preparation, according to some embodiments of the invention;

Figure 4A is a flowchart diagram of Audible materials preparation module, according to some embodiments of the invention;

Figure 5 is a flowchart diagram of Audio Generation Module, according to some embodiments of the invention;

Figure 6 is a flowchart diagram of Video Rendering module, according to some embodiments of the invention;

Figure 7 is a flowchart diagram of Video encoding module, according to some embodiments of the invention;

Figure 8 is a flowchart diagram of Interactive VIDEO Stream Management module, according to some embodiments of the invention;

Figure 9 is a flowchart diagram of Interactive video module, according to some embodiments of the invention; and

Figure 10 is a flowchart diagram of User Interaction video module, according to some embodiments of the invention.

DETAILED DESCRIPTION OF THE VARIOUS MODUELS SCHEMATICS

[0001] Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description or illustrated in the drawings. The invention is applicable to other embodiments or of being practiced or carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein is for the purpose of description and should not be regarded as limiting.

[0002] Figure 1 is a block diagram of real-time video generation processing system, according to some embodiments of the invention. The system is implemented by at least one processor unit using different internal modules in communication with different user external module which can be implemented at personal processing device using such as computer, laptop or smart phones or at external servers. The system is comprised of video generation API computer module 200 for receiving remote entity 10 request such as computer terminal, server for video generation, the request is conveyed, to the video generation management computer module 300 for initiating and managing the video on the fly processing. Throughout the video generation and streaming, the interactive video module 900 polls user derived interaction interface 15 of personal computer device such as keyboard , mouse, touch screen etc. and content/event driven interaction 20 from any external information source such as network server where the content or event may include news website or remote sensors data and conveys polled information of the event/interaction data to the video generation module 300. Once the video process is initiated, the video generation is started, the visual and audible preparation modules (400,450) are invoked for preparing the material, based on the identified template and events types received from the interactive video module 900. The audio generation module 700 and the GPU video frame rendering module 500 poll the audio and visual data from the audio material preparations module (450) and the visual material preparations module(400), material is ready for creating audio stream and video frames to be conveyed to the encoding module 600. The encoding module creates an RTMP stream 650 based on generated video frames and audio sequence/stream, the stream is conveyed to the interactive video stream management module 800.

[0003] Figure 2 is a flowchart diagram of Video Generation API module, according to some embodiments of the invention. The video generation API module receives (210) a request such as Hyper Text Transfer Protocol (HTTP), that is containing a template identifier and required template input text from remote entity 10. According to the identified template the module, retrieves and validates input data based on custom template identifier and user profile data (220). At the next step the input data is translated into a valid canonical entity, formatted to be handled by various modules of the platform (230). The valid canonical entity is sent to a video generation- management module (240). Once the at least part of the video stream meta-data ( Unique ID ) is ready, it is conveyed back to the remote entity 10 that originated the video generation request(250).

[0004] Figure 3 is a flowchart diagram of Video Generation Management module, according to some embodiments of the invention. The video Generation Management module is launched upon the arrival of a new video generation request (310). The module coordinates the launching and/or operating of the video encoding module, Audio Generation module and the Video Frame GPU rendering module. The encoding module is kicked of once both an audio input stream is fed by the audio generation module and a video frame sequence is fed by the video frame GPU rendering module (320). The Audio Generation module is launched when polling new audible material input created by one or many executions of the audible material preparation module (330). The Video Frame GPU rendering module is launched once polling new visual material input created by one or many executions of the visual material preparation module (340). The Video Generation Management module receives input data from a new video generation request (350) or Data polled from interactive video module (360). Based on received new video request or new event data are launched the preparation modules (370). At first the preparation modules are launched when a new request is received, starting to generate all material items according to sequence order as defined by video template logic. Each time new event data arrives, the preparation modules are activated to update, choose or generate materials items, required according predefined logic in relation to received event type and information.

[0005] Figure 4 is a flowchart diagram of Visual materials preparation, according to some embodiments of the invention. Once launched by the Video Generation Management module, the module selects the parts of the video template to be incorporated into the final video, according to the predefined logic of the identified template (410) and generates visual materials, according to the predefined logic of the identified template and /or received input data (420). The predefined logic includes rules which define, in relation to received event data to determine, if and what new video materials or update of video material are required. The newly chosen or generated video material is added to video generation module input pipe including chronological material placement data (430). [0006] Figure 4A is a flowchart diagram of Audible materials preparation module, according to some embodiments of the invention. Once launched by the Video Generation Management module, the module selects and generates audible materials, according to the predefined logic of the identified template and /or received input data (410 A). The predefined logic include rules which define, in relation to received event data to determine, if and what new video materials or update of video material are required. The newly chosen/generated audible material is added to the audio generation module input pipe including chronological material placement data (420A).

[0007] Figure 5 is a flowchart diagram of Audio Generation Module, according to some embodiments of the invention. This module is launched by the Video Generation Management Module (510). The module repeatedly polls for input data created by a single or multiple executions of the audible material preparation module. Input data is processed according to it's chronological and programmatical order (520). At the next step the audio streams are refined, concatenated in proper gaps and ordering according to rules defined during the Audible materials preparation module (530). At the last step the Audio streams are then processed and mixed together to create a single, unified audio stream to be used as the video's audio stream (540) and piping audio stream to the video encoding module (550).

[0008] Figure 6 is a flowchart diagram of Video Rendering module, according to some embodiments of the invention. This module is launched by the Video Generation Management Module (610) and repeatedly polls for input data created by a single or multiple executions of the video material preparation module. Input data is processed according to it's chorological and programmaticall order (620). At the next step, the module incorporates visual materials into a set of video templates as determined during the Visual and Audible materials preparation module (630). The module's products are rendered video frames. Each rendered Video frame is piped to the video encoding modules according to its chronological order in the video (640).

[0009] Figure 7 is a flowchart diagram of Video encoding module, according to some embodiments of the invention. This module is launched by the Video generation management module (710), receiving video Frames generated by the GPU video frame rendering module (720) and audio Frames generated by the Audio generation module (730). Both video and audio frames sequences received by this module are encoded into a live RTMP stream (740). [0010] Figure 8 is a flowchart diagram of Interactive VIDEO Stream Management module, according to some embodiments of the invention. This module receives an RTMP stream from the video encoding module , uniquely identified in the rendering server (810), enabling the remote entity (10) to use the video stream meta data which is returned to it in order, generating an RTMP stream request , containing the unique ID of the stream to connect to (820).

[0011] Figure 9 is a flowchart diagram of Interactive video module, according to some embodiments of the invention. This module is continuously polled by the instance of the video generation management request for interactive/dynamic data events (910). Both User Driven events, such user selections or input data inserted by typing or voice and mechanized Interactive events such as sensor data or identified events external information sources such the Internet are transmitted to this module using a predefined HTTP REST API. Each event is associated with a specific video generation process/video stream (920). Each interactive event is identified by a unique event type, and processed by a matching interactive handling mechanism (930) not clear . The matching interactive handling mechanism match event characteristics such as environmental condition e.g. weather , news event to the type event. The processed events are formalized to canonical form that is consumed by the video generation management instance as part of its polling cycle (940). This is a stand-alone module which is process and manage interactive/dynamic data events and data extraction mechanisms based on identified event type. Each video generation management process can poll this module and receive real-time event data based on a pre-defined set of data extraction mechanisms. For example, data extraction mechanism may include checking current weather at specified location, current local time, news updates in specific subject, receiving data form environmental sensors or incoming data from the user interaction video module as described bellow.

[0012] Figure 10 is a flowchart diagram of User Interaction video module, according to some embodiments of the invention. This module an example of user interaction, while in interactive video is displayed: Halting the movie at pre-defined spots at predefined scene or prompting the user with interaction icon (1010), presenting user with a set of choices at pre-defined spots, each choice is relevant to specific type of information that the user may be interested in (1020), Receiving user's selected choice or active/ passive reaction or selecting default choice after predefined period of time and sending input event data to interactive module (1030), or analyzing continuously or at predefined spots user behavior by using sensors (e.g. camera microphone, acceleration, gyro, temp) and sending measurements results to interactive module when exceeding predefined thresholds.

[0013] The system of the present invention may include, according to certain embodiments of the invention, machine readable memory containing or otherwise storing a program of instructions which, when executed by the machine, implements some or all of the apparatus, methods, features and functionalities of the invention shown and described herein. Alternatively or in addition, the apparatus of the present invention may include, according to certain embodiments of the invention, a program as above which may be written in any conventional programming language, and optionally a machine for executing the program such as but not limited to a general purpose computer which may optionally be configured or activated in accordance with the teachings of the present invention. Any of the teachings incorporated herein may wherever suitable operate on signals representative of physical objects or substances.

[0014] Unless specifically stated otherwise, as apparent from the following discussions, it is appreciated that throughout the specification discussions, utilizing terms such as, "processing", "computing", "estimating", "selecting", "ranking", "grading", "calculating", "determining", "generating", "reassessing", "classifying", "generating", "producing", "stereo-matching", "registering", "detecting", "associating", "superimposing", "obtaining" or the like, refer to the action and/or processes of a computer or computing system, or processor or similar electronic computing device, that manipulate and/or transform data represented as physical, such as electronic, quantities within the computing system's registers and/or memories, into other data similarly represented as physical quantities within the computing system's memories, registers or other such information storage, transmission or display devices. The term "computer" should be broadly construed to cover any kind of electronic device with data processing capabilities, including, by way of non-limiting example, personal computers, servers, computing system, communication devices, processors (e.g. digital signal processor (DSP), microcontrollers, field programmable gate array (FPGA), application specific integrated circuit (ASIC), etc.) and other electronic computing devices. [0015] The present invention may be described, merely for clarity, in terms of terminology specific to particular programming languages, operating systems, browsers, system versions, individual products, and the like. It will be appreciated that this terminology is intended to convey general principles of operation clearly and briefly, by way of example, and is not intended to limit the scope of the invention to any particular programming language, operating system, browser, system version, or individual product.

[0016] It is appreciated that software components of the present invention including programs and data may, if desired, be implemented in ROM (read only memory) form including CD-ROMs, EPROMs and EEPROMs, or may be stored in any other suitable typically non-transitory computer-readable medium such as but not limited to disks of various kinds, cards of various kinds and RAMs. Components described herein as software may, alternatively, be implemented wholly or partly in hardware, if desired, using conventional techniques. Conversely, components described herein as hardware may, alternatively, be implemented wholly or partly in software, if desired, using conventional techniques.

[0017] Included in the scope of the present invention, inter alia, are electromagnetic signals carrying computer-readable instructions for performing any or all of the steps of any of the methods shown and described herein, in any suitable order; machine- readable instructions for performing any or all of the steps of any of the methods shown and described herein, in any suitable order; program storage devices readable by machine, tangibly embodying a program of instructions executable by the machine to perform any or all of the steps of any of the methods shown and described herein, in any suitable order; a computer program product comprising a computer useable medium having computer readable program code, such as executable code, having embodied therein, and/or including computer readable program code for performing, any or all of the steps of any of the methods shown and described herein, in any suitable order; any technical effects brought about by any or all of the steps of any of the methods shown and described herein, when performed in any suitable order; any suitable apparatus or device or combination of such, programmed to perform, alone or in combination, any or all of the steps of any of the methods shown and described herein, in any suitable order; electronic devices each including a processor and a cooperating input device and/or output device and operative to perform in software any steps shown and described herein; information storage devices or physical records, such as disks or hard drives, causing a computer or other device to be configured so as to carry out any or all of the steps of any of the methods shown and described herein, in any suitable order; a program pre-stored e.g. in memory or on an information network such as the Internet, before or after being downloaded, which embodies any or all of the steps of any of the methods shown and described herein, in any suitable order, and the method of uploading or downloading such, and a system including server/s and/or client/s for using such; and hardware which performs any or all of the steps of any of the methods shown and described herein, in any suitable order, either alone or in conjunction with software. Any computer-readable or machine-readable media described herein is intended to include non-transitory computer- or machine-readable media.

[0018] Any computations or other forms of analysis described herein may be performed by a suitable computerized method. Any step described herein may be computer-implemented. The invention shown and described herein may include (a) using a computerized method to identify a solution to any of the problems or for any of the objectives described herein, the solution optionally include at least one of a decision, an action, a product, a service or any other information described herein that impacts, in a positive manner, a problem or objectives described herein; and (b) outputting the solution.

[0019] The scope of the present invention is not limited to structures and functions specifically described herein and is also intended to include devices which have the capacity to yield a structure, or perform a function, described herein, such that even though users of the device may not use the capacity, they are, if they so desire, able to modify the device to obtain the structure or function.

[0020] Features of the present invention which are described in the context of separate embodiments may also be provided in combination in a single embodiment.

[0021] For example, a system embodiment is intended to include a corresponding process embodiment. Also, each system embodiment is intended to include a server- centered "view" or client centered "view", or "view" from any other node of the system, of the entire functionality of the system, computer-readable medium, apparatus, including only those functionalities performed at that server or client or node.