Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ANNOTATING NOTES FROM PASSIVE RECORDING WITH CATEGORIES
Document Type and Number:
WIPO Patent Application WO/2017/003973
Kind Code:
A1
Abstract:
Systems and methods, and computer readable media bearing instructions for carrying out methods of capturing notes from passive recording of an ongoing content stream and annotating the note with a category are presented. Passive recording comprises temporarily recording the most recent content of the ongoing content stream. An ongoing content stream is passively recorded in a passive recording buffer. The passive recording buffer is configured to store a limited amount of recorded content corresponding to the most recently recorded content of the ongoing content stream. Upon indication by the user, a note is generated from the recorded content in the passive recording, annotated with a category, and stored in a note file for the user.

Inventors:
LIU, Jie (Microsoft Technology Licensing, LlcAttn: Patent Group Docketing ,One Microsoft Wa, Redmond Washington, 98052-6399, US)
DALAL, Mayuresh P. (Microsoft Technology Licensing, LlcAttn: Patent Group Docketing ,One Microsoft Wa, Redmond Washington, 98052-6399, US)
GABOR, Michal (Microsoft Technology Licensing, LlcAttn: Patent Group Docketing ,One Microsoft Wa, Redmond Washington, 98052-6399, US)
PRAJAPATI, Gaurang (Microsoft Technology Licensing, LlcAttn: Patent Group Docketing ,One Microsoft Wa, Redmond Washington, 98052-6399, US)
Application Number:
US2016/039695
Publication Date:
January 05, 2017
Filing Date:
June 28, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
MICROSOFT TECHNOLOGY LICENSING, LLC (Attn: Patent Group Docketing, One Microsoft WayRedmond, Washington, 98052-6399, US)
International Classes:
G11B27/034; G10L15/26; G11B27/11
Domestic Patent References:
2014-03-20
Foreign References:
US8909022B12014-12-09
US20130018654A12013-01-17
US20130329868A12013-12-12
US20140278377A12014-09-18
US20090037171A12009-02-05
US20120245936A12012-09-27
US20130204618A12013-08-08
Other References:
None
Attorney, Agent or Firm:
MINHAS, Sandip et al. (Microsoft Corporation, Attn: Patent Group Docketing One Microsoft Wa, Redmond Washington, 98052-6399, US)
Download PDF:
Claims:
CLAIMS

1. A computer-implemented method conducted on a user' s computing device,

comprising at least a processor and a memory, for generating notes from an ongoing content stream, the method comprising:

initiating passive recording of an ongoing content stream, the passive recording storing recorded content of the ongoing content stream in a passive recording buffer;

receiving a user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream;

generating a note of the ongoing content stream from the recorded content of the passive recording and storing the note in a note file;

receiving an indication to associate a category with the generated note from the user; and

annotating the generated note with the category in the note file.

2. The computer-implemented method of Claim 1, wherein the indication to associate the category with the generated note from the user is received from the user after the note has been generated and stored in the note file.

3. The computer-implemented method of Claim 1, wherein the indication to associate the category with the generated note from the user is automatically applied as a function of generating the note based on the recorded content of the passive recording of the ongoing content stream.

4. The computer-implemented method of Claim 3, wherein the indication to associate the category with the generated note from the user is included as part of the user indication to generate the note based on the recorded content of the passive recording of the ongoing content stream.

5. The computer-implemented method of Claim 1, wherein the category associated with the generated note is one of a plurality of predetermined categories for annotating a generated note.

6. The computer-implemented method of Claim 5, wherein the plurality of

predetermined categories for annotating a generated note are user configurable.

7. The computer-implemented method of Claim 1 further comprising:

receiving an indication to associate a plurality of categories with the generated note from the user; and

annotating the generated note with the plurality of categories in the note file.

8. The computer-implemented method of Claim 1, wherein the generated note is stored in the note file in association with a meeting corresponding to the ongoing content stream.

9. The computer-implemented method of Claim 8, wherein the generated note is stored in the note file as one of a plurality of notes associated with the meeting corresponding to the ongoing content stream.

10. A computer-readable medium bearing computer-executable instructions which, when executed on a computing system comprising at least a processor, carry out a method for generating notes annotated with a category from an ongoing content stream, the method comprising and one of the methods set forth in regard to Claims 1-9.

11. A computing device for generating annotated notes from an ongoing content

stream, the computing device comprising a processor and a memory, wherein the processor executes instructions stored in the memory as part of or in conjunction with additional components to generate notes from an ongoing content stream, the additional components comprising:

a passive recording buffer, the passive recording buffer configured to temporarily store a predetermined amount of recorded content of an ongoing content stream;

an audio recording component, the audio recording component being configured to generate recorded content of the ongoing content stream;

a passive recording component, the passive recording component being configured to obtain recorded content of the ongoing content stream from the audio recording component and store the recorded content to the passive recording buffer;

an annotation component, the annotation component being configured to annotate a generated note with a category; and

a note generator component, the note generator component being configured to initiate a passive recording process via the passive recording component, receive an indication from the user, via a user interface component, to capture recorded content of the ongoing content stream, generate a note from the captured recorded content, annotate the generated note with a category via the annotation component, and store generated note in a note file in a data store.

12. The computing device of Claim 11, wherein the indication from the user to capture recorded content of the ongoing content stream includes an indication to annotate the generated note with a category.

13. The computing device of Claim 11, wherein the generated note is stored in the note file in the data store in association with a meeting corresponding to the ongoing content stream.

14. The computing device of Claim 11, wherein the generated note is stored in the note file in the data store as one of a plurality of notes in association with the meeting corresponding to the ongoing content stream.

Description:
ANNOTATING NOTES FROM PASSIVE RECORDING WITH CATEGORIES

BACKGROUND

[0001] As most everyone will appreciate, it is very difficult to take handwritten notes while actively participating in an on-going conversation or lecture, whether or not one is simply listening or activity conversing with others. At best, the conversation becomes choppy as the note-taker must pause in the conversation (or in listening to the

conversation) to commit salient points of the conversation to notes. Quite often, the note taker misses information (which may or may not be important) while writing down notes of a previous point. Typing one's notes does not change the fact that the conversation becomes choppy or the note taker (in typing the notes) will miss a portion of the conversation.

[0002] Recording an entire conversation and subsequently replaying and capturing notes during the replay, with the ability to pause the replay while the note taker captures information to notes, is one alternative. Unfortunately, this requires that the note taker invests the time to re-listen to the entire conversation to capture relevant points to notes.

[0003] Most people don't have an audio recorder per se, but often possess a mobile device that has the capability to record audio. While new mobile devices are constantly updated with more computing capability and storage, creating an audio recording of a typical lecture would consume significant storage resources.

SUMMARY

[0004] The following Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. The

Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.

[0005] Systems and methods, and computer-readable media bearing instructions for carrying out methods of capturing notes from passive recording of an ongoing content stream and annotating the note with a category are presented. Passive recording comprises temporarily recording the most recent content of the ongoing content stream. An ongoing content stream is passively recorded in a passive recording buffer. The passive recording buffer is configured to store a limited amount of recorded content corresponding to the most recently recorded content of the ongoing content stream. Upon indication by the user, a note is generated from the recorded content in the passive recording, annotated with a category, and stored in a note file for the user.

[0006] According to additional aspects of the disclosed subject matter, a

computer-implemented method conducted on a user's computing device for generating notes from an ongoing content stream is presented. The method comprises initiating passive recording of an ongoing content stream, where the passive recording stores recorded content of the ongoing content stream in a passive recording buffer. A user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream is then received. In response to receiving the user indication to generate a note, a note is generated from the recorded content of the passive recording and the note is the stored in a note file. An indication to associate a category with the generated note is received from the user and the generated note is annotated with the category in the note file.

[0007] According to further aspects of the disclosed subject matter, a computer-readable medium bearing computer-executable instructions is presented. When the

computer-executable instructions are executed on a computing system comprising at least a processor, the execution carries out a method for generating notes annotated with a category from an ongoing content stream. The method comprises at least initiating passive recording of the ongoing content stream, where the passive recording is stored in a passive recording buffer. A user indication to generate a note based on the recorded content of the passive recording of the ongoing content stream is received. A user indication to associate a category of a plurality of predetermined categories with the generated note is also received. A note is generated from the ongoing content stream from the recorded content of the passive recording. The generated note is then annotated with the category in the note file and the annotated note is stored in a note file in association with an event corresponding to the ongoing content stream.

[0008] According to still further aspects of the disclosed subject matter, a computing device for generating annotated notes from an ongoing content stream is presented. The computing device comprises a processor and a memory, where the processor executes instructions stored in the memory as part of or in conjunction with additional components to generate notes from an ongoing content stream. These additional components include at least a passive recording buffer, an audio recording component, a passive recording component, an annotation component, and a note generator component. In operation, the audio recording component records content of the ongoing content stream and the passive recording component obtains the recorded content of the ongoing content stream from the audio recording component and stores the recorded content to the passive recording buffer. The annotation component is configured to annotate a generated note with a category in conjunction with the note generator component. The note generator component initiates a passive recording process via the passive recording component. Further, the note generator component receives an indication from the user, via a user interface component, to capture the recorded content of the ongoing content stream, generate a note from the captured recorded content, annotate the generated note with a category via the annotation component, and store generated note in a note file in a data store.

BRIEF DESCRIPTION OF THE DRAWINGS

[0009] The foregoing aspects and many of the attendant advantages of the disclosed subject matter will become more readily appreciated as they are better understood by reference to the following description when taken in conjunction with the following drawings, wherein:

[0010] Figure 1 A illustrate an exemplary audio stream (i.e., ongoing audio conditions) with regard to a time line, and further illustrates various the ongoing passive recording of the audio stream into an exemplary passive recording buffer;

[0011] Figure IB illustrates component with regard to an alternative implementation (to that of Figure 1 A) in conducting the ongoing passive recording of an audio stream into a passive recording buffer;

[0012] Figure 2 is a flow diagram illustrating an exemplary routine for generating notes of the most recently portion of the ongoing content stream;

[0013] Figure 3 is a flow diagram illustrating an exemplary routine for generating notes of the most recently portion of the ongoing content stream, and for continued capture until indicated by a user;

[0014] Figure 4 is a block diagram illustrating exemplary components of a suitably configured computing device for implementing aspects of the disclosed subject matter;

[0015] Figure 5 is a pictorial diagram illustrating an exemplary network environment suitable for implementing aspects of the disclosed subject matter;

[0016] Figure 6 illustrates a typical home screen as presented by an app (or application) executing on a suitably configured computing device;

[0017] Figure 7 illustrates the exemplary computing device of Figure 6 after the user has interacted with the "add meeting" control; [0018] Figure 8 illustrates the exemplary computing device of Figure 6 after the user has switched to the presentation of categories as user-actionable controls for capturing the content of the passive recording buffer into a note in the note file and associating the corresponding category with the note as an annotation;

[0019] Figure 9 illustrates the exemplary computing device showing the notes associated with "Meeting 4"; and

[0020] Figure 10 illustrates an exemplary routine for generating notes of the most recently portion of the ongoing content stream, for continued capture until indicated by a user, and for annotating the captured note with a predetermined category or tag.

DETAILED DESCRIPTION

[0021] For purposes of clarity, the term "exemplary," as used in this document, should be interpreted as serving as an illustration or example of something, and it should not be interpreted as an ideal and/or a leading illustration of that thing.

[0022] For purposes of clarity and definition, the term "content stream" or "ongoing content stream" should be interpreted as being an ongoing occasion in which audio and/or audio visual content can be sensed and recorded. Examples of an ongoing content stream include, by way of illustration and not limitation: a conversation; a lecture; a monologue; a presentation of a recorded occasion; and the like. In addition to detecting a content stream via audio and/or audio/visual sensors or components, according to various embodiments the ongoing content stream may correspond to a digitized content stream which is being receives, as a digital stream, by the user's computing device.

[0023] The term "passive recording" refers to an ongoing recording of a content stream. Typically, the content stream corresponds to ongoing, current audio or audio/visual conditions as may be detected by condition sensing device such as, by way of illustration, a microphone. For purposed of simplicity of this disclosure, the description will generally be made in regard to passively recording audio content. However, in various

embodiments, the ongoing recording may also include both visual content with the audio content, as may be detected by an audio/video capture device (or devices) such as, by way of illustration, a video camera with a microphone, or by both a video camera and a microphone. The ongoing recording is "passive" in that a recording of the content stream is only temporarily made; any passively recorded content is overwritten with more recent content of the content stream after a predetermined amount of time. In this regard, the purpose of the passive recording is not to generate an audio or audio/visual recording of the content stream for the user, but to temporarily store the most recently recorded content in the event that, upon direction by a person, a transcription to text of the most recently recorded content may be made and stored as a note for the user.

[0024] In passively recording the current conditions, e.g., the audio and/or audio/visual conditions, the recently recorded content is placed in a "passive recording buffer." In operation, the passive recording buffer is a memory buffer in a host computing device configured to hold a limited, predetermined amount of recently recorded content. For example, in operation the passive recording buffer may be configured to store a recording of the most recent minute of the ongoing audio (or audio/visual) conditions as captured by the recording components of the host computing device. To further illustrate aspects of the disclosed subject matter, particularly in regard to passive recording and the passive recording buffer, reference is made to Figure 1.

[0025] Figure 1 illustrate an exemplary audio stream 102 (i.e., ongoing audio conditions) with regard to a time line 100, and further illustrates various the ongoing passive recording of the audio stream into an exemplary passive recording buffer. According to various embodiments of the disclosed subject matter and shown in Figure 1, the time (as indicated by time line 100) corresponding to the ongoing audio stream 102 may be broken up according to time segments, as illustrated by time segments tso - tss. While the time segments may be determined according to implementation details, in one non-limiting example the time segment corresponds to 15 seconds. Correspondingly, the passive recording buffer, such as passive recording buffer 102, may be configured such that it can store a predetermined amount of recently recorded content, where the predetermined amount corresponds to a multiple of the amount of recently recorded content that is recorded during a single time segment. As illustratively shown in Figure 1, the passive recording buffer 102 is configured to hold an amount of the most recently recorded content corresponding to 4 time segments though, as indicted about, this number may be determined according to implementation details and/or according to user preferences.

[0026] Conceptually, and by way of illustration and example, with the passive recording buffer 102 configure to temporarily store recently recorded content corresponding to 4 time segments, the passive recording buffer 102 at the beginning of time segment ts 4 will include the recently recorded content from time segments tso-ts 3 , as illustrated by passive recording buffer 104. Similarly, the passive recording buffer 102, at the start of time period tss, will include the recently recorded content from time segments tsi-tS4, and so forth as illustrated in passive recording buffers 106-112. [0027] In regard to implementation details, when the recently recorded content is managed according to time segments of content, as described above, the passive recording buffer can implemented as a circular queue in which the oldest time segment of recorded content is overwritten as a new time segment begins. Of course, when the passive recording buffer 102 is implemented as a collection of segments of content (corresponding to time segments), the point at which a user provides an instruction to transcribe the contents of the passive recording buffer will not always coincide with a time segment. Accordingly, an implementation detail, or a user configuration detail, can be made such that recently recorded content of at least a predetermined amount of time is always captured. In this embodiment, if the user (or implementer) wishes to record at least 4 time segments of content, the passive recording buffer may be configured to hold 5 time segments worth of recently recorded content.

[0028] While the discussion above in regard to Figure 1 A is made in regard to capturing recently recording content along time segments, it should be appreciated that this is one manner in which the content may be passively recorded. Those skilled in the art will appreciate that there are other implementation methods in this an audio or audio/visual stream may be passively recorded. Indeed, in an alternative embodiment as shown in Figure IB, the passive recording buffer is configured to a size sufficient to contain a predetermined maximum amount of passively recorded content (as recorded in various frames) according to time. For example, if the maximum amount (in time) of passively recorded content is 2 minutes, then the passive recording buffer is configured to retain a sufficient number of frames, such as frames 160-164, which collectively correspond to 2 minutes. Thus, as new frames are received (in the on-going passive recording), older frames whose content falls outside of the preceding amount of time for passive recording will be discarded. In reference to passive buffer TO, assuming that the preceding amount of time to passively record is captured in 9 frames (as shown in passive buffer TO), when a new frame 165 is received, it is stored in the passive buffer and the oldest frame 160 is discarded, as shown in passive buffer Tl .

[0029] While the passive recording buffer may be configured to hold a predetermined maximum amount of recorded content, independent of the maximum amount that a passive recording buffer can contain and according to various embodiments of the disclosed subject matter, a computer user may configure the amount of recent captured content to be transcribed and placed as a note in a note file - of course, constrained by the maximum amount of content (in regard to time) that the passive recording buffer can contain. For example, while the maximum amount (according to time) of passively recorded content that passive recording buffer may contain may be 2 minutes, in various embodiments the user is permitted to configure the length (in time) of passive recorded content to be converted to a note, such as the prior 60 seconds of content, the prior 2 minutes, etc. In this regard, the user configuration as to the length of the audio or audio/visual content stream to be transcribed and stored as a note in a note file (upon user instruction), is independent of the passive recording buffer size (except for the upper limit of content that can be stored in the buffer.) Further, while the example above suggests that the passive recording buffer may contain up to 2 minutes of content, this is merely illustrative and should not be construed as limiting upon the disclosed subject matter. Indeed, in various alternative, non-limiting embodiments, the passive recording buffer may be configured to hold up to any one of 5 minutes of recorded content, 3 minutes of recorded content, 90 seconds of recorded content, etc. Further, the size of the passive recording buffer may be dynamically determined, adjusted as needed according to user configuration as to the length of audio content to be converted to a note in a note file.

[0030] Rather than converting the frames (160-165) into an audio stream at the time that the frames are received and stored in the passive buffer, the frames are simply stored in the passive buffer according to their sequence in time. By not processing the frames as they are received but, instead, processing the frames into an audio stream suitable for transcription (as will be described below), significant processing resources may be conserved. However, upon receiving an indication that the content in the passive buffer is to be transcribed into a note, the frames are merged together into an audio (or

audio/visual) stream that may be processed by a transcription component or service.

[0031] As shown with regard to Figures 1 A and IB, there may be any number of implementations of a passive buffer, and the disclosed subject matter should be viewed as being equally applicable to these implementations. Indeed, irrespective of the manner in which a passive buffer is implemented, what is important is that a predetermined period of preceding content is retained and available for transcription at the direction of the person using the system.

[0032] As briefly discussed above, with an ongoing audio stream (or audio/visual stream) being passively recorded, a person (i.e., user of the disclosed subject matter on a computing device) can cause that the most recently recorded content of the ongoing stream be transcribed to text and the transcription recorded in a notes file. Figure 2 is a flow diagram illustrating an exemplary routine 200 for generating notes, i.e., a textual transcription of the recently recorded content, of the most recently portion of the ongoing audio stream. Beginning at block 202, a passive recording process of the ongoing audio stream is commenced. As should be understood, this passive recording is an ongoing process and continues recording the ongoing audio (or audio/visual) stream (i.e., the content stream) until specifically terminated at the direction of the user, irrespective of other steps/activities that are taken with regard to routine 200. With regard to the format of the recorded content by the passive recording process, it should be appreciated that any suitable format may be used including, by way of illustration and not limitation, MP3 (MPEG-2 audio layer III), AVI (Audio Video Interleave), AAC (Advanced Audio Coding), WMA (Windows Media Audio), WAV (Waveform Audio File Format), and the like. Typically though not exclusively, the format of the recently recorded content is a function of the codec (coder/decoder) that is used to convert the audio content to a file format.

[0033] At block 204, with the passive recording of the content stream ongoing, the routine 200 awaits a user instruction. After receiving a user instruction, at decision block 206 a determination is made as to whether the user instruction is in regard to generating notes (from the recorded content in the passive recording buffer 102) or in regard to terminating the routine 200. If the instruction is in regard to generating a note, at block 208 the recently recorded content in the passive recording buffer is captured. In implementation, typically capturing the recently recorded content in the passive recording buffer comprises copying the recently recorded content from the passive recording buffer into another temporary buffer. Also, to the extent that the content in the passive recording buffer is maintained as frames, the frames are merged into an audio stream (or audio/visual stream) into the temporary buffer. This copying is done such that the recently recorded content can be transcribed without impacting the passive recording of the ongoing audio stream such that information/content of the ongoing content stream is continuously recorded.

[0034] At block 210, after capturing the recently recorded content in the passive recording buffer, the captured recorded content is transcribed to text. According to aspects of the disclosed subject matter, the captured recorded content may be transcribed by executable transcription components (comprising hardware and/or software components) on the user's computing device (i.e., the same device implementing routine 200). Alternatively, a transcription component may transmit the captured recorded content to an online transcription service and, in return, receive a textual transcription of the captured recorded content. As additional alternatives, the captured recorded content may be temporarily stored for future transcription, e.g., storing the captured recorded content for subsequent uploading to a computing device with sufficient capability to transcribe the content, or storing the captured recorded content until a network communication can be established to obtain a transcription from an online transcription service.

[0035] At block 212, the transcription is saved as a note in a note file. In addition to the text transcription of the captured recorded content, additional information may be stored with the note in the note file. Information such as the date and time of the captured recorded content may be stored with or as part of the note in the note file. A relative time (relative to the start of routine 200) may be stored with or as part of the note in the note file. Contextual information, such as meeting information, GPS location data, user information, and the like can be stored with or as part of the note in the note file. After generating the note and storing it in the note file, the routine 200 returns to block 204 to await additional instructions.

[0036] At some point, at decision block 206, the user instruction/action may be in regard to terminating the routine 200. Correspondingly, the routine 200 proceeds to block 214 where the passive recording of the ongoing audio (or audio/visual) stream is terminated, and the routine 200 terminates.

[0037] Often, an interesting portion of an ongoing conversation/stream may be detected and the user will wish to not only capture notes regarding the most recent time period, but continue to capture the content in an ongoing manner. The disclosed subject matter may be suitably and advantageously implemented to continue capturing the content (for transcription to a text-based note) as described in regard to Figure 3. Figure 3 is a flow diagram illustrating an exemplary routine 300 for generating notes of the most recently portion of the ongoing content stream, and for continued capture until indicated by a user. As will be seen, many aspects of routine 200 and routine 300 are the same.

[0038] Beginning at block 302, a passive recording process of the ongoing audio stream is commenced. As indicated above in regard to routine 200, this passive recording process is an ongoing process and continues recording the ongoing content stream until specifically terminated, irrespective of other steps/activities that are taken with regard to routine 300. With regard to the format of the recently recorded content, it should be appreciated that any suitable format may be used including, by way of illustration and not limitation, MP3 (MPEG-2 audio layer III), AVI (Audio Video Interleave), AAC (Advanced Audio Coding), WMA (Windows Media Audio), WAV (Waveform Audio File Format), and the like. [0039] At block 304, with the passive recording ongoing, the routine 300 awaits a user instruction. After receiving a user instruction, at decision block 306 a determination is made as to whether the user instruction is in regard to generating a note (from the recorded content in the passive recording buffer 102) or in regard to ending the routine 300. If the user instruction is in regard to generating a note, at block 308 the recently recorded content in the passive recording buffer is captured. In addition to capturing the recorded content from the passive recording buffer, at decision block 310 a determination is made in regard to whether the user has indicated that the routine 300 should continue capturing the ongoing audio stream for transcription as an expanded note. If the determination is made that the user has not indicated that the routine 300 should continue capturing the ongoing audio stream, the routine proceeds to block 316 as described below. However, if the user has indicated that the routine 300 should continue capturing the ongoing audio stream as part of an expanded note, the routine proceeds to block 312.

[0040] At block 312, without interrupting the passive recording process, the ongoing recording of the ongoing content stream to the passive recording buffer is continually captured as part of expanded captured recorded content, where the expanded captured recorded content is, thus, greater than the amount of recorded content that can be stored in the passive recording buffer. At block 314, this continued capture of the content stream continues until an indication from the user is received to release or terminate the continued capture. At block 316, after capturing the recently recorded content in the passive recording buffer and any additional content as indicated by the user, the captured recorded content is transcribed to text. As mentioned above in regard to routine 200 of Figure 2, the captured recorded content may be transcribed by executable transcription components (comprising hardware and/or software components) on the user's computing device.

Alternatively, a transcription component may transmit the captured recorded content to an online transcription service and, in return, receive a textual transcription of the captured recorded content. As additional alternatives, the captured recorded content may be temporarily stored for future transcription, e.g., storing the captured recorded content for subsequent uploading to a computing device with sufficient capability to transcribe the content, or storing the captured recorded content until a network communication can be established to obtain a transcription from an online transcription service.

[0041] At block 318, the transcription is saved as a note in a note file, i.e., a data file comprising at least one or more text notes. In addition to the text transcription of the captured recorded content, additional information may be stored with the note in the note file. Information such as the date and time of the captured recorded content may be stored with or as part of the note in the note file. A relative time (relative to the start of routine 200) may be stored with or as part of the note in the note file. Contextual information, such as meeting information, GPS location data, user information, and the like can be stored with or as part of the note in the note file. After generating the note and storing it in the note file, the routine 300 returns to block 304 to await additional instructions.

[0042] As mentioned above, at decision block 306, the user instruction/action may be in regard to terminating the routine 300. In this condition, the routine 300 proceeds to block 320 where the passive recording of the ongoing audio (or audio/visual) stream is terminated, and thereafter the routine 300 terminates.

[0043] Regarding routines 200 and 300 described above, as well as routine 1000 and other processes describe herein, while these routines/processes are expressed in regard to discrete steps, these steps should be viewed as being logical in nature and may or may not correspond to any actual and/or discrete steps of a particular implementation. Also, the order in which these steps are presented in the various routines and processes, unless otherwise indicated, should not be construed as the only order in which the steps may be carried out. In some instances, some of these steps may be omitted. Those skilled in the art will recognize that the logical presentation of steps is sufficiently instructive to carry out aspects of the claimed subject matter irrespective of any particular language in which the logical instructions/steps are embodied.

[0044] Of course, while these routines include various novel features of the disclosed subject matter, other steps (not listed) may also be carried out in the execution of the subject matter set forth in these routines. Those skilled in the art will appreciate that the logical steps of these routines may be combined together or be comprised of multiple steps. Steps of the above-described routines may be carried out in parallel or in series. Often, but not exclusively, the functionality of the various routines is embodied in software (e.g., applications, system services, libraries, and the like) that is executed on one or more processors of computing devices, such as the computing device described in regard Figure 4 below. Additionally, in various embodiments all or some of the various routines may also be embodied in executable hardware modules including, but not limited to, system on chips, codecs, specially designed processors and or logic circuits, and the like on a computer system.

[0045] These routines/processes are typically embodied within executable code modules comprising routines, functions, looping structures, selectors such as if-then and if-then- else statements, assignments, arithmetic computations, and the like. However, the exact implementation in executable statement of each of the routines is based on various implementation configurations and decisions, including programming languages, compilers, target processors, operating environments, and the linking or binding operation. Those skilled in the art will readily appreciate that the logical steps identified in these routines may be implemented in any number of ways and, thus, the logical descriptions set forth above are sufficiently enabling to achieve similar results.

[0046] While many novel aspects of the disclosed subject matter are expressed in routines embodied within applications (also referred to as computer programs), apps (small, generally single or narrow purposed, applications), and/or methods, these aspects may also be embodied as computer-executable instructions stored by computer-readable media, also referred to as computer-readable storage media, which are articles of manufacture. As those skilled in the art will recognize, computer-readable media can host, store and/or reproduce computer-executable instructions and data for later retrieval and/or execution. When the computer-executable instructions that are hosted or stored on the

computer-readable storage devices are executed, the execution thereof causes, configures and/or adapts the executing computing device to carry out various steps, methods and/or functionality, including those steps, methods, and routines described above in regard to the various illustrated routines. Examples of computer-readable media include, but are not limited to: optical storage media such as Blu-ray discs, digital video discs (DVDs), compact discs (CDs), optical disc cartridges, and the like; magnetic storage media including hard disk drives, floppy disks, magnetic tape, and the like; memory storage devices such as random access memory (RAM), read-only memory (ROM), memory cards, thumb drives, and the like; cloud storage (i.e., an online storage service); and the like. While computer-readable media may deliver the computer-executable instructions (and data) to a computing device for execution via various transmission means and mediums including carrier waves and/or propagated signals, for purposes of this disclosure computer readable media expressly excludes carrier waves and/or propagated signals.

[0047] Advantageously, many of the benefits of the disclosed subject matter can be conducted on computing devices with limited computing capacity and/or storage capabilities. Further still, many of the benefits of the disclosed subject matter can be conducted on computing devices of limited computing capacity, storage capabilities as well as network connectivity. Indeed, suitable computing devices suitable for

implementing the disclosed subject matter include, by way of illustration and not limitation: mobile phones; tablet computers; "phablet" computing devices (the hybrid mobile phone/tablet devices); personal digital assistants; laptop computers; desktop computers; and the like.

[0048] Regarding the various computing devices upon which aspects of the disclosed subject matter may be implemented, Figure 4 is a block diagram illustrating exemplary components of a suitably configured computing device 400 for implementing aspects of the disclosed subject matter. The exemplary computing device 400 includes one or more processors (or processing units), such as processor 402, and a memory 404. The processor 402 and memory 404, as well as other components, are interconnected by way of a system bus 410. The memory 404 typically (but not always) comprises both volatile memory 406 and non-volatile memory 408. Volatile memory 406 retains or stores information so long as the memory is supplied with power. In contrast, non-volatile memory 408 is capable of storing (or persisting) information even when a power supply is not available. Generally speaking, RAM and CPU cache memory are examples of volatile memory 406 whereas ROM, solid-state memory devices, memory storage devices, and/or memory cards are examples of non-volatile memory 408. Also illustrated as part of the memory 404 is a passive recording buffer 414. While shown as being separate from both volatile memory 406 and non-volatile memory 408, this distinction is for illustration purposes in identifying that the memory 404 includes (either as volatile memory or non-volatile memory) the passive recording buffer 414.

[0049] Further still, the illustrated computing device 400 includes a network

communication component 412 for interconnecting this computing device with other devices over a computer network, optionally including an online transcription service as discussed above. The network communication component 412, sometimes referred to as a network interface card or NIC, communicates over a network using one or more communication protocols via a physical/tangible (e.g., wired, optical, etc.) connection, a wireless connection, or both. As will be readily appreciated by those skilled in the art, a network communication component, such as network communication component 412, is typically comprised of hardware and/or firmware components (and may also include or comprise executable software components) that transmit and receive digital and/or analog signals over a transmission medium (i.e., the network.)

[0050] The processor 402 executes instructions retrieved from the memory 404 (and/or from computer-readable media) in carrying out various functions, particularly in regard to responding to passively recording an ongoing audio or audio/visual stream and generating notes from the passive recordings, as discussed and described above. The processor 401 may be comprised of any of a number of available processors such as single-processor, multi-processor, single-core units, and multi-core units.

[0051] The exemplary computing device 400 further includes an audio recording component 420. Alternatively, not shown, the exemplary computing device 400 may be configured to include an audio/visual recording component, or both an audio recording component and a visual recording component, as discussed above. The audio recording component 420 is typically comprised of an audio sensing device, such as a microphone, as well as executable hardware and software, such as a hardware and/or software codec, for converting the sensed audio content into recently recorded content in the passive recording buffer 414. The passive recording component 426 utilizes the audio recording component 420 to capture audio content to the passive recording bugger, as described above in regard to routines 200 and 300. A note generator component 428 operates at the direction of the computing device user (typically through one or more user interface controls in the user interface component 422) to passively capture content of an ongoing audio (or audio/visual) stream, and to further generate one or more notes from the recently recorded content in the passive recording buffer 414, as described above. As indicated above, the note generator component 428 may take advantage of an optional transcription component 424 of the computing device 400 to transcribe the captured recorded content from the passive recording buffer 414 into a textual representation for saving in a note file 434 (of a plurality of note files) that is stored in a data store 430. Alternatively, the note generator component 428 may transmit the captured recorded content of the passive recording buffer 414 to an online transcription service over a network via the network communication component 412, or upload the captured audio content 432, temporarily stored in the data store 430, to a more capable computing device when connectivity is available.

[0052] An annotation component is configured to annotate a generated note with one or more annotations, i.e., one or more categories of a plurality of predetermined categories. Annotating a generated note is described in greater detail below in regard to Figures 6-9 and 10.

[0053] Regarding the data store 430, while the data store may comprise a hard drive and/or a solid state drive separately accessible from the memory 404 typically used on the computing device 400, as illustrated, in fact this distinction may be simply a logical distinction. In various embodiments, the data store is a part of the non-volatile memory 408 of the computing device 400. Additionally, while the data store 430 is indicated as being a part of the computing device 400, in an alternative embodiment the data store may be implemented as a cloud-based storage service accessible to the computing device over a network (via the network communication component 412).

[0054] Regarding the various components of the exemplary computing device 400, those skilled in the art will appreciate that these components may be implemented as executable software modules stored in the memory of the computing device, as hardware modules and/or components (including SoCs - system on a chip), or a combination of the two. Moreover, in certain embodiments each of the various components may be implemented as an independent, cooperative process or device, operating in conjunction with or on one or more computer systems and or computing devices. It should be further appreciated, of course, that the various components described above should be viewed as logical components for carrying out the various described functions. As those skilled in the art will readily appreciate, logical components and/or subsystems may or may not correspond directly, in a one-to-one manner, to actual, discrete components. In an actual embodiment, the various components of each computing device may be combined together or distributed across multiple actual components and/or implemented as cooperative processes on a computer network.

[0055] Figure 5 is a pictorial diagram illustrating an exemplary environment 500 suitable for implementing aspects of the disclosed subject matter. As shown in Figure 5, a computing device 400 (in this example the computing device being a mobile phone of user/person 501) may be configured to passively record an ongoing conversation among various persons as described above, including persons 501, 503, 505 and 507. Upon an indication by the user/person 501, the computing device 400 captures the contents of the passive recording buffer 414, obtains a transcription of the recently recorded content captured from the passive recording buffer, and stores the textual transcription as a note in a note file in a data store. The computing device 400 is connected to a network 502 over which the computing device may obtain a transcription of captured audio content (or audio/visual content) from a transcription service 510, and/or store the transcribed note in an online and/or cloud-based data store (not shown).

[0056] In addition to capturing or generating a note of an ongoing content stream, quite often a person may wish to associate additional information with that note. For example, most people have different roles that he/she may take on during any given day including, by way of illustration and not limitation: a personal/familial role; a volunteer role with a child's soccer team; different projects within one's employment, a community service role, and the like. Further, during an on-going conversation, a particular item - captured in a note from the on-going content stream/conversation - may be of particular relevance to a particular aspect of that person's life. In these cases, it would be advantageous to be able to annotate a captured note with a category or tag that can be used to identify a captured note as pertaining to that particular aspect of his/her life.

[0057] Still further, while generating a note from an ongoing conversation may capture critical information, locating that a particular note and/or understanding the context of a particular note may be greatly enhanced when the person is able to associate a tag or category with the note, i.e., annotate the captured note. With reference to the examples above, that person may greatly enhance the efficiency by which he/she can recall the particular context of a note and/or identify one or more notes that pertain to a particular aspect of his/her life by associating a tag (or more generally, an annotation) to a generated/captured note.

[0058] According to aspects of the disclosed subject matter, during an ongoing recording a person may provide an indication as to a category or tag to be associated with a generated note and an annotation. This indication may be made as part of or in addition to providing an indication to capture and generated a particular note of an ongoing conversation or audio stream as set forth in regard to Figures 6-9 and 10. Indeed, Figures 6-9 illustrate an exemplary interaction with a computing device executing an application for capturing notes from an on-going audio conversation, and further illustrating the annotation of a captured note with a category or tag. Figure 10 illustrates an exemplary routine 1000 for generating notes of the most recently passively recorded content of the ongoing content stream, for continued capture until indicated by a user, and for annotating the captured note with a predetermined category or tag.

[0059] With regard to Figures 6-9, Figure 6 illustrates a typical home screen as presented by an app (or application) executing on a computing device 600. The home screen illustrates/includes several meeting entries, such as meeting entries 602 and 604, for which the user of the computing device has caused the application to capture notes from an on- going conversation. In this regard, meetings are used as an organizational tool for providing a type of folder in which captured notes can be grouped together. As can be seen, meeting 602 entitled "PM Meeting," which occurred on "7/27/2015" at "10:31 AM," includes two captured notes. Similarly, meeting 604 entitled "Group Mtg," occurred on 7/28/2015 at 1 :30 PM and includes 3 captured notes. [0060] In addition to listing the "meetings" (which is more generically used as a folder for collecting generated notes of an on-going audio stream), a user may also create a new meeting (or folder) by interacting with the "add meeting" control 606. Thus, if the user is attending a new meeting and wishes to capture (or may wish to capture) notes from the conversation of the meeting, the user simply interacts with the "add meeting" control 606.

[0061] Turning to Figure 7, Figure 7 illustrates the exemplary computing device 600 of Figure 6 after the user has interacted with the "add meeting" control 606. According to one embodiment of the disclosed subject matter, as part of creating a new meeting, the note capturing app executing on the computing device 600 begins its process of passively recording the on-going content stream, as indicated by the status indicator 702. In addition to the status indicator, a meeting title 704 is also displayed, indicating a default title for the new meeting. Of course, in various embodiments, the default title of the meeting may be user configurable to something meaningful to the user, or be obtained from the user's calendar (i.e., a meeting coinciding with the current time). Also shown by the app on the computing device 600 is a duration control 706 by which the user can control the amount/duration (as a function of seconds) of content that is captured in the passive recording buffer, as described above. In the present example, the amount of content, in seconds, that is captured in the passive recording buffer is set at 30 seconds, i.e., 30 seconds worth of content is captured in the passive recording buffer.

[0062] Also shown in Figure 7 is a capture button 708. According to aspects of the disclosed subject matter, by interacting with the capture button 708, the user can cause the underlying app to capture/generate a note from the passive recording buffer and store the note, in conjunction with the meeting, in a note file as described above. Indeed, as also described, through continued interaction with the capture button 708, such as continuing to press on the capture button 708, the amount of content that is captured in the currently captured/generated note is extended until the interaction ceases, thereby extending the amount of content that is captured in a note. Also presented on the computing device 600 is a home control 710 that causes the passive recording to cease and returns to home page (as illustrated in Figure 6), and a category toggle control 712 that switches from "typical" note capturing to annotated note capturing as described below.

[0063] As mentioned above, in addition to simply capturing a note from the on-going content stream, the user may wish to associate a category or tag with a captured note as an annotation to the note. While a user may annotate an already-captured note with a category or tag after the note has been captured/generated, according to aspects of the disclosed subject matter, a set of predetermined categories may be presented, where each predetermined category is presented as a user-actionable control by which the user can cause the content of the passive recording buffer to be captured and converted into a note in the note file (in association with a meeting) as we as be annotated by the category presented by the user-actionable control. Indeed, by interacting with the category toggle control 712, the user can switch to/from the presentation of categories.

[0064] Figure 8 illustrates the exemplary computing device 600 after the user has switched to the presentation of categories as user-actionable controls for capturing the content of the passive recording buffer into a note in the note file and associating the corresponding category with the note as an annotation. As shown in Figure 8, the computing device 600 now presents a list of predetermined categories in the form of user- actionable controls, such as user-actionable controls 802-808. Indeed, according to aspects of the disclosed subject matter, by interacting with any one of the user-actionable controls 802-808, a note is generated and stored in the note file, annotated with the category associated with the user-actionable control, and associated with the meeting, or more genetically, an "event" corresponding to the ongoing content stream. In other words, the indication to associate the category with a generated note is also an indication to generate a note based on the recorded content of the passive recording of the ongoing content stream. Also shown in Figure 8 are the home control 710 that causes the passive recording to cease and returns to home page (as illustrated in Figure 6), and the category toggle control 712 that switches or toggles between "typical" note capturing and

"annotated" note capturing.

[0065] It should be appreciated and according to various embodiments, while a user may generate a note through interaction with a user-actionable control associated with a category, such as work/action control 802, a user may further configure a generated note to be associated with one or more additional categories. In this manner, a generated note may be associated with multiple categories. Of course, as will be readily appreciated, quite often a particular note is applicable to more than one category. For example, a note that is categorized as a work/action item (as represented by work/action control 802), may also be identifiable and categorized as an idea (as represented by the ideas control 804.) While various embodiments may include the ability to associate multiple categories at the time of generating a note from captured content, in alternative embodiments a user may, after the note is generated, associate one or more categories with the note from a plurality of predetermined categories. Further, while a default set of a plurality of categories may be provided with the application, this set of the plurality of predetermined categories may be user configurable such that the user may add, delete and/or modify the categories.

[0066] Assuming that the user has captured both a non-annotated note and annotated note for the meeting shown in the exemplary Figures 6-8, Figure 9 illustrates the exemplary computing device 600 showing the notes captured and associated with "Meeting 4."

Indeed, as shown in Figure 9, a title control 902 displays the current name of the meeting, a status control 904 illustrates various status information regarding the notes of the meeting, including that there are two (2) notes captured from the meeting, and further includes note 906 and 908, each presented as a user-actionable control for reproducing the note to the user. As can be seen, the first note 906 is not associated with a category (as indicted by the lack of a category icon on the control), whereas note 908 is

associated/annotated with a category as indicted by the presence of the category icon 912 (corresponding to the "Work/ Action" user-actionable category control 802 of Figure 8). In addition to the note controls 906 and 908, the computing device 600 also includes a user-actionable record icon 910 that returns to recording notes (either of the screens displayed in Figures 7 or 8) to continue capturing notes for this meeting.

[0067] While Figures 6-9 illustrate a particular set of user interfaces for interacting with an app executing on a computing device for capturing notes with a category annotation, it should be appreciated that this is simply one example of such user interaction and should note be viewed as limiting upon the disclosed subject matter. Those skilled in the art will appreciate that there may be any number of user interfaces that may be suitably used by an app to capture a note of an on-going audio stream from a passive recording buffer and associated or annotate the note with a category.

[0068] Turning to Figure 10, Figure 10 illustrates an exemplary routine 1000 for generating notes of the most recently portion of the ongoing content stream as located in the passive recording buffer as described above, for continued capture until indicated by a user, and for annotating the captured note with a predetermined category or tag.

Beginning at block 1002, a passive recording process of the ongoing audio stream is commenced. At block 1004, with the passive recording ongoing, the routine 1000 awaits a user instruction.

[0069] After receiving user instruction, at decision block 1006 a determination is made as to whether the user instruction is in regard to generating a note (from the recorded content in the passive recording buffer 102) or in regard to ending the routine 1000. If the user instruction is in regard to generating a note, at block 1008 the recently recorded content in the passive recording buffer is captured. In addition to capturing the recorded content from the passive recording buffer, at decision block 1010 a determination is made in regard to whether the user has indicated that the routine 1000 should continue capturing the ongoing audio stream for transcription as an expanded note. If the determination is made that the user has not indicated that the routine 1000 should continue capturing the ongoing audio stream, the routine proceeds to block 1016 as described below. However, if the user has indicated that the routine 1000 should continue capturing the ongoing audio stream as part of an expanded note, the routine proceeds to block 1012. At block 1012, without interrupting the passive recording process, the ongoing recording of the ongoing content stream to the passive recording buffer is continually captured as part of expanded captured recorded content, where the expanded captured recorded content is, thus, greater than the amount of recorded content that can be stored in the passive recording buffer.

[0070] At block 1014, this continued capture of the content stream continues until an indication from the user is received to release or terminate the continued capture. At block 1016, after capturing the recently recorded content in the passive recording buffer and any additional content as indicated by the user, a note is generated from the captured recorded content. According to various embodiments, the note may be generated according to a transcription of the recorded/captured content. Alternatively, a note may be generated as a single audio file from the recorded/captured content. Further still, a note may be stored in a note file in multiple formats, such as an audio file and a transcription.

[0071] At block 1018, the generated note is then stored in a note file, i.e., a data file comprising at least one or more text notes. As indicated above, according to various embodiments, the note may be stored in a note file in association with (or as part of) a meeting. At block 1020, a determination is made as to whether a category is to be associated with the generated note, i.e., whether to annotate the note with the category. If the note is not to be annotated with the category, the routine 1000 returns to block 1004 to await additional user instructions. Alternatively, if the generated note is to be annotated with a category, at block 1022 the note is annotated with the category and the routine 1000 returns to block 1004.

[0072] As mentioned above, at decision block 1006, the user instruction/action may be in regard to terminating the routine 1000. In this condition, the routine 1000 proceeds to block 1024 where the passive recording of the ongoing audio (or audio/visual) stream is terminated, and thereafter the routine 1000 terminates. [0073] While various novel aspects of the disclosed subject matter have been described, it should be appreciated that these aspects are exemplary and should not be construed as limiting. Variations and alterations to the various aspects may be made without departing from the scope of the disclosed subject matter.