Title:
AUTOMATIC GENERATION OF PREFERRED VIEWS FOR PERSONAL CONTENT COLLECTIONS
Document Type and Number:
WIPO Patent Application WO/2015/038335
Kind Code:
A1
Abstract:
Providing a view of relevant items of a content collection includes identifying a current context based on temporal parameters, spatial parameters, navigational parameters, lexical parameters, organizational parameters, and/or events, evaluating each of the items of the content collection according to the current context to provide a value for each of the items, and displaying a subset of the items corresponding to highest determined values. The temporal parameters may include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item. Temporal patterns of accessing items may be numerically assessed based on time of day, time of week, and/or time of month. Evaluating each item may include determining a distance from a separating hyperplane using a support vector machine classification method.

Inventors:
AYZENSHTAT MARK (US)
BURFORD CLINTON (US)
Application Number:
PCT/US2014/052859
Publication Date:
March 19, 2015
Filing Date:
August 27, 2014
Assignee:
EVERNOTE CORP (US)
International Classes:
G06N20/10
Domestic Patent References:
WO2013025460A1 2013-02-21
Foreign References:
US20090013250A1 2009-01-08
US20100142807A1 2010-06-10
US20090164416A1 2009-06-25
US20130222257A1 2013-08-29
US20100036782A1 2010-02-11
Other References:
ALLEGREZZA ET AL., INTERNET ECONOMETRICS., 17 January 2012 (2012-01-17), Retrieved from the Internet [retrieved on 20141028]
OLDENBURG, M. ET AL.: "OneNote Mobile for Android is now available worldwide.", 7 February 2012 (2012-02-07), Retrieved from the Internet [retrieved on 20141028]
Attorney, Agent or Firm:
MUIRHEAD, Donald, W. et al. (LLC200 Friberg Parkway, Suite 100, Westborough MA, US)
Claims:
What is claimed is:

1. A method of providing a view of relevant items of a content collection, comprising:

identifying a current context based on at least one of: temporal parameters, spatial parameters, navigational parameters, lexical parameters, organizational parameters, and events; evaluating each of the items of the content collection according to the current context to provide a value for each of the items; and

displaying a subset of the items corresponding to highest determined values.

2. A method, according to claim 1, wherein the temporal parameters include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item.

3. A method, according to claim 2, wherein frequency of access of an item is modeled according to the following formula:

$$f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$$

where $f^u(\cdot)$ is a feature value for frequency, $e$ is an accessed content item, $c_i$ ($c_j$), $C$, $C_e$ are, respectively, past user actions and a set of all actions and only past actions where the user has accessed the item $e$, $t(c_i)$ ($t(c_j)$) is an age of each access event measured at a present moment, and $t_m$ is a normalizing median coefficient.

4. A method, according to claim 2, wherein temporal patterns of accessing items are numerically assessed based on at least one of: time of day, time of week, and time of month.

5. A method, according to claim 1, wherein evaluating each item includes determining a distance from a separating hyperplane using a support vector machine classification method.

6. A method, according to claim 1, wherein user feedback is used to adjust subsequent evaluation of each of the items.

7. A method, according to claim 6, wherein the user feedback is implicit and includes frequency of actual viewing by the user.

8. A method, according to claim 6, wherein the user feedback is explicit.

9. A method, according to claim 6, wherein user feedback is used to modify features used to evaluate items.

10. A method, according to claim 1, wherein the subset of items includes only items having a value above a predetermined threshold and wherein displaying the subset of items includes sorting the subset according to values provided for each of the items and wherein items that are not part of the subset are displayed following items in the subset.

11. A method, according to claim 1, wherein displaying the subset of items includes displaying the items in a pop up screen that is superimposed over a different list containing the items.

12. A method, according to claim 1, wherein analyzing items includes splitting the items into a training set and a test set and building a classifier using automatic learning.

13. A method, according to claim 12, wherein prior to evaluating the items, the items in the training set are analyzed to develop a set of rules used for evaluation of the items.

14. A method, according to claim 13, wherein the temporal parameters of the items in the training set include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item.

15. A method, according to claim 14, wherein frequency of access of an item is modeled according to the following formula:

$$f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$$

where $f^u(\cdot)$ is a feature value for frequency, $e$ is an accessed content item, $c_i$ ($c_j$), $C$, $C_e$ are, respectively, past user actions and a set of all actions and only past actions where the user has accessed the item $e$, $t(c_i)$ ($t(c_j)$) is an age of each access event measured at a present moment, and $t_m$ is a normalizing median coefficient.

16. A method, according to claim 14, wherein temporal patterns of accessing items are numerically assessed based on at least one of: time of day, time of week, and time of month.

17. A method, according to claim 1, wherein the items are displayed on a mobile device.

18. A method, according to claim 17, wherein the mobile device includes software that is one of: pre-loaded with the device, installed from an app store, installed from a desktop computer, installed from media, and downloaded from a Web site.

19. A method, according to claim 17, wherein the mobile device uses an operating system selected from the group consisting of: iOS, Android OS, Windows Phone OS, Blackberry OS and mobile versions of Linux OS.

20. A method, according to claim 1, wherein the items are stored using the OneNote® note-taking software provided by the Microsoft Corporation of Redmond, Washington.

21. Computer software, provided in a non-transitory computer-readable medium, that provides a view of relevant items of a content collection, the software comprising:

executable code that identifies a current context based on at least one of: temporal parameters, spatial parameters, navigational parameters, lexical parameters, organizational parameters, and events;

executable code that evaluates each of the items of the content collection according to the current context to provide a value for each of the items; and

executable code that displays a subset of the items corresponding to highest determined values.

22. Computer software, according to claim 21, wherein the temporal parameters include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item.

23. Computer software, according to claim 22, wherein frequency of access of an item is modeled according to the following formula:

$$f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$$

where $f^u(\cdot)$ is a feature value for frequency, $e$ is an accessed content item, $c_i$ ($c_j$), $C$, $C_e$ are, respectively, past user actions and a set of all actions and only past actions where the user has accessed the item $e$, $t(c_i)$ ($t(c_j)$) is an age of each access event measured at a present moment, and $t_m$ is a normalizing median coefficient.

24. Computer software, according to claim 22, wherein temporal patterns of accessing items are numerically assessed based on at least one of: time of day, time of week, and time of month.

25. Computer software, according to claim 21, wherein executable code that evaluates each item determines a distance from a separating hyperplane using a support vector machine classification method.

26. Computer software, according to claim 21, wherein user feedback is used to adjust subsequent evaluation of each of the items.

27. Computer software, according to claim 26, wherein the user feedback is implicit and includes frequency of actual viewing by the user.

28. Computer software, according to claim 26, wherein the user feedback is explicit.

29. Computer software, according to claim 26, wherein user feedback is used to modify features used to evaluate items.

30. Computer software, according to claim 21, wherein the subset of items includes only items having a value above a predetermined threshold and wherein displaying the subset of items includes sorting the subset according to values provided for each of the items and wherein items that are not part of the subset are displayed following items in the subset.

31. Computer software, according to claim 21, wherein executable code that displays the subset of items displays the items in a pop up screen that is superimposed over a different list containing the items.

32. Computer software, according to claim 21, wherein executable code that analyzes items splits the items into a training set and a test set and builds a classifier using automatic learning.

33. Computer software, according to claim 32, wherein prior to evaluating the items, the items in the training set are analyzed to develop a set of rules used for evaluation of the items.

34. Computer software, according to claim 33, wherein the temporal parameters of the items in the training set include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item.

35. Computer software, according to claim 34, wherein frequency of access of an item is modeled according to the following formula:

$$f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$$

where $f^u(\cdot)$ is a feature value for frequency, $e$ is an accessed content item, $c_i$ ($c_j$), $C$, $C_e$ are, respectively, past user actions and a set of all actions and only past actions where the user has accessed the item $e$, $t(c_i)$ ($t(c_j)$) is an age of each access event measured at a present moment, and $t_m$ is a normalizing median coefficient.

36. Computer software, according to claim 34, wherein temporal patterns of accessing items are numerically assessed based on at least one of: time of day, time of week, and time of month.

37. Computer software, according to claim 21, wherein the items are displayed on a mobile device.

38. Computer software, according to claim 37, wherein the mobile device includes software that is one of: pre-loaded with the device, installed from an app store, installed from a desktop computer, installed from media, and downloaded from a Web site.

39. Computer software, according to claim 37, wherein the mobile device uses an operating system selected from the group consisting of: iOS, Android OS, Windows Phone OS, Blackberry OS and mobile versions of Linux OS.

40. Computer software, according to claim 21, wherein the items are stored using the OneNote® note-taking software provided by the Microsoft Corporation of Redmond, Washington.

Description:
AUTOMATIC GENERATION OF PREFERRED VIEWS FOR PERSONAL CONTENT COLLECTIONS

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Prov. App. No. 61/878,296, filed September 16, 2013, and entitled "AUTOMATIC GENERATION OF PREFERRED VIEWS FOR PERSONAL CONTENT COLLECTIONS", which is incorporated herein by reference.

TECHNICAL FIELD

This application is directed to the field of extracting, analyzing and presenting information, especially in conjunction with custom ordering of items in personal and shared content management systems.

BACKGROUND OF THE INVENTION

Hundreds of millions of people are using personal, shared and business-wide content management systems, such as the Evernote service and software created by the Evernote Corporation of Redwood City, California, the Microsoft® Office OneNote and many more systems. Content collections supported by such software and online services may contain thousands and even hundreds of thousands of content items (notes, memos, documents, etc.) with widely varying sizes, content types and other parameters. These items are viewed and modified by users in different order, with different frequency and under different circumstances. Routines for accessing items in content collections may include direct scrolling, keyword and natural language search, accessing items by tags, categories, notebooks, browsing interlinked clusters of items with or without indexes and tables of contents, and other methods.

Irrespective of specific methods, quick and targeted access to desired content at any given moment, place and situation is important to user productivity and convenience. Search technologies, organizational and user interface features, such as tags, favorites, folders, advanced content sorting, and other functionality provide significant help in accessing needed content. Contemporary content management systems may expand search to images, audio and video, synonyms, semantic terms, anthologies and language specifics. Navigational methods for tags, tag clouds, lists of favorites, and interlinked clusters of items are constantly progressing and may include multi-dimensional and dynamic data representation, advanced use of touch interfaces and screen real estate, etc.

Still, even the most sophisticated search and navigational methods may be insufficient for quickly growing information volumes. Additionally, repetitive searches for the same materials, even with saved queries, take additional time with every search occurrence. A recent enterprise search study has discovered a significant search gap affecting all categories of workers: 52% of respondents said they could not find the information they were seeking within an acceptable amount of time using their own organization's enterprise search facility. Further analysis has shown that 65% of respondents have defined an overall good search experience as a situation where a particular search takes less than two minutes. However, only 48% of study participants have reported being able to achieve that result in their own organization. In other words, there exists a 17% gap between user expectation of satisfying search experiences and an enterprise search reality. Additionally, about 90% of respondents reported that taking four minutes or more to find the information they want does not constitute a good search experience; yet 27% responded this was the case within their own enterprises. Accordingly, limited search efficiency may drive many users to abandon search as a method of defining immediate views of materials from personal or shared data collections. Analogously, sorting items in a content collection by time, location, size and other parameters may complicate information processing and still fall short of representing content views required by users.

Furthermore, user needs in accessing various materials from content collections (notes, attachments, notebooks, folders, etc.) are driven, on the one hand, by constantly changing work, home and other environments, and on the other hand, by repetitive patterns of user adaptation to such environments. For example, users may need several notes with standard bits of information (a social security number, a driver license number, a passport number or other IDs, a credit card number) every time they visit an official establishment. However, additional pieces of information that they may need could significantly differ depending on whether the users visit a bank or a medical office, are traveling to a place where they have taken family photos and want to recall them, or are reviewing materials before a weekly staff meeting. Reflecting dynamic combinations of parameters, different environments and contexts influencing content access requirements and customized content views may be difficult with fixed content settings such as tags or favorite lists, while trying to memorize such combinations of parameters may be cumbersome, tiring, and inefficient, and may require frequent updates as user behavior patterns evolve.

Accordingly, it is desirable to develop advanced systems and methods for generating preferred content views depending on context and user viewing history.

SUMMARY OF THE INVENTION

According to the system described herein, providing a view of relevant items of a content collection includes identifying a current context based on temporal parameters, spatial parameters, navigational parameters, lexical parameters, organizational parameters, and/or events, evaluating each of the items of the content collection according to the current context to provide a value for each of the items, and displaying a subset of the items corresponding to highest determined values. The temporal parameters may include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item. Frequency of access of an item may be modeled according to the following formula:

$$f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$$

where $f^u(\cdot)$ is a feature value for frequency, $e$ is an accessed content item, $c_i$ ($c_j$), $C$, $C_e$ are, respectively, past user actions and a set of all actions and only past actions where the user has accessed the item $e$, $t(c_i)$ ($t(c_j)$) is an age of each access event measured at a present moment, and $t_m$ is a normalizing median coefficient. Temporal patterns of accessing items may be numerically assessed based on time of day, time of week, and/or time of month. Evaluating each item may include determining a distance from a separating hyperplane using a support vector machine classification method. User feedback may be used to adjust subsequent evaluation of each of the items. The user feedback may be implicit and may include frequency of actual viewing by the user. The user feedback may be explicit. User feedback may be used to modify features used to evaluate items. The subset of items may include only items having a value above a predetermined threshold and displaying the subset of items may include sorting the subset according to values provided for each of the items and items that are not part of the subset may be displayed following items in the subset. Displaying the subset of items may include displaying the items in a pop up screen that is superimposed over a different list containing the items. Analyzing items may include splitting the items into a training set and a test set and a classifier may be built using automatic learning. Prior to evaluating the items, the items in the training set may be analyzed to develop a set of rules used for evaluation of the items. The temporal parameters of the items in the training set may include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and/or frequency of event related access of an item. The items may be displayed on a mobile device. 
The mobile device may include software that is pre-loaded with the device, installed from an app store, installed from a desktop computer, installed from media, or downloaded from a Web site. The mobile device may use an operating system selected from the group consisting of: iOS, Android OS, Windows Phone OS, Blackberry OS and mobile versions of Linux OS. Items may be stored using the OneNote® note-taking software provided by the Microsoft Corporation of Redmond, Washington.

According further to the system described herein, computer software, provided in a non-transitory computer-readable medium, provides a view of relevant items of a content collection. The software includes executable code that identifies a current context based on temporal parameters, spatial parameters, navigational parameters, lexical parameters, organizational parameters, and/or events, executable code that evaluates each of the items of the content collection according to the current context to provide a value for each of the items, and executable code that displays a subset of the items corresponding to highest determined values. The temporal parameters may include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and frequency of event related access of an item. Frequency of access of an item may be modeled according to the following formula: $f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$, where $f^u(\cdot)$ is a feature value for frequency, $e$ is an accessed content item, $c_i$ ($c_j$), $C$, $C_e$ are, respectively, past user actions and a set of all actions and only past actions where the user has accessed the item $e$, $t(c_i)$ ($t(c_j)$) is an age of each access event measured at a present moment, and $t_m$ is a normalizing median coefficient. Temporal patterns of accessing items may be numerically assessed based on time of day, time of week, and/or time of month. Executable code that evaluates each item may determine a distance from a separating hyperplane using a support vector machine classification method. User feedback may be used to adjust subsequent evaluation of each of the items. The user feedback may be implicit and may include frequency of actual viewing by the user. The user feedback may be explicit. User feedback may be used to modify features used to evaluate items. 
The subset of items may include only items having a value above a predetermined threshold and displaying the subset of items may include sorting the subset according to values provided for each of the items and items that are not part of the subset may be displayed following items in the subset. Executable code that displays the subset of items may display the items in a pop up screen that is superimposed over a different list containing the items. Executable code that analyzes items may split the items into a training set and a test set and may build a classifier using automatic learning. Prior to evaluating the items, the items in the training set may be analyzed to develop a set of rules used for evaluation of the items. The temporal parameters of the items in the training set may include a time of recent access of an item, frequency of access of an item, frequency of location related access of an item, and/or frequency of event related access of an item. The items may be displayed on a mobile device. The mobile device may include software that is pre-loaded with the device, installed from an app store, installed from a desktop computer, installed from media, or downloaded from a Web site. The mobile device may use an operating system selected from the group consisting of: iOS, Android OS, Windows Phone OS, Blackberry OS and mobile versions of Linux OS. Items may be stored using the OneNote® note-taking software provided by the Microsoft Corporation of Redmond, Washington.

The proposed system automatically generates preferred content views, re-grouping and selecting such content items as notes and notebooks depending on a particular environment or conditions, reflected in context related features, and based on automatic classification with parameters derived from historical patterns of user access to items. At a first phase, extensive content collections from many existing users of a content management system are processed and analyzed to develop a set of learning features, or rules, derived from contexts (environment, situation, conditions) and defining stable repetitive viewing of content items (e.g., notes). Such features may include and combine temporal, spatial, navigational, lexical, organizational and other parameters, events such as meetings, trips, visits, and other factors that may be pre-processed and formalized by the system, to reflect real life situations via linguistic variables in the meaning accepted in probability and fuzzy set theories. Thus, temporal features may include modeled notions of recent access, frequent access, frequent location related access, frequent event related access, etc. For example, a numeric feature value for frequent access may be modeled as: $f^u(e) = \frac{\sum_{c_i \in C_e} 2^{-t(c_i)/t_m}}{\sum_{c_j \in C} 2^{-t(c_j)/t_m}}$ (1) where $f^u(\cdot)$ is a feature value for frequency (a superscript 'u' reflects the term 'usualness'); $e$ is an accessed content item, such as a note, a notebook or a tag; $c_i$ ($c_j$), $C$, $C_e$ are, respectively, the past user actions and the set of all actions and only the past actions where the user has accessed the item $e$; $t(c_i)$ ($t(c_j)$) is an age of each access event measured at the present moment; $t_m$ is a normalizing median coefficient; for example, if all time measurements are in seconds, $t_m$ may be equal to 2,592,000, which corresponds to a 30-day age of an item. 
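As an illustration only, formula (1) might be computed as in the following minimal sketch. The `AccessEvent` record, its field names and the zero-total convention are assumptions made for the example and are not part of the described system; the default `t_m` is the 30-day example value given above.

```python
from dataclasses import dataclass

T_M_SECONDS = 2_592_000  # 30 days in seconds, the example value of t_m from the text


@dataclass(frozen=True)
class AccessEvent:
    """One past user action c (hypothetical record layout)."""
    item_id: str        # the item e that was accessed
    age_seconds: float  # t(c): age of the access event at the present moment


def frequency_feature(item_id, events, t_m=T_M_SECONDS):
    """Compute f^u(e): decayed weight of accesses to e divided by that of all actions."""
    def weight(event):
        # Each action contributes 2^(-t(c)/t_m), so its weight halves every t_m seconds.
        return 2.0 ** (-event.age_seconds / t_m)

    total = sum(weight(c) for c in events)  # denominator: sum over C
    if total == 0.0:
        return 0.0  # assumed convention when no actions are recorded
    on_item = sum(weight(c) for c in events if c.item_id == item_id)  # sum over C_e
    return on_item / total


history = [
    AccessEvent("note-a", 0.0),
    AccessEvent("note-a", T_M_SECONDS),
    AccessEvent("note-b", T_M_SECONDS),
]
print(frequency_feature("note-a", history))  # 0.75
```

Passing a list that has already been filtered to one location, navigational scheme or event yields the corresponding combined feature, since only the sets of actions change.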
Analogously, by restricting sets of user note access actions to actions performed in a certain location ($C^l$, $C_e^l$), corresponding to a certain navigational scheme ($C^n$, $C_e^n$) or an event (incidence), such as a calendar meeting ($C^i$, $C_e^i$), combined temporal and non-temporal features such as frequency+location, frequency+navigation, etc. can be modeled. Furthermore, temporal patterns of accessing notes may also be numerically assessed. Examples are presented in the following list:

• The time of day, measured in half hour intervals

• The time of week, measured in half hour intervals

• The time of month, measured in half hour intervals

• The time of day, measured in four hour intervals

• The time of week, measured in four hour intervals

• The time of month, measured in four hour intervals

• The day of week

• The time of week, measured in twenty-four hour intervals

• The time of month, measured in twenty-four hour intervals
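The nine temporal patterns listed above may be sketched as interval indices derived from a timestamp. The function name, the dictionary keys, zero-based indexing and the Monday-first week are assumptions of this example rather than details taken from the description.

```python
from datetime import datetime


def temporal_features(moment: datetime) -> dict:
    """Return zero-based interval indices for the nine temporal patterns."""
    minutes_into_day = moment.hour * 60 + moment.minute
    day_of_week = moment.weekday()  # Monday = 0 (Python convention, assumed here)
    day_of_month = moment.day - 1   # zero-based day within the month

    def indices(minutes_per_interval):
        per_day = (24 * 60) // minutes_per_interval      # intervals in one day
        in_day = minutes_into_day // minutes_per_interval
        return (
            in_day,                                      # time of day
            day_of_week * per_day + in_day,              # time of week
            day_of_month * per_day + in_day,             # time of month
        )

    day_30, week_30, month_30 = indices(30)
    day_240, week_240, month_240 = indices(240)
    _, week_1440, month_1440 = indices(1440)
    return {
        "day_30min": day_30, "week_30min": week_30, "month_30min": month_30,
        "day_4h": day_240, "week_4h": week_240, "month_4h": month_240,
        "day_of_week": day_of_week,
        "week_24h": week_1440, "month_24h": month_1440,
    }


# Wednesday, August 27, 2014 at 14:45
print(temporal_features(datetime(2014, 8, 27, 14, 45)))
```

With these conventions the day of week and the time of week in twenty-four hour intervals coincide; both are kept so that the output mirrors the list.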

The following are examples of contexts and applications where the temporal, spatial, navigational and other features may be utilized:

• View a certain note every time a user is at a given location. This rule has a broad set of applications, such as viewing partnership related notes when a user arrives at a meeting at a partner address and the system identifies user location, for example, from a mobile copy of content management software running on a user location aware device (GPS, GeoIP, etc.). Another application could be an automatic display of a note with an ATM pin number when a user arrives at a known ATM location where the note containing the pin number was frequently recalled in the past.

• View a certain note at a given time if such note has been repetitively viewed at around the given time in the past. Applications could be Monday to-do lists for the week on Monday morning; meeting notes from a last week staff meeting; etc.

• View a certain note in conjunction with a scheduled event, such as meetings, meeting reminders, action reminders, etc.

• View a certain note if it previously appeared near the top of a saved search query and has been frequently viewed after such search has been performed.

• View a certain note when a meeting with certain people around the same periodically repeating time is detected by schedule and location aware technologies. Applications include opening a master project schedule every time all or part of a project team meets for weekly project reviews.

Based on a preliminary analysis of repetitive note viewing patterns, a set of features / rules may be chosen and numeric representations for the features may be defined, as explained elsewhere herein. At a next phase, the conglomerate of pre-existing content collections may be split into a training set and a test set, and a binary classifier may be built and optimized using automatic learning.
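The split into a training set and a test set and the automatic learning of a binary classifier might look like the sketch below. The description does not specify a training procedure; subgradient descent on an L2-regularized hinge loss (a common way to fit a linear SVM) is used here as a stand-in, and the toy two-feature vectors, function names and hyperparameters are all hypothetical.

```python
import random


def split_examples(examples, test_fraction=0.25, seed=0):
    """Shuffle labelled (features, label) pairs and split them into training and test sets."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def decision_value(w, b, x):
    """Value of w.x + b for feature vector x."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def train_linear_svm(training_set, epochs=200, learning_rate=0.1, reg=0.01):
    """Fit weights w and bias b by subgradient descent on the regularized hinge loss."""
    dim = len(training_set[0][0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        for x, y in training_set:  # y is +1 (preferred in this context) or -1
            margin = y * decision_value(w, b, x)
            w = [wi - learning_rate * reg * wi for wi in w]  # step on the L2 term
            if margin < 1:  # inside the margin: step on the hinge term
                w = [wi + learning_rate * y * xi for wi, xi in zip(w, x)]
                b += learning_rate * y
    return w, b


def accuracy(w, b, labelled):
    """Fraction of examples whose predicted sign matches the label."""
    return sum(1 for x, y in labelled if y * decision_value(w, b, x) > 0) / len(labelled)


# Toy (item, context) feature vectors, centered so that the two classes are separable.
examples = [
    ([1.0, 0.8], 1), ([0.8, 1.0], 1), ([0.9, 0.9], 1), ([1.0, 1.0], 1),
    ([-1.0, -0.8], -1), ([-0.8, -1.0], -1), ([-0.9, -0.9], -1), ([-1.0, -1.0], -1),
]
training_set, test_set = split_examples(examples)
w, b = train_linear_svm(training_set)
print(accuracy(w, b, test_set))
```

Evaluating on the held-out test set, rather than on the training set, is what allows the classifier to be optimized before it is delivered to new users.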

The classifier may work with an input data pair (item, context) and may define whether the item may be added to a preferred viewing list for a given context; additionally, for items that are positively assessed by the classifier, the score of the items may be calculated, such as a distance from the separating hyperplane in the numeric feature space corresponding to the (linear or non-linear) Support Vector Machine (SVM) classification method. Ranking notes in the preferred viewing list by scores of the notes may allow control over a length of the list to address possible user interface and other requirements. A version of a preferred note view classifier developed at the previous step may be bundled with the content management or note-taking software and may be delivered to new users and immediately employed for automatic building of custom preferred content views for various contexts. Explicit or implicit user feedback on the functioning of such a classifier may be used to improve the system and adjust the classifier:

• Implicit user feedback may be monitored by the system through measuring frequency of actual viewing by users of notes in the preferred lists.

• Explicit user feedback may use a built-in feedback mechanism.

Both techniques may lead to re-training and adjusting parameters of the classifier, such as weights representing the coordinates of a normal vector in the SVM method. In some embodiments, user feedback may be used to modify the set of features through supervised learning.
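For the linear case, scoring by distance from the separating hyperplane and ranking the positively assessed items may be sketched as follows. The weights, bias and item identifiers are made-up values for illustration, and `max_length` is a hypothetical parameter standing in for the control over list length mentioned above.

```python
import math


def hyperplane_distance(w, b, x):
    """Signed distance of feature vector x from the hyperplane w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm


def preferred_list(w, b, item_features, max_length=None):
    """Keep positively classified items and rank them by distance, highest first."""
    scored = [(hyperplane_distance(w, b, x), item_id)
              for item_id, x in item_features.items()]
    positive = sorted((pair for pair in scored if pair[0] > 0), reverse=True)
    ranked = [item_id for _, item_id in positive]
    return ranked if max_length is None else ranked[:max_length]


w, b = [3.0, 4.0], -5.0  # w holds the coordinates of the normal vector
items = {
    "note-a": [3.0, 4.0],  # distance  4.0
    "note-b": [1.0, 1.0],  # distance  0.4
    "note-c": [0.0, 0.0],  # distance -1.0, not preferred
    "note-d": [2.0, 1.0],  # distance  1.0
}
print(preferred_list(w, b, items))                # ['note-a', 'note-d', 'note-b']
print(preferred_list(w, b, items, max_length=2))  # ['note-a', 'note-d']
```

Re-training in response to feedback would change `w` and `b` only; the ranking step itself stays the same.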

From the user interface standpoint, preferred viewing lists may be implemented in a variety of ways. The preferred viewing lists may be displayed as separate lists of notes that automatically pop up on a user screen every time a new context is identified and requires an update of a preferred view. Alternatively, a preferred view may populate a list or a section of a list of favorite user notes. In yet another implementation, preferred notes for a new context may be displayed in a top portion of a main note view preceding other notes, as if the preferred view implied a new sorting order pushing previously displayed top items down the list.
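The last of these implementations, in which preferred notes move to the top of the main note view, may be sketched as a simple reordering. The function name and the note identifiers are hypothetical.

```python
def reorder_main_view(main_view, preferred):
    """Place preferred items first, in ranked order, then the rest in their original order."""
    in_view = set(main_view)
    top = [item for item in preferred if item in in_view]  # ignore items not in the view
    top_set = set(top)
    rest = [item for item in main_view if item not in top_set]
    return top + rest


main_view = ["n1", "n2", "n3", "n4", "n5", "n6", "n7", "n8"]
print(reorder_main_view(main_view, ["n6", "n3"]))
# ['n6', 'n3', 'n1', 'n2', 'n4', 'n5', 'n7', 'n8']
```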

Preferred views may not be limited to individual notes and other elementary content units. A similar technique may be applied to choosing larger content assemblies, such as notebooks or notebook stacks in the Evernote content management system. The techniques may also be used to modify tags, lists of saved searches, lists of favorites and other content related displayable attributes that may depend on the environment, external conditions and contexts.

It should be noted that, while the system may constantly monitor changing conditions, the system may also have built-in thresholds to identify meaningful changes of the context and assess notes for the purpose of inclusion of particular notes into preferred views only when such meaningful changes occur. Such clustering of contexts may bring additional economy of system resources.
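One possible form of such thresholds is sketched below: items are re-assessed only when the device has moved far enough or a new time interval has begun. The `Context` fields, the 200-metre and 30-minute thresholds and the use of the haversine distance are assumptions of this example; the description does not say how the thresholds are defined.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Context:
    """A reduced context with location and time only (hypothetical layout)."""
    latitude: float
    longitude: float
    minutes_into_week: int


def distance_m(a: Context, b: Context) -> float:
    """Great-circle distance in metres between two contexts (haversine formula)."""
    earth_radius_m = 6_371_000.0
    phi1, phi2 = math.radians(a.latitude), math.radians(b.latitude)
    d_phi = phi2 - phi1
    d_lambda = math.radians(b.longitude - a.longitude)
    h = math.sin(d_phi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    return 2 * earth_radius_m * math.asin(math.sqrt(h))


def is_meaningful_change(previous, current, min_distance_m=200.0, interval_minutes=30):
    """Return True when the context changed enough to justify re-assessing items."""
    moved = distance_m(previous, current) >= min_distance_m
    new_interval = (previous.minutes_into_week // interval_minutes
                    != current.minutes_into_week // interval_minutes)
    return moved or new_interval


last = Context(37.0, -122.0, 600)
print(is_meaningful_change(last, Context(37.0, -122.0, 610)))   # False
print(is_meaningful_change(last, Context(37.01, -122.0, 610)))  # True
```

Skipping the classifier whenever this check returns False is one way to obtain the economy of system resources noted above.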

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the system described herein will now be explained in more detail in accordance with the figures of the drawings, which are briefly described as follows.

FIG. 1 is a schematic illustration of a preferred note view created in response to a temporal, scheduling and sharing context, according to an embodiment of the system described herein.

FIG. 2 is a schematic illustration of preferred note and notebooks views created in response to a geolocation context, according to an embodiment of the system described herein.

FIG. 3 is a schematic illustration of feature extraction from an individual note and classification of the note, according to an embodiment of the system described herein.

FIG. 4 is a system flow diagram illustrating automatic learning, according to an embodiment of the system described herein.

FIG. 5 is a system flow diagram describing building of preferred content views, according to an embodiment of the system described herein.

DETAILED DESCRIPTION OF VARIOUS EMBODIMENTS

The system described herein provides a mechanism for building preferred views of items from individual, shared and organization-wide content collections in response to changing environment and context. Items may include individual notes, notebooks, tags, search lists and other attributes; contexts may include temporal characteristics, location, navigation, events, content organization and other features. The mechanism utilizes classifiers built through automatic learning based on past user access to content items; classifiers may be dynamically adjusted based on user feedback.

FIG. 1 is a schematic illustration 100 showing a preferred note view created in response to a temporal, scheduling and sharing context. A content collection 110 displays eight notes 120 to a user. In response to a new context 130 that includes a temporal context 130a, a scheduling context 130b and a sharing context 130c, a system classifier applied to the content collection (not shown in FIG. 1, see FIG. 3 and the accompanying text for details) chooses two notes 140a, 140b for inclusion in a preferred system view. Subsequently, the notes in a previously displayed main note view are reordered so that the notes 140a, 140b occupy a top position 150 and a remainder of the notes 160 are pushed down the main view.

FIG. 2 is a schematic illustration 200 showing preferred note and notebook views created in response to a geolocation context. Analogously to FIG. 1, a content collection 110 displays eight notes 120 to a user. Additionally, a notebook view of the content collection includes three notebooks 210 (notebooks A, B, C). In response to a new geolocation context 220, a system classifier applied to the content collection (not shown in FIG. 2) chooses two notes 150a, 150b and a notebook C for inclusion in a preferred system view. Subsequently, the two selected notes 150a, 150b are displayed in a pop-up pane 230; at a bottom portion 240 of the pane 230, the selected notebook is also displayed. FIG. 2 illustrates a different user interface solution compared with FIG. 1: in FIG. 2, the pane 230 with a preferred note view is shown on top of a main note view 250.

FIG. 3 is a schematic illustration 300 of feature extraction from an individual note and classification of the individual note. The note collection 110 is scanned by the system to identify notes that should be included in a preferred note view reflecting a current context 320. In the example of FIG. 3, a note 310 is evaluated based on the current context 320. The current context 320 may include multiple components, such as a temporal context 320a, a spatial (geolocation) context 320b, scheduled events 320c, a navigational context 320d (such as a scrolling view, a tag based view or a notebook based view within a content collection), a sharing context 320e, a search context 320f, a linguistic (textual) context 320g, a travel context 320h, a social network context 320i, etc. Furthermore, each component of the context may be represented by one or multiple features 330. In the illustration 300, three sample feature sets 330a, 330b, 330c are shown and the first feature in each set is described in detail:

• The feature set 330a is a group of k features $T_1, \ldots, T_k$ for a temporal context;

• The feature set 330b is a group of m features $S_1, \ldots, S_m$ for a spatial context;

• The feature set 330c is a group of n features $L_1, \ldots, L_n$ for a search context.

The system may extract attributes of the note 310 corresponding to each of the feature sets 330a-330c and build numeric feature values 340, as explained elsewhere herein (see, for example, formula (1) for some of the temporal features). Numeric feature values are illustrated in FIG. 3 for a temporal context - the feature set 340a, and for a search context - the feature set 340b.

At a next step, a vector V of feature values 340 is processed using a classifier 350, such as an SVM classifier where a separating plane defining one of two possible outcomes is defined by a normal vector W of the classifier plane, so the outcome is associated with a sign of the dot product V · W (for example, V · W > 0 may indicate an inclusion of the note 310 into a preferred note view, as illustrated in FIGS. 1 and 2). Based on the classification result, the system makes a binary decision 360 to add the selected note 310 to the preferred note view or to not add the note 310.
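As a non-limiting illustration, the binary decision 360 based on the sign of the dot product V · W may be sketched as follows. The function names and all numeric values below are hypothetical and serve only to show the sign test; a separating plane passing through the origin is assumed, consistent with the use of the sign of V · W.

```python
from typing import Sequence


def dot(v: Sequence[float], w: Sequence[float]) -> float:
    """Dot product of a feature vector V and the normal vector W."""
    if len(v) != len(w):
        raise ValueError("V and W must have the same number of components")
    return sum(a * b for a, b in zip(v, w))


def include_in_preferred_view(v: Sequence[float], w: Sequence[float]) -> bool:
    """Binary decision: a positive V . W places the note in the preferred view."""
    return dot(v, w) > 0


# Hypothetical normal vector W and feature vectors V for two notes.
w = [0.8, -0.5, 0.3]
note_a = [1.0, 0.2, 0.5]  # V . W = 0.8 - 0.1 + 0.15 = 0.85, so the note is included
note_b = [0.1, 0.9, 0.0]  # V . W = 0.08 - 0.45 = -0.37, so the note is not included

decision_a = include_in_preferred_view(note_a, w)
decision_b = include_in_preferred_view(note_b, w)
```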

Referring to FIG. 4, a flow diagram illustrates automatic learning in conjunction with compiling an SVM classifier. Processing begins at a step 410 where pre-existing notes and access history are collected, as explained elsewhere herein. After the step 410, processing proceeds to a step 420 where a feature set for automatic learning is built. After the step 420, processing proceeds to a step 430 where a classifier designated for pre-building into the system and delivering to users is trained and evaluated utilizing training and test sets of notes (and possibly other items in content collections, such as notebooks, tags, search lists, etc.). After the step 430, processing proceeds to a step 440 where the classifier is delivered to a new user with the classifier software. After the step 440, processing proceeds to a step 450 where, in connection with software functioning and user access to notes and other items in different environments, the system collects additional contexts and note access history for the new user. After the step 450, processing proceeds to a step 460 where the classifier is re-trained and parameters of the classifier are modified. After the step 460, processing is complete.
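As a non-limiting illustration, the learning flow of FIG. 4 may be sketched as follows. The sketch trains a linear classifier by sub-gradient descent on a regularized hinge loss, which is a simplified stand-in for SVM training and is not stated to be the training method of the system. The label encoding (+1 for a note accessed in a context, -1 otherwise), the hyperparameters, the sample data and the warm-start re-training on the new user's history alone are all assumptions made for this sketch.

```python
from typing import List, Optional, Sequence, Tuple

# A sample pairs a numeric feature vector with a label (assumed encoding):
# +1 = the note was accessed in the given context, -1 = it was not.
Sample = Tuple[Sequence[float], int]


def dot(v: Sequence[float], w: Sequence[float]) -> float:
    return sum(a * b for a, b in zip(v, w))


def train(samples: List[Sample], w: Optional[Sequence[float]] = None,
          epochs: int = 200, lr: float = 0.1, reg: float = 0.01) -> List[float]:
    """Hinge-loss sub-gradient descent; passing w warm-starts re-training."""
    weights = list(w) if w is not None else [0.0] * len(samples[0][0])
    for _ in range(epochs):
        for x, y in samples:
            # The hinge term contributes only when the sample is inside the margin.
            violated = y * dot(x, weights) < 1.0
            for i, xi in enumerate(x):
                grad = reg * weights[i] - (y * xi if violated else 0.0)
                weights[i] -= lr * grad
    return weights


def accuracy(samples: List[Sample], w: Sequence[float]) -> float:
    """Fraction of samples whose sign of V . W agrees with the label."""
    hits = sum(1 for x, y in samples if (dot(x, w) > 0) == (y > 0))
    return hits / len(samples)


# Steps 410-430: hypothetical pre-existing history, split into training and test sets.
training_set = [([1.0, 0.1], 1), ([0.9, 0.0], 1), ([0.0, 1.0], -1), ([0.1, 0.9], -1)]
test_set = [([0.95, 0.05], 1), ([0.05, 0.95], -1)]
w_shipped = train(training_set)                   # step 430: train
shipped_accuracy = accuracy(test_set, w_shipped)  # step 430: evaluate

# Step 440: w_shipped is delivered with the software.
# Steps 450-460: additional history of the new user re-trains the classifier.
user_history = [([0.8, 0.1], 1), ([0.1, 0.8], -1)]
w_user = train(user_history, w=w_shipped)
user_accuracy = accuracy(user_history, w_user)
```

Warm-starting from the delivered normal vector is one possible way to modify the parameters of the classifier at the step 460; re-training on the combined history would be an equally plausible reading.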

Referring to FIG. 5, a flow diagram 500 describes building preferred content views. Processing begins at a step 510 where the system identifies the current context, as described elsewhere herein. After the step 510, processing proceeds to a step 520 where a note or other item is chosen for evaluation. After the step 520, processing proceeds to a step 530 where features relevant to the current context and the chosen note are assessed, as described in more detail elsewhere herein. After the step 530, processing proceeds to a step 540 where numeric feature values for the selected note and the current context are built, as explained elsewhere herein (see, for example, formula (1) and FIG. 3).

After the step 540, processing proceeds to a step 550 where the classifier is applied to a vector of numeric feature values (see, for example, items 340, 350 and the accompanying text in FIG. 3). After the step 550, processing proceeds to a test step 560 where it is determined whether the previous step resulted in assigning the selected note to the preferred view. If so, processing proceeds to a step 570 where the note score obtained during the classification step (such as a cosine of the angle between the vectors V, W explained in conjunction with FIG. 3) is used to calculate note rank with respect to other notes identified as candidates for inclusion in the preferred view (if any). Such ranking may apply to any type of items that may be present in the preferred view: notes, notebooks, tags, saved search queries, etc.

After the step 570, processing proceeds to a test step 575 where it is determined whether the note rank is within a preferred list size. If so, processing proceeds to a step 580 where the note is added to the preferred view list and the list is modified if necessary; for example, a previously included item with a lower score residing at the bottom of the list may be eliminated from the preferred view list. After the step 580, processing proceeds to a test step 585 where it is determined whether there are more notes to evaluate. Note that the step 585 may be independently reached from the step 560 if the selected note is not added to the preferred view and from the step 575 if the note rank is outside the list size. If there are more notes to evaluate, processing proceeds to a step 590 where the next note is chosen and control is transferred back to the step 530; otherwise, processing is complete.
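As a non-limiting illustration, the loop of FIG. 5 over the steps 520-590 may be sketched as follows. The sketch assumes that numeric feature values have already been built for every note (steps 530, 540), uses the cosine of the angle between V and W as the note score, and treats a positive score as assignment to the preferred view; the function names, the note identifiers and the numeric values are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(v: Sequence[float], w: Sequence[float]) -> float:
    """Cosine of the angle between V and W; its sign equals the sign of V . W."""
    norm = math.sqrt(sum(a * a for a in v)) * math.sqrt(sum(b * b for b in w))
    if norm == 0:
        return 0.0
    return sum(a * b for a, b in zip(v, w)) / norm


def build_preferred_view(features: Dict[str, Sequence[float]],
                         w: Sequence[float],
                         max_size: int) -> List[Tuple[str, float]]:
    """Return up to max_size (note, score) pairs, highest score first."""
    preferred: List[Tuple[str, float]] = []
    for note_id, v in features.items():      # steps 520 and 590: choose a note
        score = cosine(v, w)                 # step 550: apply the classifier
        if score <= 0:                       # step 560: not assigned to the view
            continue
        # Step 570: rank among the candidates already in the list.
        rank = sum(1 for _, s in preferred if s >= score)
        if rank >= max_size:                 # step 575: outside the list size
            continue
        preferred.insert(rank, (note_id, score))  # step 580: add the note
        del preferred[max_size:]             # drop the lowest-scored overflow item
    return preferred


# Hypothetical normal vector and feature vectors for four notes.
w = [1.0, -1.0]
features = {
    "n1": [0.9, 0.1],
    "n2": [0.1, 0.9],
    "n3": [0.6, 0.5],
    "n4": [1.0, 0.0],
}
view = build_preferred_view(features, w, max_size=2)
view_ids = [note_id for note_id, _ in view]
```

With a list size of two, the note n3 is first admitted and then eliminated from the bottom of the list once the higher-scored note n4 is evaluated, which mirrors the list modification described for the step 580.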

Various embodiments discussed herein may be combined with each other in appropriate combinations in connection with the system described herein. Additionally, in some instances, the order of steps in the flowcharts, flow diagrams and/or described flow processing may be modified, where appropriate. Similarly, elements and areas of the screen described in screen layouts may vary from the illustrations presented herein. Further, various aspects of the system described herein may be implemented using software, hardware, a combination of software and hardware and/or other computer-implemented modules or devices having the described features and performing the described functions. A mobile device, such as a cell phone or a tablet, may be used to implement the system described herein, although other devices, such as a laptop computer, etc., are also possible. The mobile device may include software that is pre-loaded with the device, installed from an app store, installed from a desktop (after possibly being preloaded thereon), installed from media such as a CD, DVD, etc., and/or downloaded from a Web site. The mobile device may use an operating system selected from the group consisting of: iOS, Android OS, Windows Phone OS, Blackberry OS and mobile versions of Linux OS. The items may be stored using the OneNote® note-taking software provided by the Microsoft Corporation of Redmond, Washington.

Software implementations of the system described herein may include executable code that is stored in a computer readable medium and executed by one or more processors. The computer readable medium may be non-transitory and include a computer hard drive, ROM, RAM, flash memory, portable computer storage media such as a CD-ROM, a DVD-ROM, a flash drive, an SD card and/or other drive with, for example, a universal serial bus (USB) interface, and/or any other appropriate tangible or non-transitory computer readable medium or computer memory on which executable code may be stored and executed by a processor. The system described herein may be used in connection with any appropriate operating system. Other embodiments of the invention will be apparent to those skilled in the art from a consideration of the specification or practice of the invention disclosed herein. It is intended that the specification and examples be considered as exemplary only, with the true scope and spirit of the invention being indicated by the following claims.