Calculating Confidence score

Confidence Score is one of the most important factors in Knowledge search. It reflects how confident the system is that the Knowledge Artifact answers the user’s query. The confidence score for an Artifact is calculated based on the metadata matches. It determines the final search results displayed to end-users.

In Luma Knowledge, the Confidence score is a number ranging from 0 to 1 and indicates the relevancy of an Artifact. Higher the confidence score, the better the Knowledge.

Each Metadata type in Luma Knowledge is allocated a specific Confidence Score Weightage, which is used to calculate the Confidence score for the matching artifact. Confidence Score Weightage is allocated based on the importance of metadata type and the structure of your Knowledge base.

When an end-user searches for Knowledge, the system generates metadata from the user query. The query’s metadata is matched with the metadata of the Artifacts in Knowledge Base to filter the relevant artifacts. As matching metadata is found, the respective score is awarded to the artifact. More the number of matching metadata, the more the confidence score.

In Luma Knowledge, there are two ways to calculate the Confidence score:

1 Static Confidence Weightage Allocation
2 Dynamic Confidence Weightage Allocation

Static Confidence Weightage Allocation

This is the default confidence weightage allocation method used in Luma Knowledge. In 'Static' allocation, the confidence score weightage assigned to each Metadata Type is fixed and the Artifacts Confidence score is evaluated accordingly. By default, the following weightage is assigned to each metadata type:

Topic	Path (Parent Topics if Artifact Topic does not match)	Subject	Action	Motivation

Topic	Path (Parent Topics if Artifact Topic does not match)	Subject	Action	Motivation
40	35	30	25	5

For a user query, confidence score weightage is assigned based on the ontology generated from the user phrase. This means, that the more metadata types are identified from the user phrase, the more confidence score is available for allocation to matching Artifacts.

Let us discuss the following example to understand how the Confidence score is calculated:

Example 1: “Describe Knowledge Management”- Query generates Metadata Topic, Subject, and Action

From the user query 'Describe Knowledge Management', metadata types Topic, Subject, and Action are generated. Based on the identified Metadata, the confidence score weightage is allocated to each metadata type. Since Motivation is not identified from the user phrase, it is not used for confidence score evaluation.

Metadata type	Ontology generated	Weightage assigned based on the user phrase

Metadata type	Ontology generated	Weightage assigned based on the user phrase
Topic	Knowledge Management*	40
Subject	Knowledge Management, Knowledge Management*	30
Action	Describe	25
Motivation		NA
Path		0

Luma Knowledge identifies Artifacts that match with the ontology generated from the user query. Based on the metadata type and number of metadata matches, the confidence score for the matching Artifacts is calculated.

Artifact	Matched Metadata	Confidence Score

Artifact	Matched Metadata	Confidence Score
What is Luma Knowledge	path : Luma Knowledge Management, Knowledge Management topic : Knowledge Management action : describe subject : Luma knowledge, Knowledge	0.95 (40+30+25)
What is the intent of the Retrieval Accuracy Graph	path : Luma Knowledge Management, Knowledge Building	0.35 (35)
What is the intent of the Feedback Response Rate Graph	path : Luma Knowledge Management, Knowledge Building	0.35 (35)

Example 2: Multiple Subject match

If a search query generates multiple Subjects and more than one matching subject is available in the Artifacts, an additional weightage of 3% for the phrase is assigned to the metadata type ‘Subject’. The additional weightage for the 'Action' metadata is reduced and assigned to the Subject. The identified metadata is used to calculate the confidence score.

For user query “How to create a dropbox account“, metadata types Topic, Subject, Motivation, and Action are generated.

Metadata type	Ontology generated	Weightage assigned based on the user phrase

Metadata type	Ontology generated	Weightage assigned based on the user phrase
Topic	dropbox, Dropbox Account	40
Subject	dropbox account, dropbox, Dropbox Account	30
Action	create	25
Motivation	how	5
Path		0

Based on the identified metadata, the confidence score for the matching Artifacts is calculated.

Artifact	Matched Metadata	Confidence Score

Artifact	Matched Metadata	Confidence Score
ChangePassword-Dropbox	path : Dropbox topic : Dropbox subject : dropbox account, dropbox	0.73 (40+33)
How to Sign in to Dropbox	path : Dropbox topic : Dropbox subject : Account motivation : How* to*	0.75 (40+30+5)
How to change your password for Dropbox	path : Dropbox topic : Dropbox motivation : How* to*	0.45 (40+5)

In case of multiple subject matches, weightage of 3% is assigned for every additional phrase. i.e.
- For 2 subject matches, an additional 3% weightage is assigned to 'Subject' metadata. The confidence weightage is increased to 33%.
- For 3 or more subject matches, an additional 6% weightage is assigned. The confidence weightage is increased to 36%
The additional weightage is decreased from the weightage of Action metadata. This means, If Weightage for Subject is 33%, Action is reduced to 22% (25-3).

Dynamic Confidence Weightage Allocation

In Dynamic confidence score allocation, the confidence score is allocated based on the ontology generated from the user query. The confidence score is always calculated based only on the metadata identified. This ensures that Luma Knowledge can find matching Knowledge Artifacts even if the queries are ambiguous or do not generate all the types of metadata.

In dynamic allocation, the metadata types are not assigned a fixed weightage. It is calculated, dynamically based on the metadata identified and the Base weightage points for each metadata type.

There are four steps in calculating the Confidence score in dynamic allocation:

Configure Base points

Base points are the base scores allocated to each metadata type. Every phrase/word identified as the metadata is assigned the configured base score. In other words, if the base score of ‘Action’ metadata is 25, each word identified as ‘Action’ from the query is assigned a weightage of 25.

By default, the base points are configured as below and can be updated as required.

Topic, Path and Subject	Action	Motivation

Topic, Path and Subject	Action	Motivation
70	25	5

The Base points for the metadata types are configurable and can be updated based on your organization’s required. Currently, the configuration is available in the backend. You may contact the Serviceaide support team to update the configuration.
A total of 100 Base points can be divided among the metadata types.

Calculate Total Weightage points for the query and matched phrases

Using the Base points per metadata type, the Total Weightage points for the search are calculated.

For the ontology generated from the search query, the Weightage points for each metadata type and the Total Weightage points for the search query are calculated. The Total Weightage point is a sum of the weightage points for the identified metadata types. This score is used to derive the Confidence weightage that can be allocated to Artifacts.

For example, if a search query generates metadata Topic, Subject, Action and Motivation:

Base points for Topic and Subjects = 70
Number of words/phrases identified as Topic and Subjects identified = 2
Weightage points for Topic and Subjects = 140 (calculated as 70 x 2 )

Base points for Action = 25
Number of words/phrases identified as Action = 2
Weightage points for Action = 50 (calculated as 25 x 2 )

Base points for Motivation = 5
Number of words/phrases identified as Motivation = 1
Weightage points for Motivation = 5 (calculated as 5 x 1 )

Total Weightage points for the search = 195 ( calculated as 140+50+5)

Derive Confidence Weightage

Now using the Weightage points for each metadata type and Total Weightage points, we can derive the Confidence weightage for each metadata type. Based on metadata generated from the user query, the confidence weightage is calculated. The weightage is calculated only for the metadata type identified from the query string.

In the above example,

Total Weightage points for the search = 195

Weightage points for Topic and Subjects =140
Confidence Weightage for Topic and Subjects = 0.72 ( calculate as 140/195)
Weightage for each phrase = 0.36 ( calculated as 0.72/2 = 0.36)

Weightage points for Action = 50
Confidence Weightage for Action = 0.26 ( calculate as 50/195)
Weightage for each phrase = 0.13 ( calculated as 0.26/2 = 0.13)

Weightage points for Motivation = 5
Confidence Weightage for Motivation = 0.02 ( calculate as 5/195)
Weightage for each phrase = 0.02 ( calculated as 0.02/1 = 0.02)

Calculate the Confidence score for Artifact

Using the confidence weightage for the identified metadata, the confidence score for the matching Knowledge artifacts is calculated. Metadata generated from the search query is matched with the Artifact's metadata and the confidence score is assigned accordingly.

For an Artifact with matching metadata, the confidence score is calculated as below:

Metadata	Artifact 1		Artifact 2		Artifact 3
Metadata	Matching phrases/words	Confidence Score (Weightage for each phrase * number of phrases)	Matching phrases/words	Confidence Score (Weightage for each phrase * number of phrases)	Matching phrases/words	Confidence Score (Weightage for each phrase * number of phrases)
Topic, Subject, Path	2	0.72 (calculated as 0.36 * 2)	1	0.36 (calculated as 0.36 * 1)	2	0.72 (calculated as 0.36 * 2)
Action	2	0.26 (calculated as 0.13 * 2)	1	0.13 (calculated as 0.13 * 1)	1	0.13 (calculated as 0.13 * 1)
Motivation	0	0	1	0.02 (calculated as 0.02 * 1)	1	0.02 (calculated as 0.02 * 1)
Total		0.98		0.51		0.87

Examples

Let us look at the following examples to understand how the Confidence score is calculated:

Example 1: “How to login to dropbox“

From the user query 'How to login to dropbox', metadata types Topic, Subject, Action, and Motivation are generated. Based on the identified Metadata and configured base point, the confidence score is allocated to each metadata type.

Metadata type	Ontology generated	Weightage points per Metadata	Total Weightage points for the query	Confidence weightage per phrase

Metadata type	Ontology generated	Weightage points per Metadata	Total Weightage points for the query	Confidence weightage per phrase
Topic	dropbox*	Base points= 70 Number of phrases identified = 1 Weightage score = 70 (calculated as 70*1)	100 (calculated as 70+25+5)	Confidence Weightage = 0.70 ( calculate as 70/100) Weightage per phrase = 0.70 ( calculated as 0.70/1 = 0.70)
Subject	dropbox, dropbox*
Action	login	Base points= 25 Number of phrases identified = 1 Weightage score = 25 (calculated as 25*1)		Confidence Weightage = 0.25 ( calculate as 25/100) Weightage per phrase = 0.25 ( calculated as 0.25/1 = 0.25)
Motivation	how	Base points= 5 Number of phrases identified = 1 Weightage score = 5 (calculated as 5*1)		Confidence Weightage = 0.05 ( calculate as 5/100) Weightage per phrase = 0.05 ( calculated as 0.05/1 = 0.05)

Based on the calculated weightage per phrase, Confidence score for the artifact is calculated.

Artifact	Matched Metadata	Confidence Score

Artifact	Matched Metadata	Confidence Score
How to change your password for Dropbox	path : Dropbox topic : Dropbox motivation : How to	0.75 (calculated as 0.70+0.05)
ChangePassword-Dropbox	path : Dropbox topic : Dropbox subject : dropbox account, dropbox	0.7
Dropbox Description	path : Dropbox topic : Dropbox	0.7

Example 2: “Apply for a Building Permit“

From the user query 'Apply for a Building Permit', metadata types Topic, Subject, and Action are generated. Based on the identified Metadata and configured base point, the confidence score is allocated to each metadata type.

Metadata type	Ontology generated	Weightage points per Metadata	Total Weightage points for the query	Confidence weightage per phrase

Metadata type	Ontology generated	Weightage points per Metadata	Total Weightage points for the query	Confidence weightage per phrase
Topic	permits, permit, building permit	Base points= 70 Number of phrases identified = 2 Weightage score = 140 (calculated as 70*2)	165 (calculated as 140+25)	Confidence Weightage = 0.85 ( calculate as 140/165) Weightage per phrase = 0.425 ( calculated as 0.85/2)
Subject	permits, permit, building permit
Action	Apply	Base points= 25 Number of phrases identified = 1 Weightage score = 25 (calculated as 25*1)		Confidence Weightage = 0.15 ( calculate as 25/165) Weightage per phrase = 0.15 ( calculated as 0.15/1)
Motivation	-	-		-

Based on the calculated weightage per phrase, the Confidence score for the artifact is calculated.

Artifact	Matched Metadata	Confidence Score

Artifact	Matched Metadata	Confidence Score
Apply for a Building Permit	path : Building Permits, Construction Permits, Permits, Licenses, and Inspections topic : Building Permits action : apply subject : permit, building	1.0 (calculated as 0.85+0.15)
Building Permits Online	path : Building Permits, Construction Permits, Permits, Licenses, and Inspections topic : Building Permits subject : permit	0.85
Board of Equalization, BOE	path : Sellers Permit, Business Permits, Permits, Licenses, and Inspections topic : Sellers Permit action : apply subject : permit	0.575 (calculated as 0.425+0.15)

Example 3: “Wifi Router“

From the user query 'Wifi Router', metadata types Topic and Subject are generated. Based on the identified Metadata and configured base point, the confidence score is allocated to each metadata type.

Metadata type	Ontology generated	Weightage points per Metadata	Total Weightage points for the query	Confidence weightage per phrase

Metadata type	Ontology generated	Weightage points per Metadata	Total Weightage points for the query	Confidence weightage per phrase
Topic	router, wifi router	Base points= 70 Number of phrases identified = 2 Weightage score = 140 (calculated as 70*2)	140 (calculated as 140+0+0)	Confidence Weightage = 1 ( calculate as 140/140) Weightage per phrase = 0.5 ( calculated as 1/2)
Subject	wifi router, router, wifi router
Action	-	-		-
Motivation	-	-		-