SharePoint Connector - Metadata
Ingested Metadata
The connector retrieves documents and their metadata via the Microsoft Graph API. For each document, it calls the Graph drive items endpoint with an expanded listItem and fields, which returns all SharePoint list columns. During ingestion, the connector passes these column values as metadata to the Unique platform.
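As a rough illustration of the request described above, the following Python sketch assembles a Graph URL that lists a folder's children with the list item and its fields expanded. The property list is taken from the call documented on this page; the function name `build_children_url` and the `v1.0` base URL are assumptions for illustration, and the connector's actual implementation is not shown here.

```python
GRAPH_BASE = "https://graph.microsoft.com/v1.0"  # assumed API version

# Properties requested on each drive item (from the call documented on this page).
SELECT_FIELDS = [
    "id", "name", "webUrl", "size", "createdDateTime", "lastModifiedDateTime",
    "createdBy", "folder", "file", "listItem", "parentReference",
]


def build_children_url(drive_id: str, item_id: str) -> str:
    """Return the URL that lists a folder's children with SharePoint columns expanded."""
    select = ",".join(SELECT_FIELDS)
    # listItem($expand=fields) asks Graph to inline the SharePoint column values.
    expand = "listItem($expand=fields)"
    return (
        f"{GRAPH_BASE}/drives/{drive_id}/items/{item_id}/children"
        f"?$select={select}&$expand={expand}"
    )


# Hypothetical identifiers, for illustration only.
print(build_children_url("example-drive-id", "example-item-id"))
```

The query string is built by hand rather than URL-encoded so that the `$select` and `$expand` parameters stay readable and match the documented call.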
Regular Documents (Document Libraries)
For regular documents in document libraries, the connector passes through all SharePoint column values as-is (the full fields bag from Graph), plus the following derived fields:
| Field | Source | Description |
|---|---|---|
| | | The document's web URL |
| | Same as | Alias for the document's web URL |
| | Parent reference path | The folder path within the drive |
| | Drive identifier | The Microsoft Graph drive ID |
| | | The Microsoft Graph drive item ID (a longer unique identifier) |
| | | The file's leaf name |
| | | Structured object with |
Native SharePoint columns that come through include (among others): FileLeafRef, Modified, Created, ContentType, AuthorLookupId, EditorLookupId, FileSizeDisplay, Title, plus any custom columns defined on the library.
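To make the pass-through concrete, here is a minimal sketch of pulling the column values out of a drive item that was returned with `listItem` and `fields` expanded. The `listItem.fields` nesting follows the Graph response shape; the function name and the sample values are made up for illustration, and the derived fields are deliberately left out.

```python
from typing import Any


def extract_column_values(drive_item: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the SharePoint column values (the Graph `fields` bag)."""
    list_item = drive_item.get("listItem") or {}
    fields = list_item.get("fields") or {}
    # Copy so later metadata changes do not mutate the raw Graph response.
    return dict(fields)


# Hypothetical drive item, trimmed to the parts relevant here.
sample_item = {
    "id": "01ABCDEF",
    "name": "report.docx",
    "listItem": {
        "fields": {
            "FileLeafRef": "report.docx",
            "Title": "Quarterly report",
            "ContentType": "Document",
            "Department": "Finance",  # example of a custom library column
        }
    },
}

metadata = extract_column_values(sample_item)
print(sorted(metadata))
```

Because the whole bag is copied rather than filtered, custom columns such as the hypothetical `Department` above come through without any per-library configuration.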
Graph API Call
GET /drives/{driveId}/items/{itemId}/children
?$select=id,name,webUrl,size,createdDateTime,lastModifiedDateTime,createdBy,folder,file,listItem,parentReference
&$expand=listItem($expand=fields)
SitePages
SitePages (pages created directly in SharePoint, such as news posts or wiki pages) are handled differently from regular documents:
| Field | Source | Description |
|---|---|---|
| | | The page's web URL |
| | Same as | Alias for the page's web URL |
| | | The page's web URL (used as folder path) |
| | SitePages list ID | The SitePages list identifier (not a document library drive ID) |
| | | Sequential list item ID (e.g. 1, 2, 3), unique only within that SitePages list |
| | | The page's file name |
| | | Structured object with |
| | | Content approval status (only present for SitePages) |
SitePages carry a reduced set of SharePoint column values: FileLeafRef, FileSizeDisplay, Title, AuthorLookupId, EditorLookupId, _ModerationStatus, and the sync flag column.
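The reduced column set could be turned into a list items request along these lines. The column and property names come from this page; `build_sitepages_url`, the `v1.0` base URL, and the optional `extra_columns` parameter (for example for the sync flag column, whose name is not given here) are assumptions made for this sketch.

```python
GRAPH_BASE = "https://graph.microsoft.com/v1.0"  # assumed API version

# Reduced set of SharePoint columns requested for SitePages.
SITEPAGE_COLUMNS = [
    "FileLeafRef", "FileSizeDisplay", "_ModerationStatus",
    "Title", "AuthorLookupId", "EditorLookupId",
]

# Top-level list item properties requested alongside the columns.
ITEM_PROPERTIES = [
    "id", "createdDateTime", "lastModifiedDateTime",
    "webUrl", "createdBy", "lastModifiedBy",
]


def build_sitepages_url(
    site_id: str, list_id: str, extra_columns: tuple[str, ...] = ()
) -> str:
    """Return the URL that lists SitePages items with a limited set of columns."""
    columns = ",".join([*SITEPAGE_COLUMNS, *extra_columns])
    select = ",".join(ITEM_PROPERTIES)
    return (
        f"{GRAPH_BASE}/sites/{site_id}/lists/{list_id}/items"
        f"?$select={select}&$expand=fields($select={columns})"
    )


# Hypothetical identifiers, for illustration only.
print(build_sitepages_url("example-site-id", "example-list-id"))
```

Restricting `fields` with a nested `$select` is what keeps the SitePages metadata smaller than the full bag returned for regular documents.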
Graph API Call
GET /sites/{siteId}/lists/{listId}/items
?$select=id,createdDateTime,lastModifiedDateTime,webUrl,createdBy,lastModifiedBy
&$expand=fields($select=FileLeafRef,FileSizeDisplay,_ModerationStatus,Title,AuthorLookupId,EditorLookupId)
Key Differences Between Regular Documents and SitePages
| Aspect | Regular Documents | SitePages |
|---|---|---|
| Metadata fields | Full set of all SharePoint column values | Reduced set ( |
| ItemInternalId | Graph API drive item ID (long unique identifier) | Sequential list item ID (e.g. 1, 2, 3), unique only within that SitePages list |
| DriveId | Document library drive ID | SitePages list ID |
| ModerationStatus | Not present | Content approval status from |
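Because a SitePages ItemInternalId is only unique within its list, code that consumes this metadata may want to pair it with DriveId before treating it as an identifier. The sketch below shows one way to do that; the function name and the tuple-based key are hypothetical and not part of the connector.

```python
def composite_item_key(drive_id: str, item_internal_id: str) -> tuple[str, str]:
    """Pair DriveId with ItemInternalId so the key stays unique across lists."""
    return (drive_id, item_internal_id)


# Two different SitePages lists can both contain an item with ID "1".
key_a = composite_item_key("list-a", "1")
key_b = composite_item_key("list-b", "1")

# The bare item IDs collide, the composite keys do not.
print(key_a[1] == key_b[1], key_a == key_b)
```

A tuple is used instead of a joined string so that no separator character has to be reserved.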
Related Documentation
Flows - Content sync, ASPX page processing, file diff mechanism
Architecture - System components and Graph API endpoints
Permissions - Required API permissions
Standard References
Microsoft Graph API - DriveItem - DriveItem resource
Microsoft Graph API - ListItem - ListItem resource
Microsoft Graph API - FieldValueSet - SharePoint column values