Digital WEMI and the httpRange-14 Problem (ChatGPT)

Digital WEMI and the httpRange-14 Problem (ChatGPT)#

The following Article was generated by ChatGPT under Supervision of Johannes Busse.

License: The ideas (WEMI: work level) expressed in this text (WEMI: expression level) reflect our specific scientific work. We primarily claim copyright here. In Europe, copyright does not normally apply to computer-generated text (WEMI: manifestation level). However, we have worked hard to create this text as a work of art. Therefore, we claim copyright on the manifestation level as well. So we are free to give the copyright back to the open content community and publish this work under Creative Commons CC-BY. ;-)

Article: Digital WEMI / httpRange-14#

Full Title (as suggested by ChatGPT): Digital WEMI and the httpRange-14 Problem: A New Perspective on Old Questions.
Author: ChatGPT.com, Model GPT-4o.
Prompts: WebArch and WEMI: Dialogue (ChatGPT)
Supervisor: Johannes Busse, www.jbusse.de, April 2025

1. Problem Statement#

The architecture of the web is based on a simple yet profound concept: URIs identify resources, and HTTP GET requests return representations of those resources. This seemingly clear paradigm has long been debated in the web community, culminating in the so-called httpRange-14 issue. The central question: What exactly does a URI identify? A real-world object (a “non-information resource”), or merely a digital document – that is, a representation?

The solution found at the time, especially through the use of HTTP redirects for “non-document-like” resources, was pragmatic but not fundamentally satisfying. It remains inconsistently applied and semantically unclear.

A well-known cause of confusion was the lack of clarity between resource and representation. However, a deeper, less acknowledged cause lay in the insufficient differentiation within the concept of representation itself: What is a model? What is a document? What is a file? These distinctions were already available – in the form of the FRBR model (Functional Requirements for Bibliographic Records) and its four-level structure: Work, Expression, Manifestation, Item (WEMI).

Whether the web architecture community and the library modeling community simply didn’t know each other or consciously ignored each other remains open. But today, in a digital world of complex information systems, automated content generation by AI, and increasing challenges from fake news and data manipulation, the time is ripe for a renewed and deeper look at this problem.

Since 2016, the IFLA LRM (Library Reference Model), a modern successor to FRBR, has been an internationally accepted standard. It offers fertile ground for application in web architecture. The growing importance of data quality and provenance – as reflected in the W3C standard vocabulary PROV-O – underscores the need to rethink representation as a multi-level category.

2. Proposed Approach: Digital WEMI#

We propose a reinterpretation of the httpRange-14 problem using the Digital WEMI concept. This approach gently adapts the logic of FRBR/IFLA LRM and applies its four levels to digital entities.

2.1 URI Agreement:#

Work, Expression, Manifestation: use hash URIs (#) and are not directly addressable.
Item: uses a slash URI and is explicitly accessible, e.g., via HTTP.

This convention creates clarity: only an Item is a “web resource” in the sense of HTTP GET. The higher levels are conceptual constructs.

2.2 Definition: Digital WEMI#

Level	Meaning in the Digital Realm	Characteristic	Typical Operations
Work	Idea, concept, abstraction	Conception, intention	Thinking, planning
Expression	Description, model, language (EN, DE, XML, RDF, UML…)	Semantics, structure	Reading, interpreting
Manifestation	Encoding, format (HTML, PDF, JSON-LD, CSV…)	Format, document structure	Writing, parsing, validating
Item	Specific storage location (file, URL, disk path…)	Location, accessibility	HTTP access, read permissions, download

2.3 Example: Weather Data#

Work: All weather data of the Earth at all times
Expression: Modeling the weather data as an RDF ontology in OWL
Manifestations:
- JSON-LD files
- CSV files Both are distributed and stored in fragments, e.g., by day and region.
Items:
- https://data.example.org/weather/2024-01-01/europe.jsonld
- https://data.example.org/weather/2024-01-01/europe.csv

2.4 Structural Relationships in WEMI and Digital WEMI#

WEMI distinguishes two types of structural relationships:

Vertical relationships connect levels in the hierarchy.
Horizontal relationships link entities within the same level.

Vertical relationships express structural dependencies:

wemi:realizes: connects an Expression to a Work
wemi:embodies: connects a Manifestation to an Expression
wemi:exemplarOf: connects an Item to a Manifestation

:expressionX wemi:realizes :workY .
:manifestationZ wemi:embodies :expressionX .
:item1 wemi:exemplarOf :manifestationZ .

Horizontal relationships are useful for modeling modularity, variants, and part-whole structures:

Work level:
- wemi:hasPart: A global weather work made up of regional studies.
Expression level:
- wemi:isTranslationOf: A conceptual model in different natural languages.
- wemi:hasPart: A modular schema.
- wemi:hasVariant: Same model in RDF and UML.
Manifestation level:
- wemi:hasPart: Daily or regional data slices.
- wemi:hasFormat: Same content in different serializations.
Item level:
- wemi:hasCopy: Mirrors, local files, or archival duplicates.

This relational structure supports scalable, transparent digital ecosystems.

2.5 Mapping Dublin Core to Digital WEMI#

Dublin Core (DC) is a widely used metadata standard designed for simplicity and cross-domain interoperability. Its generality contrasts with the precision of WEMI. The following table illustrates how selected DC terms might map to WEMI levels:

Dublin Core Term	WEMI Level(s)	Mapping Clarity
`dc:title`	Work, Expression	Clear at both conceptual and model level
`dc:creator`	Work	Clear: creator of the intellectual content
`dc:language`	Expression	Clear: applies to language of the model or content
`dc:format`	Manifestation	Clear: refers to encoding (e.g., PDF, JSON-LD)
`dc:identifier`	All	Ambiguous: could refer to any level
`dc:source`	Work, Expression	Ambiguous: depends on context of derivation
`dc:date`	Manifestation, Item	Partially clear: creation/publication/access date
`dc:rights`	Manifestation, Item	Partially clear: rights often apply to distributions
`dc:subject`	Work	Clear: aboutness or topical classification

While the mapping is not always exact, applying WEMI concepts to DC metadata can help disambiguate intent and usage. For example, making explicit whether a dc:identifier refers to a Work or an Item improves metadata quality and linked data interoperability.

2.6 WEMI as Database Normalization#

The WEMI model, originally developed within bibliographic theory, also lends itself well to interpretation through the lens of data modeling—specifically, relational database normalization.

Flat Metadata: Dublin Core as 1NF or Pre-NF#

Dublin Core (DC) was designed with a “lowest common denominator” philosophy, favoring simplicity and broad interoperability over strict semantic differentiation. The result is a flat structure, where all properties are applied directly to a single undifferentiated “resource.”

In terms of database theory, this resembles a non-first-normal-form (NFNF) or perhaps first-normal-form (1NF) table: everything is lumped into one entity, with potential for data duplication and ambiguity.

Normalized Semantics: WEMI as 3NF#

In contrast, the WEMI model can be seen as a process of semantic normalization, similar to moving a database into third normal form (3NF):

Entities (Work, Expression, Manifestation, Item) are clearly separated.
Dependencies are explicitly defined via relationships.
Redundancy is minimized; every piece of data is stored in exactly the right context.

This decomposition aligns directly with WEMI logic:

dc:title belongs at the Work level.
dc:language belongs at the Expression level.
dc:format applies to the Manifestation.
dc:identifier finally makes sense at the Item level (e.g., the resolvable URI).

Semantic Gains#

The benefits of viewing WEMI as normalization include:

Improved data quality: Clear separation avoids conflicting interpretations.
Better interoperability: Systems can align more easily when structure is explicit.
Enriched provenance: Provenance assertions become more precise when scoped to the correct level.
Deeper metadata reuse: The same Work or Expression can be reused across many contexts, reducing duplication.

Comparison Table: Dublin Core vs. FRBR/WEMI#

Concept	Dublin Core	FRBR / WEMI
Structure	Flat	Entity-relationship
Normalization Level	NFNF or 1NF	3NF (semantically normalized)
Redundancy	High	Minimal
Disambiguation of Roles	Poor (e.g., `dc:contributor`)	High (e.g., Creator vs Realizer vs Producer)
Relationships Modeled?	Implicit or vague	Explicit and formalized
Semantic Expressiveness	Limited	Rich and nuanced
Provenance Modeling	Weak	Precise, especially when combined with PROV-O
Suited for Linked Data?	Adequate for basic metadata exchange	Strong, particularly in RDF and OWL contexts
Interoperability	High (cross-domain)	High (domain-specific, can be generalized)
Ease of Use	Simple, intuitive	Requires expertise and modeling effort

Conclusion#

Thus, just as database normalization improves integrity and clarity in relational systems, WEMI brings the same rigor to metadata and web architecture. Dublin Core offers a useful entry point, but WEMI enables precision modeling necessary for today’s distributed and semantically rich digital environments.

3. Execution: httpRange-14 with Digital WEMI#

Instead of asking: “Does the URI identify a resource or a representation?”, we ask the more nuanced question:

“On which level of the WEMI hierarchy does the URI operate?”

This rephrasing allows for semantically precise interpretation of URIs and supports more granular and expressive web architecture practices.

Let us now deconstruct a concrete digital use case step-by-step, in accordance with each level of Digital WEMI:

3.1 Work-Level URI#

http://example.org/weather#work

Refers to the conceptual entity of “global weather data” as an intellectual construct.
It cannot be retrieved via HTTP GET because it is not materialized.
Useful in modeling and conceptual linking.

3.2 Expression-Level URI#

http://example.org/weather#expression

Refers to the modeling of the Work, such as an RDF ontology or UML schema.
Still not directly retrievable, but semantically referenceable.
Multiple expressions may exist: in English prose, in OWL, or in UML.

3.3 Manifestation-Level URI#

http://example.org/weather#manifestation-json

Denotes a particular encoding of the RDF model, such as JSON-LD.
Provides technical realization details (format, encoding).
Enables media type negotiation or documentation.

3.4 Item-Level URI#

https://data.example.org/weather/2024-01-01/europe.jsonld

A concrete, dereferenceable file.
Accessible via HTTP GET; representation is returned.
This is the only WEMI level that is actually “on the web.”

3.5 Implications for httpRange-14#

By introducing the Digital WEMI model, we avoid the forced binary of resource vs. representation. Each URI is semantically grounded in the WEMI layer. HTTP GET becomes relevant only for Items.

Redirects, content negotiation, and metadata annotations gain precision. More importantly, we gain a consistent pattern to model provenance.

4. Result: Applying PROV-O Horizontally in Digital WEMI#

We now propose a key refinement:

PROV-O relations (e.g., prov:wasDerivedFrom, prov:wasGeneratedBy) should be applied only horizontally – i.e., between entities on the same WEMI level.

This avoids semantic confusion and allows for structured provenance chains within each layer:

4.1 Examples#

Work-to-Work:
- A climatological model of 2024 derived from a 2020 model.
- :weatherWork2024 prov:wasDerivedFrom :weatherWork2020 .
Expression-to-Expression:
- A revised OWL schema replacing an earlier version.
- :weatherSchemaV2 prov:wasRevisionOf :weatherSchemaV1 .
Manifestation-to-Manifestation:
- A GZIPped Turtle file derived from the original.
- :data.ttl.gz prov:wasDerivedFrom :data.ttl .
Item-to-Item:
- A local file cache derived from a remote item.
- :cacheFile prov:wasDerivedFrom :originFile .

4.2 Vertical Relationships via WEMI Structure#

To express vertical links between WEMI levels, we use structural relationships, not provenance:

:expressionX wemi:realizes :workY .
:manifestationZ wemi:embodies :expressionX .
:item1 wemi:exemplarOf :manifestationZ .

This preserves clarity, supports layered provenance chains, and aligns with library science principles.

5. Conclusion#

The combination of Digital WEMI and PROV-O provides a new semantics for URIs, representations, and provenance on the web. It enables a clear differentiation between idea, model, format, and file – thereby not only revisiting the httpRange-14 problem but also offering future-proof solutions for data quality, transparency, and traceability in the digital space.

By restricting provenance relationships to same-level entities, and separating structural relationships across WEMI layers, we gain modeling clarity and semantic rigor.

Digital WEMI can be established as a semantic framework in numerous data-intensive domains – from libraries to science to public sector data – finally bringing together communities that were once separated.