Wikidata:Property proposal/Generic


This page is for the proposal of new properties.

Before proposing a property

  1. Check whether the property already exists.
  2. Check whether the property has already been proposed.
  3. Check whether you can give it a label and definition similar to an existing Wikipedia infobox parameter, or whether it can be matched to an infobox to or from which data can be transferred automatically.
  4. Select the right datatype for the property.
  5. Read Wikidata:Creating a property proposal for the guidelines you should follow when proposing a new property.
  6. Start writing the documentation based on the preload form below by editing the two templates at the top of the page to add proposal details.

Creating the property

  1. Once consensus is reached, set status=ready on the template to attract the attention of a property creator.
  2. A property creator or an administrator can create the property 1 week after the proposal was created.
  3. See property creation policy.

General

Publication type of scholarly article

Status: Under discussion
Description: Publication type of scholarly article
Data type: Item
Template parameter: Different from publication type as used, for example, in w:Template:Infobox short story
Domain: Instances of scholarly article (Q13442814) and its subclasses.
Allowed values: Permitted values typically should be potential subclasses of scholarly work (Q55915575). In practice, scholarly article (Q13442814) items, which number in the tens of millions, carry a diverse range of additional instance of (P31) statements, and some cleanup work is anticipated for both domain and range.
Example 1: Malaria and the microbiome: a systematic review (Q56383548) → systematic review (Q1504425)
Example 2: NIH Consensus conference. Gallstones and laparoscopic cholecystectomy (Q70552083) → NIH consensus development conference summary (Q27718083)
Example 3: Practice guidelines for the management of bacterial meningitis (Q33982444) → medical guideline (Q878041)
Source: The new statements will initially be generated by rules from existing statements, as a follow-up to the WDQS split.
Robot and gadget jobs: Bots will be used heavily to implement the migration from instance of (P31) statements.
Wikidata project: Wikidata:WikiCite

Motivation

Currently these publication types of articles are added as instance of (P31) statements, but better data modelling can follow from having a separate property. For example, on clinical trial (Q30612) under MeSH descriptor ID (P486) the publication type meaning is at present given preferred rank over the "clinical trials as topic" meaning. It would be better not to overload the item in this way, given the importance of clinical trials in medical research. We should have two items, one of which should only be used in "publication type of scholarly article" statements.

This idea was mentioned already several years ago. It comes up now because of the graph split treating the scholarly article items as a graph in their own right. See Wikidata talk:WikiCite#Community input into WDQS graph split: a publication type property proposal for a preliminary discussion. That thread links to a graph split page which goes into fuller details of the technical side. I've been asked by the developers working on the split to make this proposal. @Daniel Mietchen: @Bluerasberry: @Sj:

While the graph split will make SPARQL queries more complex, good can come of it if this proposed property is created, and some systematic work goes on to sort out the current overloading of dozens of items. Charles Matthews (talk) 10:35, 12 September 2024 (UTC)
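The rule-based migration described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the proposed property has no ID yet, so `P_PUBLICATION_TYPE` is a hypothetical placeholder, and the rule table is an assumed example built from the three example values in the proposal, not the actual rules a bot would use.

```python
# Hypothetical placeholder: the proposed property has not been created yet.
PROPOSED = "P_PUBLICATION_TYPE"
SCHOLARLY_ARTICLE = "Q13442814"

# Assumed rule table: P31 values treated as publication types.
# The three IDs are the example values given in the proposal.
PUBLICATION_TYPES = {
    "Q1504425",   # systematic review
    "Q27718083",  # NIH consensus development conference summary
    "Q878041",    # medical guideline
}


def migrate(statements):
    """Move publication-type values from P31 to the proposed property.

    `statements` maps property IDs to lists of item IDs for a single item.
    Items that are not scholarly articles are returned as an unchanged copy.
    """
    p31 = statements.get("P31", [])
    result = {prop: list(values) for prop, values in statements.items()}
    if SCHOLARLY_ARTICLE not in p31:
        return result
    moved = [v for v in p31 if v in PUBLICATION_TYPES]
    if moved:
        result["P31"] = [v for v in p31 if v not in PUBLICATION_TYPES]
        result[PROPOSED] = result.get(PROPOSED, []) + moved
    return result


# Illustrative input only, not a copy of any real item's statements.
article = {"P31": ["Q13442814", "Q1504425"]}
print(migrate(article))
```

A real migration would also have to deal with references, ranks, and the overloaded items mentioned above (for example clinical trial (Q30612)), which this sketch ignores.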

Discussion

I can give a concise kind of reason: if you want to have database constraints that apply in this particular context. Certainly it doesn't make very much sense to have instance of (P31) subjected to database constraints, when it is universal. When you say "should not be used on the items at all" you are arguing for constraints, and the standard way to do that is with a definite property. Charles Matthews (talk) 19:33, 21 September 2024 (UTC)
Makes sense. One thing: I think it would generally be best if the values of a property could be constrained depending on other values or properties of the item, and I think this is already done. Moreover, could you please explain why properties like language of work or name also show values other than languages in the autocomplete box? Prototyperspective (talk) 10:07, 22 September 2024 (UTC)
I don't think I want to talk here about details of constraints, because what is wanted is going to be a community decision anyway. The general principle is to have constraints based on queries, so a list of constraint violations can be generated automatically. In this case it is worth emphasising (a) that there are tens of millions of items involved, and (b) that preliminary checks on the instance of (P31) statements we are starting with show a complex situation. So I don't think we should approach this business with ad hoc ideas. We may end up with a package of constraints that is effective in keeping the data clean, but that would require some effort. Charles Matthews (talk) 11:08, 22 September 2024 (UTC)
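As a rough illustration of the general principle (a constraint expressed as a check whose output is a list of violations), here is a minimal sketch over in-memory data. The function name, the data layout, and the item `Q_EXAMPLE_ITEM` are assumptions made for the example; in practice the allowed set would come from a query, for instance for subclasses of scholarly work (Q55915575), and the package of constraints is still a community decision.

```python
def constraint_violations(claims, allowed_values):
    """Return sorted (item, value) pairs whose value is outside the allowed set.

    `claims` maps an item ID to the list of values it has for the
    proposed property; `allowed_values` is a set of permitted item IDs.
    """
    return sorted(
        (item, value)
        for item, values in claims.items()
        for value in values
        if value not in allowed_values
    )


# Assumed allowed set, using the example values from the proposal.
allowed = {"Q1504425", "Q27718083", "Q878041"}

# Illustrative claims; Q_EXAMPLE_ITEM and its value are made up for the demo.
claims = {
    "Q56383548": ["Q1504425"],
    "Q33982444": ["Q878041"],
    "Q_EXAMPLE_ITEM": ["Q319129"],
}

for item, value in constraint_violations(claims, allowed):
    print(f"{item}: value {value} is not an allowed publication type")
```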
@Nikki: genre (P136) is not similar at all; for scholarly works maybe main subject (P921) would be similar to genre (P136), though perhaps for a real "genre" you would pick a more high-level subject area (mathematics, biology, etc.). This proposed property would be much more similar to form of creative work (P7937) - but obviously we are not talking about "creative works" here I think, at least not as normally understood. Perhaps form of creative work (P7937) could be renamed/extended to support what is wanted here? But I think a separate property for this makes a lot of sense. ArthurPSmith (talk) 17:09, 23 September 2024 (UTC)
  •   Support, for several reasons:
    1. Agree with Jklamo: "instance of" is being used as a data dump, and this property helps correct that.
    2. Nikki's rationale for opposing would apply in most similar cases, and often taking action is debatable, but this is a very unusual case and strong action is merited. As the pie chart shows and Wikidata:Statistics further explains, Wikidata has a huge number of items which are "instance of -> scholarly article". Currently, these are sorted by properties which apply to publications generally, like genre (P136), or by adding additional items to instance of (P31). There are enough of these items to merit specific sorting through a dedicated property.
    3. I have contributed to the problem of adding imprecise data through Wikidata:WikiProject Clinical Trials, where I encouraged tagging scholarly articles about clinical trials as "instance of clinical trial". Although it is common to call such papers "clinical trials", the trial is actually the research experiment itself. There are probably other classes of items loaded into scholarly articles for similar reasons. Sorting this in a dedicated property enables better cleanup outside of P31.
    4. We can reasonably expect editor engagement with this property because WikiCite is a popular project, and everything discussed here is WikiCite-related. For additional context on the community and its projects see meta:WikiCite and Wikidata:WikiCite.
    5. The need to clean this up arises now because of Wikidata:SPARQL query service/WDQS graph split, which I explained in an English Wikipedia Signpost article, "Wikidata to split as sheer volume of information overloads infrastructure".
    6. genre (P136) is not quite a fit in some cases. Some kinds of papers that come up include clinical trial (Q30612), obituary (Q309481), product testing (Q7247798), and letter to the editor (Q651270). These are not already well developed as genres, and I do not think Wikidata should set a precedent of overloading the concept of genre with such a new application of the term. It makes sense to me to have a new term for Wikidata's needs which is not already loaded with meaning from existing disciplines.
I do not have all the answers and I am uncertain about how all of this goes, so I encourage anyone to ask questions and critique these plans. I do agree that a problem exists, action is useful, and this proposal is the best idea I have heard for addressing it. I see no significant shortcomings to this idea except the newness and uncertainty, but I think this is the way. Bluerasberry (talk) 19:18, 23 September 2024 (UTC)

characteristic of (aliases: quality of | property of | inheres in)

Status: Ready
Description: (qualifier only) statement value is a characteristic, quality, property, or state of this item
Data type: Item
Domain: quality (Q1207505), property (Q937228), state (Q3505845), relation (Q930933), type of property (Q96253971)
Example 1: battery management system (Q810938) measures (P2575) temperature (Q11466), with qualifier characteristic of battery (Q267298)
Example 2: terminal velocity (Q614981) has contributing factor (P1479) orientation (Q2235286), with qualifier characteristic of solid object (Q29052015)
Example 3: The Unconscious of a Conservative (Q52945586) main subject (P921) mental health (Q317309), with qualifier characteristic of Barry Goldwater (Q319129)
Example 4: tetrachromacy (Q94556) has characteristic (P1552) dimension (Q4440864), with qualifiers quantity (P1114) "4" and characteristic of color space (Q166863)
See also: of (P642), applies to part (P518), facet of (P1269), part of (P361), has characteristic (P1552)

Motivation

This common relation is widely expressed with the massively overloaded (and to-be-deprecated) of (P642), and sometimes (erroneously) with applies to part (P518), facet of (P1269), part of (P361), and possibly a few other properties. Although it is semantically an inverse of has characteristic (P1552), constraining this property to the qualifier scope will prevent introduction of redundant inverses of has characteristic (P1552) statements. Swpb (talk) 18:03, 12 September 2024 (UTC)
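To make the qualifier-only restriction concrete, here is a minimal sketch of a scope check over a simplified statement model. The dictionary layout and the ID `P_CHARACTERISTIC_OF` are hypothetical stand-ins (the property has not been created yet); this is not how Wikidata itself implements scope constraints.

```python
# Hypothetical placeholder: the proposed property has no ID yet.
CHARACTERISTIC_OF = "P_CHARACTERISTIC_OF"


def scope_violations(statements):
    """Return statements that use the qualifier-only property as a main property."""
    return [s for s in statements if s["property"] == CHARACTERISTIC_OF]


statements = [
    # Allowed use, modelled on Example 1: the property appears only as a qualifier.
    {
        "subject": "Q810938",   # battery management system
        "property": "P2575",    # measures
        "value": "Q11466",      # temperature
        "qualifiers": [(CHARACTERISTIC_OF, "Q267298")],  # battery
    },
    # Made-up violation: the property used as a main statement, which would
    # duplicate an inverse has characteristic (P1552) statement.
    {
        "subject": "Q11466",
        "property": CHARACTERISTIC_OF,
        "value": "Q267298",
        "qualifiers": [],
    },
]

for s in scope_violations(statements):
    print(f"{s['subject']}: {CHARACTERISTIC_OF} used outside qualifier scope")
```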

Discussion
