Skip to content

Ayan1089/CEUR-WS-Event-Series--SS23

Repository files navigation

Event Series Completion Group Project 2023

Task

  1. Find CEUR-WS related academic events in Wikidata
  2. Some events have the property 179 (part of the series)
  3. For all events that don't have this property find whether,
    • they are part of a conference/workshop series or
    • they are a standalone event
  4. Match events to their event series and update the property.

Relations of Proceedings, Events, Series

classDiagram
    note for Proceeding "Those are in CEUR-WS"
    class Proceeding {
        String title
        Int Volume
    }
    class Event {
        String title
        Int ordinal
        Optional~String~ dblpEventId(P10692)
    }

    class Conference

    class Workshop

    note for `CEUR-WS (Q27230297)` "Only proceedings for this\n series are of interest"
    class `CEUR-WS (Q27230297)`
    <<Singleton>> `CEUR-WS (Q27230297)`

    note for EventSeries "This relation is what\n we need to find"
    class EventSeries {
        String title
        String acronym
    }

    Event <|-- Workshop: instance of (P31)
    Event <|-- Conference: instance of (P31)
    Proceeding --> Event: is proceedings from
    Proceeding --> `Conference Proceedings Series (Q27785883)`: part of the series

`Conference Proceedings Series (Q27785883)` <|--  `CEUR-WS (Q27230297)`
Event --> EventSeries: part of the series (P179)
Loading

Associations and Properties in Wikidata

Progress

  • Two main approaches
    • Match directly based on attributes (title, acronym)
    • Match through DBLP
  • Extracted relevant events from Wikidata
  • Implemented various matching-algorithms mostly comparing titles
  • Parsed and scraped events and their series from dblp that are related to CEUR-WS
    • Extracted meta-information, event-information, event-series-information
    • Matched conferences to their series through dblp
  • More information can be found in the final presentation
  • Sankey plot about the progress

Structure

  • Architecture
    • The excalidraw information is embedded in the file
  • The main module is eventseries
  • Test are found in eventseries/src/tests
    • More information about dblp can be found in eventseries/src/main/dblp
    • Resources like .json, .pickle or .csv files are located in eventseries/src/main/resources
  • Dataexploration and experiments are within notebooks in notebooks
    • Data used within the notebooks should be placed in data

Known Issues

About

KB lab SS23

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors