Kris Dorosz

Gdańsk, Pomorskie, Poland Contact Info
1K followers 500+ connections

Join to view profile

About

On a mission to make airspace more predictable.

Activity

Join now to see all activity

Experience & Education

  • Air Space Intelligence

View Kris’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Publications

  • Semantic Approach for Web Information Monitoring

    International Journal of Web Applications, DLINE

    Other authors
    See publication
  • Latent Semantic Analysis Evaluation of Conceptual Dependency Driven Focused Crawling

    Springer Berlin Heidelberg, Multimedia Communications, Services and Security

    In this paper we study a focused crawler driven by deep semantic analysis provided by the Conceptual Dependency (CD) theory. We test in practice the application of CD scripts as an approach of defining topics (queries) in a focused crawler and its robustness in evaluating real text structures extracted from HTML documents. In order to benchmark its efficiency in comparison to classical approaches, apart from human evaluation we also provide an evaluation of the result set based on its internal…

    In this paper we study a focused crawler driven by deep semantic analysis provided by the Conceptual Dependency (CD) theory. We test in practice the application of CD scripts as an approach of defining topics (queries) in a focused crawler and its robustness in evaluating real text structures extracted from HTML documents. In order to benchmark its efficiency in comparison to classical approaches, apart from human evaluation we also provide an evaluation of the result set based on its internal similarity using Latent Semantic Analysis (LSA). The performed measurement brings us to the conclusion that the CD theory is well suited for evaluating the similarity of HTML documents provided a specific query, as it achieves a high precision measured through human evaluation. At the same time we observe the drawbacks of LSA used in the same context.

    Other authors
    See publication
  • Enhancing Regular Expressions for Polish Text Processing

    Computer Science Journal, AGH

    The paper presents proposition of regular expressions engine based on the modified
    Thompson's algorithm dedicated to the Polish language processing. The Polish inflectional
    dictionary has been used for enhancing regular expressions engine and syntax. Instead of
    using characters as a basic element of regular expressions patterns (as it takes place in
    BRE or ERE standards) presented tool gives possibility of using words from a natural
    language or labels describing words…

    The paper presents proposition of regular expressions engine based on the modified
    Thompson's algorithm dedicated to the Polish language processing. The Polish inflectional
    dictionary has been used for enhancing regular expressions engine and syntax. Instead of
    using characters as a basic element of regular expressions patterns (as it takes place in
    BRE or ERE standards) presented tool gives possibility of using words from a natural
    language or labels describing words grammar properties in regex syntax.

    Other authors
    • Anna Szczerbińska
    See publication
  • Usage of Dedicated Data Structures for URL Databases in a Large-scale Crawling

    Computer Science Journal, AGH

    The article discuss usage of Berkeley DB data structures such as hash tables and b-trees for
    implementation of a high performance URL database. The article presents a formal model
    for a data structures oriented URL database, which can be used as an alternative for a
    relational oriented URL database. Keywords: crawling, crawler, large-scale, Berkeley DB,
    URL database, URL repository, data structures.

    See publication

Courses

  • ESSLI'09 Bordeaux

    -

Honors & Awards

  • Rector's Team Award

    AGH Rector

    Award for didactics achievements.

  • Rector's Team Award

    UJ Vice Rector, prof. dr hab. Jacek Popiel

  • Best Paper Award

    MCSS'2012

    Paper: Latent semantic analysis evaluation of conceptual dependency driven focused crawling

  • Rector's Team Award

    UJ Vice Rector, prof. dr hab. Michał du Vall

Languages

  • Polish

    Native or bilingual proficiency

  • English

    Professional working proficiency

Recommendations received

More activity by Kris

View Kris’ full profile

  • See who you know in common
  • Get introduced
  • Contact Kris directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses