“Krzysztof is one of the most competent and reliable engineers and scientists I have had the pleasure to work with. Not only is he an expert in Natural Language Processing and related fields, but his technical skills are superb. He also learns and adapts to new challenges very fast. Krzysztof is also very proactive, always delivers on time, and his solutions are of the highest quality. Last but not least, he is a great team player with great communication skills and personal integrity. To sum up, working with Krzysztof is a joy.”
About
Activity
-
Personal update: today is my last day at Snowflake. As I prepare to embark on a new chapter, I wanted to take a moment to express my deepest…
Liked by Kris Dorosz
-
Being a dad is the best/hardest/most important job. Happy Father's day everyone!
Liked by Kris Dorosz
-
Are you aware of the "Nerd party" meme? It is a photo taken at the science camp and mathematical Olympiad held in Poland in 1997. (btw. all of these…
Liked by Kris Dorosz
Experience & Education
Publications
-
Semantic Approach for Web Information Monitoring
International Journal of Web Applications, DLINE
-
Latent Semantic Analysis Evaluation of Conceptual Dependency Driven Focused Crawling
Springer Berlin Heidelberg, Multimedia Communications, Services and Security
In this paper we study a focused crawler driven by deep semantic analysis provided by the Conceptual Dependency (CD) theory. We test in practice the application of CD scripts as an approach to defining topics (queries) in a focused crawler, and its robustness in evaluating real text structures extracted from HTML documents. To benchmark its efficiency against classical approaches, apart from human evaluation we also evaluate the result set based on its internal similarity using Latent Semantic Analysis (LSA). The measurements lead us to conclude that the CD theory is well suited for evaluating the similarity of HTML documents given a specific query, as it achieves high precision as measured through human evaluation. At the same time we observe the drawbacks of LSA used in the same context.
-
Enhancing Regular Expressions for Polish Text Processing
Computer Science Journal, AGH
The paper proposes a regular expression engine based on a modified
Thompson's algorithm, dedicated to Polish language processing. A Polish inflectional
dictionary is used to enhance the regular expression engine and syntax. Instead of
using characters as the basic element of regular expression patterns (as in the
BRE or ERE standards), the presented tool allows words from a natural
language, or labels describing the grammatical properties of words, to be used in regex syntax.
-
Usage of Dedicated Data Structures for URL Databases in a Large-scale Crawling
Computer Science Journal, AGH
The article discusses the use of Berkeley DB data structures, such as hash tables and B-trees, for
implementing a high-performance URL database. It presents a formal model
for a data-structure-oriented URL database, which can be used as an alternative to a
relational URL database. Keywords: crawling, crawler, large-scale, Berkeley DB,
URL database, URL repository, data structures.
Courses
-
ESSLLI'09 Bordeaux
-
Honors & Awards
-
Rector's Team Award
AGH Rector
Award for teaching achievements.
-
Rector's Team Award
UJ Vice Rector, prof. dr hab. Jacek Popiel
-
Best Paper Award
MCSS'2012
Paper: Latent semantic analysis evaluation of conceptual dependency driven focused crawling
-
Rector's Team Award
UJ Vice Rector, prof. dr hab. Michał du Vall
Languages
-
Polish
Native or bilingual proficiency
-
English
Professional working proficiency
Recommendations received
2 people have recommended Kris
More activity by Kris
-
Fantastic team and great investors (old and new)!
Liked by Kris Dorosz
-
Very impressive fundraise and nicely written article. Congrats for managing to change the airplanes universe ✈️
Liked by Kris Dorosz
-
A few weeks ago I had the chance to sit down for a chat with Kornelia Trzęsowska for the PMI magazine. We talked about the special character of…
Liked by Kris Dorosz
-
One of our portfolio companies, Air Space Intelligence, is recruiting the founding software engineers for their new defense team. They just raised…
Liked by Kris Dorosz
-
Air Space Intelligence just raised a $34M Series B, led by Andreessen Horowitz. We are aggressively growing our US Defense engineering team…
Liked by Kris Dorosz
-
We raised $34M in Series B funding from Andreessen Horowitz, and support from existing investors Bloomberg Beta, Renegade Partners, Spark Capital. I…
Shared by Kris Dorosz
-
The Sky is the Limit - Our Investment in Air Space Intelligence Today, Air Space Intelligence announced a $34 million Series B funding round led by…
Liked by Kris Dorosz
-
We are excited to announce Air Space Intelligence has raised $34M in Series B funding led by Andreessen Horowitz, and continuous support from…
Liked by Kris Dorosz
-
Fun time at HackYeah 2023 last weekend. Big kudos to the team Bazyli Polednia, Patryk Neubauer, and Dominik Krajnik 🏄
Liked by Kris Dorosz
-
A VC that builds? To best support founders, VCs can't run like law firms. They need to run like the startups they back. They need to build. We're…
Liked by Kris Dorosz
-
I am excited to announce that Kamil Krampa and I will be speaking at the Gdansk Kubernetes Meetup on June 22nd. 🗓 Our talk will focus on "Measuring…
Liked by Kris Dorosz