Zobrazeno 1 - 8
of 8
pro vyhledávání: '"Egor Bogomolov"'
Code clones are pairs of code snippets that implement similar functionality. Clone detection is a fundamental branch of automatic source code comprehension, having many applications in refactoring recommendation, plagiarism detection, and code summar
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cd0f0f6a995ef8b729a8431d9cf684bf
http://arxiv.org/abs/2206.08726
http://arxiv.org/abs/2206.08726
In recent years, researchers have created and introduced a significant number of various code generation models. As human evaluation of every new model version is unfeasible, the community adopted automatic evaluation metrics such as BLEU to approxim
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6c97a44e4ec2b3e9ca04111f63c81ee3
Publikováno v:
MaLTeSQuE@ESEC/SIGSOFT FSE
Applying machine learning to tasks that operate with code changes requires their numerical representation. In this work, we propose an approach for obtaining such representations during pre-training and evaluate them on two different downstream tasks
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::92f6bc5e798ea9131e35fc5a3ece35c7
http://arxiv.org/abs/2106.02087
http://arxiv.org/abs/2106.02087
Publikováno v:
MSR
The application of machine learning algorithms to source code has grown in the past years. Since these algorithms are quite sensitive to input data, it is not surprising that researchers experiment with input representations. Nowadays, a popular star
Publikováno v:
ASE
In this paper, we present Sosed, a tool for discovering similar software projects. We use fastText to compute the embeddings of subtokens into a dense space for 120,000 GitHub repositories in 200 languages. Then, we cluster embeddings to identify gro
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6aab5c1a4aecbd5353049164006a1a24
http://arxiv.org/abs/2007.02599
http://arxiv.org/abs/2007.02599
Publikováno v:
ICSE (Workshops)
With the goal of facilitating team collaboration, we propose a new approach to building vector representations of individual developers by capturing their individual contribution style, or coding style. Such representations can find use in the next g
Publikováno v:
ESEC/SIGSOFT FSE
Authorship attribution (i.e., determining who is the author of a piece of source code) is an established research topic. State-of-the-art results for the authorship attribution problem look promising for the software engineering field, where they cou
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::36591c8cc0063245fcdd9dea38f05898
http://arxiv.org/abs/2001.11593
http://arxiv.org/abs/2001.11593
Publikováno v:
2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR)
MSR
MSR
One recent, significant advance in modeling source code for machine learning algorithms has been the introduction of path-based representation -- an approach consisting in representing a snippet of code as a collection of paths from its syntax tree.
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ed1afa7bd92a1aac66017361d3b9078b
https://www.zora.uzh.ch/id/eprint/197735/
https://www.zora.uzh.ch/id/eprint/197735/