@@ -4,7 +4,7 @@ Google Summer of Code 2019
44==========================
55
66AboutCode is participating in the Google Summer of Code in 2019 as a mentoring org. This page
7- contain all the information for students and anyone else interested in helping.
7+ contains all the information for students and anyone else interested in helping.
88
99AboutCode is a family of FOSS projects to uncover data ... about software code:
1010
@@ -13,7 +13,7 @@ AboutCode is a family of FOSS projects to uncover data ... about software code:
1313- what is its license? copyright?
1414- is the code secure, maintained, well coded?
1515
16- All these are questions that are important to answer: there are million
16+ All these are questions that are important to answer: there are millions
1717of free and open source software components available on the web for reuse.
1818
1919Knowing where a software package comes from, what is its license and if it is
@@ -420,11 +420,11 @@ DeltaCode projects
420420DeltaCode is a tool to compare and report scan differences.
421421When comparing files, it only uses exact comparison.
422422The goal of this project is to improve the usefulness of the delta by also
423- finding files that are mostly the same (e.g. quasi or nrea duplicates) vs. files
423+ finding files that are mostly the same (e.g. quasi or near duplicates) vs. files
424424that are completely different. Then the DeltaCode comparison core should be
425425updated accordingly to detect and report material changes to scans (such as
426- new, update or removed licenses, origins and packages) when changes are also
427- meterial in the code files (e.g. such that small changes may be ignored)
426+ new, updated or removed licenses, origins and packages) when changes are also
427+ material in the code files (e.g. such that small changes may be ignored)
428428
429429
430430- **Level **
@@ -454,7 +454,7 @@ TraceCode projects
454454
455455TraceCode does system call tracing only today. The primary goal of this project
456456is to create a tool that provides the same results as the strace-based tracing
457- but would be using using ELF symbols, DWARF debug symbols, signatures or string
457+ but would be using ELF symbols, DWARF debug symbols, signatures or string
458458matching to determine when and how a source code file is built in a binary
459459using only a static analysis. The primary target should be Linux executables,
460460though the code should be designed to be extensible to Windows PE and macOS Dylib and exes.
@@ -593,13 +593,13 @@ The features and TODO for this updated server would be:
593593- Aggregate more and new packages vulnerabilities feeds,
594594
595595- Automating correlation: add smart relationship detection to infer new
596- relatiosnhips between available packages and vulnerabilities from mining the
596+ relationships between available packages and vulnerabilities from mining the
597597 graph of existing relations.
598598
599599- Create a ScanCode plugin to report vulnerabilities with detected packages
600600 using this data.
601601
602- - Integrate API lookup on the server withe the AboutCode Manager UI
602+ - Integrate API lookup on the server with the AboutCode Manager UI
603603
604604- Create a UI and model for community curation of vulnerability to package
605605 mappings, correlations and enhancements.
@@ -657,7 +657,7 @@ There are several areas to research and prototype such as:
657657- A data structure to match efficiently a batch of fix-width checksums (e.g.
658658 SHA1) against a large index of such checksums, where each checksum points to
659659 one or more files or packages. A possible direction is to use finite state
660- transducers, specialized B-tree indexes, blomm -like filters. Since when a
660+ transducers, specialized B-tree indexes, bloom -like filters. Since when a
661661 codebase is being matched there can be millions of lookups to do, the batch
662662 matching is preferred.
663663
@@ -677,7 +677,7 @@ There are several areas to research and prototype such as:
677677
678678- Feature hashing research: we deal with many "features" and hashing to limit
679679 the number and size of the each features seems to be a valuable thing. The goal
680- is to research the validaty of feature hashing with short hashes (15, 16 and
680+ is to research the validity of feature hashing with short hashes (15, 16 and
681681 32 bits) and evaluate if this leads to acceptable false-positive and loss of
682682 accuracy in the context of the data structures mentioned above.
683683
@@ -708,5 +708,5 @@ This is a green field project.
708708Mentoring
709709---------
710710
711- We welcome new mentors to help with the program and require some good unerstanding of the project
711+ We welcome new mentors to help with the program and require some good understanding of the project
712712codebase and domain to join as a mentor. Contact the team on Gitter.
0 commit comments