The FASTA Parser is a command-line tool designed to process FASTA files, retrieve sequence headers, and extract sequences based on a specified reference ID.
- Description: Specifies the path to the input FASTA file.
- Usage: This argument is required for the program to run.
- Syntax:
-i <path_to_file> --fasta-input <path_to_file>
- Example:
./fasta_parser -i input.fasta
- Description: A flag to print the headers of the FASTA sequences. When this option is specified, the program will display the headers from the input FASTA file.
- Usage: This argument is optional and acts as a flag (no additional value required).
- Syntax:
-H --print-headers
- Example:
./fasta_parser -i input.fasta -H
- Description: Specifies the ID of the reference sequence to be located in the FASTA file. The program will search for and operate on the sequence with the given reference ID.
- Usage: This argument is optional but requires a string value corresponding to the reference ID.
- Syntax:
-r <reference_id> --reference-id <reference_id>
- Example:
./fasta_parser -i input.fasta -r ref123
- Description: "A nucleotide sequence that you want to file in the fasta sequences of fasta file. Return BED style Start and End coordinates.
- Usage: This argument is optional but requires a string value corresponding to the wanted subsequence to be searched.
- Syntax:
-s <nucleotide_sequence> --subsequence-pattern <nucleotide_sequence>
- Example:
./fasta_parser -i input.fasta -s ATGT
./fasta_parser -i input.fasta -HExample 2: Parse a FASTA file and find aligns the sequence with the ID provided to the other sequences in file.
./fasta_parser -i input.fasta -r ID_123./fasta_parser -i input.fasta- 0: Program executed successfully.
- 1: Missing required arguments or invalid input.
- The input file must be in valid FASTA format.
- The program ensures UTF-8 compliance for all input arguments.
Developed using the CLI11 library.