Skip to content

docs: paper #111

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 8 commits into
base: main
Choose a base branch
from
Open

docs: paper #111

wants to merge 8 commits into from

Conversation

cmeesters
Copy link
Collaborator

@cmeesters cmeesters commented Apr 8, 2025

this PR is not to be merged until after the paper review.

The intention is to develop a paper draft together .

Summary by CodeRabbit

  • Documentation

    • Introduced a new document titled "A Snakemake workflow for differential expression analysis with alternative splicing detection using long read data," detailing a modular workflow for analyzing long-read RNA sequencing data, including quality filtering, read alignment, quantification, and isoform detection.
  • New Features

    • Added a GitHub Actions workflow for automatically generating and uploading a draft PDF of the new document upon changes to the relevant files.

Copy link
Contributor

coderabbitai bot commented Apr 8, 2025

Warning

Rate limit exceeded

@cmeesters has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 0 minutes and 8 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

📥 Commits

Reviewing files that changed from the base of the PR and between db39ed1 and 7687119.

📒 Files selected for processing (1)
  • .github/workflows/draft-pdf.yml (1 hunks)

Walkthrough

A new document titled "A Snakemake workflow for differential expression analysis with alternative splicing detection using long read data" has been added. The document details a modular, reproducible workflow optimized for high-performance computing environments. It outlines steps including quality filtering, read alignment, quantification, differential expression analysis, and alternative splicing detection using a modified FLAIR toolkit. An optional annotation procedure for under-annotated or novel genomes is also described. Additionally, a new GitHub Actions workflow has been introduced to automate the generation and upload of a draft PDF from the document.

Changes

File Change Summary
paper/paper.md New document describing a modular Snakemake workflow for differential expression and alternative splicing analysis on long-read data.
.github/workflows/draft-pdf.yml New GitHub Actions workflow for automatically generating and uploading a draft PDF from paper/paper.md.

Sequence Diagram(s)

sequenceDiagram
    participant User as User/HPC
    participant Workflow as Snakemake Workflow
    participant Quality as Quality Filtering
    participant Align as Read Alignment
    participant Quant as Quantification
    participant DiffExp as Differential Analysis
    participant Splice as Splicing Analysis (FLAIR)
    participant Annot as Optional Annotation

    User->>Workflow: Start workflow with input data
    Workflow->>Quality: Execute quality filtering
    Quality-->>Workflow: Return filtered data
    Workflow->>Align: Perform read alignment
    Align-->>Workflow: Return aligned reads
    Workflow->>Quant: Quantify transcripts
    Quant-->>Workflow: Return expression data
    Workflow->>DiffExp: Run differential expression analysis
    DiffExp-->>Workflow: Return statistical outcomes
    Workflow->>Splice: Perform splicing analysis using FLAIR
    Splice-->>Workflow: Return isoform data
    Workflow->>Annot: (Optional) Annotate novel genomes
    Annot-->>Workflow: Return annotation details
    Workflow-->>User: Provide final workflow output
Loading

Poem

I’m a rabbit with a code-filled hop,
Leaping through data with a happy bop.
New workflows bloom like carrots in spring,
Smart sequences that make efficiency sing.
With every command and snippet so neat,
My floppy ears dance to this technical beat!
🐇🌼


🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai plan to trigger planning for file edits and PR creation.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🧹 Nitpick comments (7)
paper/paper.md (7)

4-10: Correct Typos in Title and Tags
The title and the tags contain typographical errors. In the title, "analyis" should be corrected to "analysis" and in the tags, "alterternative splicing detection" should be corrected to "alternative splicing detection".

-title: 'A Snakemake workflow for differential expression analyis with alternative splicing detection using long read data'
+title: 'A Snakemake workflow for differential expression analysis with alternative splicing detection using long read data'

-  - alterternative splicing detection
+  - alternative splicing detection
🧰 Tools
🪛 markdownlint-cli2 (0.17.2)

6-6: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


7-7: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


8-8: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


9-9: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


10-10: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


19-20: Review Placeholder ORCID Value
The ORCID for Christiane Krämer is set as "XXX ?". If this is a placeholder, please update it with a valid ORCID or remove the placeholder to maintain professionalism.


31-31: Remove Inline Comment from Date Field
The date field includes the annotation " <- update". Consider removing this inline note before finalizing the document to ensure the metadata remains clean.


5-10: Address YAML List Indentation Issues in Tags Block
Markdownlint has flagged unordered list indentation issues (MD007) for the tags block. You might either adjust the indentation to conform to the linter’s expectations or configure the linter to ignore YAML front matter.

🧰 Tools
🪛 markdownlint-cli2 (0.17.2)

6-6: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


7-7: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


8-8: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


9-9: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


10-10: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


25-29: Ensure Consistent Indentation in Affiliations Block
There are inconsistent indentation issues in the affiliations list. Align the list items and their sub-properties uniformly to improve readability and prevent potential parsing issues.

🧰 Tools
🪛 LanguageTool

[uncategorized] ~28-~28: If this is a compound adjective that modifies the following noun, use a hyphen.
Context: ...ny index: 1 - name: NHR-SouthWest / High Performance Computing Group, Johannes Gutenberg-Uni...

(EN_COMPOUND_ADJECTIVE_INTERNAL)

🪛 markdownlint-cli2 (0.17.2)

26-26: Inconsistent indentation for list items at the same level
Expected: 2; Actual: 1

(MD005, list-indent)


26-26: Unordered list indentation
Expected: 0; Actual: 1

(MD007, ul-indent)


28-28: Inconsistent indentation for list items at the same level
Expected: 2; Actual: 1

(MD005, list-indent)


28-28: Unordered list indentation
Expected: 0; Actual: 1

(MD007, ul-indent)


78-78: Fix Typo and Enhance Readability in Quality Filtering Section
The word "Prioar" appears to be a typo and should be corrected to "Prior to". Additionally, consider inserting a comma after the introductory phrase "For this" to improve clarity (i.e. "For this, we make use of the BioPython library...").

🧰 Tools
🪛 LanguageTool

[typographical] ~78-~78: Use a comma after an introductory phrase.
Context: ...ead quality and read length thresholds. For this we make use of the BioPython library [@...

(COMMA_INTRODUCTORY_WORDS_PHRASES)


93-93: Remove Placeholder Text for Rulegraph
The text "ADD rulegraph and caption, here." appears to be a placeholder. Ensure that this placeholder is replaced with the final rulegraph content or removed before merging the draft paper.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 04e998a and fc60051.

📒 Files selected for processing (1)
  • paper/paper.md (1 hunks)
🧰 Additional context used
🪛 LanguageTool
paper/paper.md

[uncategorized] ~28-~28: If this is a compound adjective that modifies the following noun, use a hyphen.
Context: ...ny index: 1 - name: NHR-SouthWest / High Performance Computing Group, Johannes Gutenberg-Uni...

(EN_COMPOUND_ADJECTIVE_INTERNAL)


[typographical] ~78-~78: Use a comma after an introductory phrase.
Context: ...ead quality and read length thresholds. For this we make use of the BioPython library [@...

(COMMA_INTRODUCTORY_WORDS_PHRASES)

🪛 markdownlint-cli2 (0.17.2)
paper/paper.md

6-6: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


7-7: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


8-8: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


9-9: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


10-10: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


12-12: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


15-15: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


18-18: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


21-21: Unordered list indentation
Expected: 0; Actual: 2

(MD007, ul-indent)


26-26: Inconsistent indentation for list items at the same level
Expected: 2; Actual: 1

(MD005, list-indent)


26-26: Unordered list indentation
Expected: 0; Actual: 1

(MD007, ul-indent)


28-28: Inconsistent indentation for list items at the same level
Expected: 2; Actual: 1

(MD005, list-indent)


28-28: Unordered list indentation
Expected: 0; Actual: 1

(MD007, ul-indent)

⏰ Context from checks skipped due to timeout of 90000ms (1)
  • GitHub Check: Testing

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 3

🧹 Nitpick comments (1)
.github/workflows/draft-pdf.yml (1)

28-28: Newline at End of File

The linter also reported that there’s no newline character at the end of the file (line 28). Please add a newline to adhere to best practices and avoid potential issues in some tools.

🧰 Tools
🪛 YAMLlint (1.35.1)

[error] 28-28: no new line character at the end of file

(new-line-at-end-of-file)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between fc60051 and f129380.

📒 Files selected for processing (1)
  • .github/workflows/draft-pdf.yml (1 hunks)
🧰 Additional context used
🪛 YAMLlint (1.35.1)
.github/workflows/draft-pdf.yml

[warning] 4-4: wrong indentation: expected 8 but found 6

(indentation)


[warning] 5-5: wrong indentation: expected 10 but found 8

(indentation)


[warning] 9-9: wrong indentation: expected 4 but found 2

(indentation)


[warning] 10-10: wrong indentation: expected 6 but found 4

(indentation)


[warning] 13-13: wrong indentation: expected 8 but found 6

(indentation)


[warning] 18-18: wrong indentation: expected 12 but found 10

(indentation)


[warning] 24-24: wrong indentation: expected 12 but found 10

(indentation)


[error] 28-28: no new line character at the end of file

(new-line-at-end-of-file)

Comment on lines +8 to +12
jobs:
paper:
runs-on: ubuntu-latest
name: Paper Draft
steps:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Jobs Block: Correct Indentation

The jobs section also shows inconsistent indentation. For instance, line 9 (the key paper:) is indented with 2 spaces while the expected indentation is 4, and line 10’s runs-on: ubuntu-latest should be indented by 6 spaces rather than 4.

A consistent reindentation might look like this:

Adjust the rest of the document similarly to maintain uniformity.

Committable suggestion skipped: line range outside the PR's diff.

🧰 Tools
🪛 YAMLlint (1.35.1)

[warning] 9-9: wrong indentation: expected 4 but found 2

(indentation)


[warning] 10-10: wrong indentation: expected 6 but found 4

(indentation)

Comment on lines 13 to 27
- name: Checkout
uses: actions/checkout@v4
- name: Build draft PDF
uses: openjournals/openjournals-draft-action@master
with:
journal: joss
# This should be the path to the paper within your repo.
paper-path: paper/paper.md
- name: Upload
uses: actions/upload-artifact@v4
with:
name: paper
# This is the output path where Pandoc will write the compiled
# PDF. Note, this should be the same directory as the input
# paper.md
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🛠️ Refactor suggestion

Steps Block: Reindent Step Items and Parameters

Several steps within the job have indentation issues as well. Static analysis reports that:

  • Line 13 (“- name: Checkout”) is indented with 6 spaces instead of the expected 8.
  • Line 18 (the “journal: joss” key) is indented with 10 spaces when 12 are expected.
  • Line 24 (“name: paper”) similarly shows too few spaces.

A suggested reindentation for the steps block using a 2-space-per-level standard is:

This will help clear the YAML lint warnings and ensure that the workflow is parsed correctly.

Committable suggestion skipped: line range outside the PR's diff.

🧰 Tools
🪛 YAMLlint (1.35.1)

[warning] 13-13: wrong indentation: expected 8 but found 6

(indentation)


[warning] 18-18: wrong indentation: expected 12 but found 10

(indentation)


[warning] 24-24: wrong indentation: expected 12 but found 10

(indentation)

cmeesters and others added 5 commits April 8, 2025 15:31
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (4)
paper/paper.md (4)

1-31: Fix Typos and Remove Editorial Notes in Front Matter

  • Line 2: The title contains a typo—“differential expression analyis” should be “differential expression analysis.”
  • Line 8: The tag “alterternative splicing detection” should be corrected to “alternative splicing detection.”
  • Line 29: Remove the “<- update” note from the date field once the paper’s metadata is finalized.

16-18: Review Placeholder ORCID Field

The ORCID for Christiane Krämer is listed as XXX ?, which appears to be a placeholder. Please update this field with the correct ORCID or remove it if not applicable.


76-77: Correct Typographical Error in Quality Filtering Section

The sentence beginning on line 76 starts with “Prioar downstream analysis...”. It should read “Prior to downstream analysis...” to ensure clarity and correctness.


90-90: Remove Placeholder Text

The placeholder text “ADD rulegraph and caption, here.” on line 90 should be replaced with the actual rulegraph and appropriate caption, or removed if it is not required in the final draft.

🧰 Tools
🪛 LanguageTool

[uncategorized] ~90-~90: The grammatical number of this noun doesn’t look right. Consider replacing it.
Context: ...exploratory studies. ADD rulegraph and caption, here. # Usage # Acknowledgements A...

(AI_EN_LECTOR_REPLACEMENT_NOUN_NUMBER)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f129380 and db39ed1.

📒 Files selected for processing (2)
  • .github/workflows/draft-pdf.yml (1 hunks)
  • paper/paper.md (1 hunks)
🧰 Additional context used
🪛 LanguageTool
paper/paper.md

[typographical] ~75-~75: Use a comma after an introductory phrase.
Context: ...ead quality and read length thresholds. For this we make use of the BioPython library [@...

(COMMA_INTRODUCTORY_WORDS_PHRASES)


[uncategorized] ~90-~90: The grammatical number of this noun doesn’t look right. Consider replacing it.
Context: ...exploratory studies. ADD rulegraph and caption, here. # Usage # Acknowledgements A...

(AI_EN_LECTOR_REPLACEMENT_NOUN_NUMBER)

🪛 YAMLlint (1.35.1)
.github/workflows/draft-pdf.yml

[error] 24-24: no new line character at the end of file

(new-line-at-end-of-file)

🔇 Additional comments (2)
.github/workflows/draft-pdf.yml (2)

1-7: YAML Trigger Configuration Looks Correct

The trigger block under on: is well structured using consistent 2-space indentation for keys and 6-space indentation for list items. This update resolves the previous indentation issues flagged in earlier commits.


8-24: Append Newline at End of File

Static analysis has flagged that there is no newline character at the end of the file (line 24). Please ensure a newline is added to meet YAML formatting best practices.

🧰 Tools
🪛 YAMLlint (1.35.1)

[error] 24-24: no new line character at the end of file

(new-line-at-end-of-file)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant