Run Non Octopus UI Tests macOS #149

Workflow file for this run

.github/workflows/runUiOctopusTests.yml at 0526019

	name: Run Non Octopus UI Tests macOS
	on:
	workflow_dispatch:
	schedule:
	- cron: '0 12 * * *'
	jobs:
	build-for-ui-test-mac-os:
	if: github.repository == 'JetBrains/ideavim'
	runs-on: macos-latest
	permissions:
	contents: read
	id-token: write
	steps:
	- uses: actions/checkout@v4
	- name: Setup Java
	uses: actions/setup-java@v4
	with:
	distribution: zulu
	java-version: 21
	- name: Setup FFmpeg
	run: brew install ffmpeg
	- name: Setup Gradle
	uses: gradle/actions/setup-gradle@v4
	- name: Build Plugin
	run: gradle :buildPlugin
	- name: Run Idea
	run: \|
	mkdir -p build/reports
	gradle runIdeForUiTests -Doctopus.handler=false > build/reports/idea.log &
	- name: List available capture devices
	run: ffmpeg -f avfoundation -list_devices true -i "" 2>&1 \|\| true
	continue-on-error: true
	- name: Start screen recording
	run: \|
	mkdir -p build/reports/ci-screen-recording
	ffmpeg -f avfoundation -capture_cursor 1 -i "0:none" -r 30 -vcodec libx264 -pix_fmt yuv420p build/reports/ci-screen-recording/screen-recording.mp4 &
	echo $! > /tmp/ffmpeg_pid.txt
	continue-on-error: true
	- name: Auto-click Allow button for screen recording permission
	run: \|
	sleep 3
	brew install cliclick \|\| true

	for coords in "512:367" "960:540" "640:400" "800:450"; do
	x=$(echo $coords \| cut -d: -f1)
	y=$(echo $coords \| cut -d: -f2)
	echo "Trying coordinates: $x,$y"

	cliclick c:$x,$y 2>/dev/null && echo "cliclick succeeded" && break
	sleep 0.5

	osascript -e "tell application \"System Events\" to click at {$x, $y}" 2>/dev/null && echo "AppleScript succeeded" && break
	sleep 1
	done
	continue-on-error: true
	- name: Wait for Idea started
	uses: jtalk/url-health-check-action@v3
	with:
	url: http://127.0.0.1:8082
	max-attempts: 20
	retry-delay: 10s
	- name: Tests
	run: gradle :tests:ui-ij-tests:testUi
	- name: Stop screen recording
	if: always()
	run: \|
	if [ -f /tmp/ffmpeg_pid.txt ]; then
	kill $(cat /tmp/ffmpeg_pid.txt) \|\| true
	sleep 2
	fi
	continue-on-error: true
	- name: Move sandbox logs
	if: always()
	run: mv build/idea-sandbox/IU-*/log_runIdeForUiTests idea-sandbox-log
	- name: AI Analysis of Test Failures
	if: failure()
	uses: anthropics/claude-code-action@v1
	with:
	claude_code_oauth_token: ${{ secrets.CLAUDE_CODE_OAUTH_TOKEN }}
	claude_args: '--allowed-tools "Read,Write,Edit,Glob,Grep,Bash(ffmpeg:),Bash(ffprobe:),Bash(mkdir:),Bash(touch:),Bash(echo:),Bash(ls:),Bash(cat:),Bash(grep:),Bash(find:),Bash(cd:),Bash(rm:),Bash(git:),Bash(gh:),Bash(gradle:),Bash(./gradlew:),Bash(java:),Bash(which:*)"'
	prompt: \|
	## Task: Analyze UI Test Failures

	Please analyze the UI test failures in the current directory.

	Key information:
	- Test reports are located in: build/reports and tests/ui-ij-tests/build/reports
	- There is a CI screen recording at build/reports/ci-screen-recording/screen-recording.mp4 that shows what happened during the entire test run - this video is usually very useful for understanding what went wrong visually
	- There is also a single screenshot at tests/ui-ij-tests/build/reports/ideaVimTest.png showing the state when the test failed
	- IDE sandbox logs are in the idea-sandbox-log directory
	- ffmpeg is already installed and available. Useful commands for video analysis:
	* Extract frame at specific time: `ffmpeg -i video.mp4 -ss 00:01:30 -vframes 1 output.png`
	* Extract multiple frames: `ffmpeg -i video.mp4 -vf fps=1/10 frame_%04d.png` (1 frame every 10 seconds)
	* Get video duration: `ffprobe -v error -show_entries format=duration -of default=noprint_wrappers=1:nokey=1 video.mp4`
	* Extract last N seconds: `ffmpeg -sseof -10 -i video.mp4 -update 1 last_frame.png` (last 10 seconds)
	* Create thumbnail grid: `ffmpeg -i video.mp4 -vf "select='not(mod(n\,300))',scale=160:120,tile=5x5" -frames:v 1 grid.png`

	Special troubleshooting for timeout failures:
	- If the test fails because it waited for something to appear, but according to the video and screenshot the element actually appeared on screen, check the UI hierarchy file at build/reports/hierarchy-ideaVimTest.html
	- This hierarchy file shows the actual structure of UI elements and their properties at the time of failure
	- The failure may be caused by a renamed property or changed class name in the UI element
	- If you find this is the case, suggest a new query or selector that matches the current element structure

	Please provide:
	1. A detailed analysis of what went wrong
	2. The root cause of the failure
	3. Potential fixes or suggestions

	Write your analysis to build/reports/ai-analysis.txt

	## UI Test Best Practices

	When fixing UI tests, follow these principles:

	Cause-Effect Over Timeouts: UI tests should wait for specific conditions rather than arbitrary timeouts.
	- ✅ GOOD: Wait for a button to become visible or enabled before clicking it
	- ✅ GOOD: Wait for specific text to appear in a component
	- ✅ GOOD: Wait for a dialog to be present with a specific accessible name
	- ❌ BAD: Use Thread.sleep() or fixed delays
	- ❌ BAD: Wait for arbitrary timeouts without checking for specific conditions

	Only use timeouts as a maximum wait duration with explicit condition checks (e.g., "wait up to 30 seconds for dialog to appear").
	Tests should be deterministic and based on observable state changes in the UI, not time-based assumptions.

	IMPORTANT: If you have a concrete suggestion for fixing the test, ALWAYS proceed with creating a branch and PR. Never ask for permission - just do it.

	If you have a concrete suggestion for fixing the test:
	1. Create a new branch with a descriptive name (e.g., fix/ui-test-accessible-name)
	2. Apply your suggested fix to the codebase
	3. CRITICAL: When staging changes, NEVER use `git add -A` or `git add .`. Always add modified files explicitly by path (e.g., `git add path/to/file.kt path/to/other.kt`). This prevents accidentally staging unrelated files.
	4. MANDATORY: Verify compilation succeeds with your changes: `gradle compileKotlin compileTestKotlin`
	5. MANDATORY: Run the specific failing test to verify the fix improves or resolves the issue
	- For Non-Octopus UI tests: `gradle :tests:ui-ij-tests:testUi --tests "YourTestClassName.yourTestMethod"`
	- To run all Non-Octopus UI tests: `gradle :tests:ui-ij-tests:testUi`
	- The test MUST either pass completely or show clear improvement (e.g., progressing further before failure)
	6. If the test passes or shows improvement with your fix, create a PR with:
	- Clear title describing the fix
	- Description explaining the root cause and solution
	- Test results showing the fix works
	- Reference to the failing CI run
	7. Use the base branch 'master' for the PR
	- name: Save report
	if: always()
	uses: actions/upload-artifact@v4
	with:
	name: ui-test-fails-report-mac
	path: \|
	build/reports
	tests/ui-ij-tests/build/reports
	idea-sandbox-log
	# build-for-ui-test-linux:
	# runs-on: ubuntu-latest
	# steps:
	# - uses: actions/checkout@v2
	# - name: Setup Java
	# uses: actions/[email protected]
	# with:
	# distribution: zulu
	# java-version: 11
	# - name: Build Plugin
	# run: gradle :buildPlugin
	# - name: Run Idea
	# run: \|
	# export DISPLAY=:99.0
	# Xvfb -ac :99 -screen 0 1920x1080x16 &
	# mkdir -p build/reports
	# gradle :runIdeForUiTests #> build/reports/idea.log
	# - name: Wait for Idea started
	# uses: jtalk/[email protected]
	# with:
	# url: http://127.0.0.1:8082
	# max-attempts: 15
	# retry-delay: 30s
	# - name: Tests
	# run: gradle :testUi
	# - name: Save fails report
	# if: ${{ failure() }}
	# uses: actions/upload-artifact@v2
	# with:
	# name: ui-test-fails-report-linux
	# path: \|
	# ui-test-example/build/reports

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run Non Octopus UI Tests macOS #149

Workflow file

Run Non Octopus UI Tests macOS #149

Uh oh!

Workflow file for this run