hackxit / assist-heidi-speech-jokey Goto Github PK
View Code? Open in Web Editor NEWA MVP project for natasa
License: MIT License
A MVP project for natasa
License: MIT License
Implement a feature in Navigation Mode (Mode 1) to display SSML tags as abbreviations or icons, similar to a markdown live preview. This will allow users to quickly identify and understand the presence of SSML tags in the text without viewing the full syntax, enhancing readability and navigation efficiency.
feature/ssml-abbrev-navigation
Create a user interface design that allows users to easily switch between Navigation Mode (Mode 1) and Editing Mode (Mode 2). This interface should be intuitive and accessible, enabling users to seamlessly transition between modes without disrupting their workflow.
feature/mode-switching-ui
Develop a user-friendly and accessible interface using the Kivy Framework, as per the initial draft1.kv
. This interface should be intuitive, with clear visual cues for navigation and accessible pop-up menus for SSML tag selection.
feature/frontend-layout
Develop the functionality to display the full SSML syntax in Editing Mode (Mode 2). When a line of text is selected and zoomed in, the application should render the text with the actual SSML tags, enabling users to view and edit the SSML syntax directly. This feature is critical for detailed editing and customization of SSML tags.
feature/full-ssml-syntax-editing
Create comprehensive documentation for the Navigation Mode (Mode 1) and Editing Mode (Mode 2) features. This documentation should cover detailed usage instructions, design philosophy, technical implementation, and any other relevant information that would assist developers, testers, and end-users in understanding and utilizing these features effectively.
feature/nav-edit-docs
Develop a comprehensive user preference settings feature for the application. This will include a frontend interface for users to customize their settings and backend logic to save and apply these preferences. The specific settings options will be defined in the future, but the infrastructure should be flexible to accommodate a range of user customizations.
feature/user-preferences
Design and implement the SSML Break Tag pop-up menu, which includes both frontend UI design and backend functionality. This component should allow users to easily customize and insert SSML Break tags.
feature/ssml-break-menu
Develop a generic time/date picker pop-up menu component that can be adapted for SSML features like 'time' and 'date'. This component should enable users to easily select time and date values for SSML tags, and is contingent on the text selection mechanism.
feature/time-date-picker-menu
Develop a generic button pop-up menu component that can be adapted for various SSML features including 'pitch', 'emphasis', and 'emotions'. This reusable component should facilitate user interaction for selecting SSML attributes, and is contingent on the successful implementation of the text selection mechanism.
feature/generic-ssml-button-menu
Develop a mechanism to wrap selected text using arbitrary SSML tags. This feature is essential for allowing users to apply SSML enhancements to specific text segments. The implementation requires both frontend interface design and backend logic to handle the selection and wrapping process.
feature/ssml-wrapping-mechanism
Develop a generic Numpad pop-up menu component for the SSML 'number' feature. This component will facilitate user interaction for entering numerical values and should be compatible with both the insertion and selection mechanisms.
feature/ssml-numpad-menu
Implement the integration of the operating system's on-screen keyboard with the application, particularly for use in Editing Mode (Mode 2). This feature will enable users to input and edit text using the on-screen keyboard, ensuring accessibility for users with limited physical input capabilities.
feature/on-screen-keyboard-integration
Develop a comprehensive mechanism for the insertion of arbitrary SSML tags within the text. This feature is a foundational requirement for the application, enabling users to insert any SSML tag as needed. The development includes both frontend UI components and backend logic.
feature/ssml-insertion-mechanism
Develop a comprehensive cursor control mechanism for the application that allows seamless switching between the chunk-based navigation in Navigation Mode (Mode 1) and the character-by-character movement in Editing Mode (Mode 2). This mechanism is critical for providing a fluid and intuitive user experience, catering to the unique needs of users with restricted input abilities.
feature/cursor-control-mechanism
Develop and test the integration with eye-tracking software for cursor movement and selection. This should allow seamless navigation and interaction for users relying on eye-tracking technology.
feature/eye-tracking-integration
Develop backend functionalities for text navigation, SSML tag processing, and voice generation, as outlined in draft1.py
. This includes integrating with speech synthesis OpenAPI and managing temporary storage for voice files.
feature/backend-dev
Implement keyboard shortcuts for text navigation and SSML tag insertion, ensuring they are intuitive and easily accessible for users with restricted movement.
feature/keyboard-shortcuts
Conduct thorough testing of both Navigation Mode (Mode 1) and Editing Mode (Mode 2), with a specific focus on accessibility. This testing should ensure that the application is usable and efficient for users with restricted input capabilities, and that both modes are intuitive and meet accessibility standards.
feature/accessibility-testing-nav-edit
Develop the Navigation Mode (Mode 1) for the application, allowing users to move quickly through the text in 'chunks', such as moving between lines and words. This mode is designed to facilitate fast and efficient navigation through the text, avoiding the need to traverse character-by-character.
feature/navigation-mode-1
Develop the Editing Mode (Mode 2) for the application, enabling users to edit text traditionally. This mode allows character-by-character navigation and text editing, similar to a standard text editor, but with additional accessibility features suitable for users with restricted input capabilities.
feature/editing-mode-2
Design custom emojis or icons to represent different SSML tags within the application. These visual elements will be used in Navigation Mode (Mode 1) to display SSML tags as abbreviations or icons, enhancing readability and user understanding of the SSML structure in the text.
feature/ssml-emoji-icon-design
Create a generic dropdown pop-up menu component adaptable for various SSML features, specifically 'voice' and 'section'. This component should facilitate user interaction for selecting SSML attributes and be compatible with both insertion and selection mechanisms.
feature/generic-ssml-dropdown-menu
Implement a feature to dynamically render text based on the cursor's position, switching between showing SSML tags as abbreviations/icons in Navigation Mode (Mode 1) and displaying the full SSML syntax in Editing Mode (Mode 2). This functionality is essential for providing a context-sensitive display that enhances user understanding and interaction with the text and SSML tags.
feature/dynamic-text-rendering
Develop backend functionality to integrate with an arbitrary speech synthesis API, as chosen by the user in their preference settings. This feature should enable the application to dynamically connect to and utilize different speech synthesis APIs based on user preferences.
feature/speech-synthesis-api-integration
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.