Deprecated: Function get_magic_quotes_gpc() is deprecated in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 99

Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 619

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1169

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176

Warning: Cannot modify header information - headers already sent by (output started at /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php:99) in /hermes/walnacweb04/walnacweb04ab/b2791/pow.jasaeld/htdocs/De1337/nothing/index.php on line 1176
8000 nissymori (Soichiro Nishimori) / Starred · GitHub
Nothing Special   »   [go: up one dir, main page]

Skip to content
View nissymori's full-sized avatar

Block or report nissymori

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Clean single-file implementation of offline RL algorithms in JAX

Python 158 4 Updated Dec 24, 2024
Jupyter Notebook 10 Updated Aug 8, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,895 1,862 Updated Oct 6, 2025

[RLC 2025] Official code repository for "Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps"

Python 2 Updated Oct 20, 2025

Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"

Python 23 1 Updated Sep 7, 2025

[TMLR 2025] Importance Weighting for Aligning Language Models under Deployment Distribution Shift

Python 5 Updated Jul 22, 2025

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 7 Updated Jul 23, 2025

Simplifying reinforcement learning for complex game environments

C 3,715 269 Updated Oct 21, 2025

Ludax is a domain-specific language for board games that automatically compiles into hardware-accelerated learning environments with the JAX library

Python 23 1 Updated Oct 21, 2025

textlintプリセット:AIっぽい記述パターンを検出し、より自然な日本語表現を促すルール集

TypeScript 1,011 21 Updated Aug 20, 2025

An Agent-Environment interface with Jax transformation compatibility

Python 12 Updated Nov 17, 2023

[ICML 2025] Official code repository for "Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning"

Python 6 Updated Jun 17, 2025
Python 7 Updated Jul 28, 2025
Python 5 Updated Jun 2, 2025

Policy-based Riichi Mahjong(立直麻将) AI, based on Mortal and Suphx, powered by Rust and deep RL.

Rust 27 4 Updated Sep 24, 2025

Code for the paper "Practical estimation of the optimal classification error with soft labels and calibration"

Python 5 Updated Sep 26, 2025

Implementation of Soft Actor Critic and some of its improvements in Pytorch

Python 60 1 Updated Feb 13, 2025

Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023

Python 28 6 Updated Mar 7, 2024

Unified Implementations of Offline Reinforcement Learning Algorithms

Python 113 5 Updated Oct 12, 2025

Authors' implementation of the "Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?"

Python 9 Updated Jun 13, 2024

☗ lishogi.org: the forever free, adless and open source shogi server forked from lichess.org ☗

Scala 320 44 Updated Oct 8, 2025

[CVPR2025] Official PyTorch implimentation for "Physical Plausibility-aware Trajectory Prediction via Locomotion Embodiment"

Python 43 1 Updated Aug 31, 2025

A Mahjong AI for Mahjong Soul

JavaScript 441 57 Updated Nov 3, 2024

YaneuraOu is the World's Strongest Shogi engine(AI player) , WCSC29 1st winner , educational and USI compliant engine.

C++ 604 158 Updated Oct 21, 2025

Artificial Intelligence for Japanese mahjong

C++ 275 44 Updated Jul 5, 2022

MyoSuite is a collection of environments/tasks to be solved by musculoskeletal models simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API.

Python 1,042 146 Updated Oct 16, 2025

🚀🀄️ A fast and strong AI for riichi mahjong, powered by Rust and deep reinforcement learning.

Rust 1,268 159 Updated Sep 28, 2025

A curated list of Diffusion Model in RL resources (continually updated)

1,369 69 Updated Sep 12, 2025
TypeScript 26 7 Updated Oct 17, 2025
Next
0