Data Scientist Job at NEXUS CORPORATION, 東京都

M3RURDZTbFVHcjNTaXA3UTMzZzEyZFgx
  • NEXUS CORPORATION
  • 東京都

Job Description

業務内容:

  • 以下のいずれかにまず参加して頂きます。その後さらに専門性を高めて頂くか、もしくは他のプロジェクトに参加して、実績に応じて希望のキャリアを積んで頂きます。

アドテク(Adtech)のプロジェクト:

  • インターネット広告の主な仕組みの一つであるRTB(リアルタイム入札)において、広告出稿する側の費用対効果を最適化するDSP(Demand-Side Platform)の機械学習モデルの設計開発、効果測定などをメインに行います

アプリのプロジェクト:

  • フリーWiFi接続を容易にするアプリの新機能や施策の効果測定を因果推論の技術を駆使して行い、データドリブンに経営判断するための仕組みを整えて、サービスのKPIを改善させます

その他のプロジェクト:

  • 暗号資産取引、不正検知などに関して、データ解析や機械学習の技術を応用して支援します

【研究開発業務】:

  • プロジェクト業務を行いながら、一定の時間、全員で最先端の機械学習手法や新たな機械学習の応用を研究します
  • さらに四半期ごとに選任されたメンバーは重点的に研究開発を行います

You will first participate in one of the following projects. After that, you can further develop your expertise or participate in other projects to build your career as desired based on your achievements.

Adtech Project:

  • In RTB (real-time bidding), one of the main mechanisms for internet advertising, we mainly design and develop machine learning models for DSP (Demand-Side Platform) that optimize the cost-effectiveness of advertisers, and measure their effectiveness.

App project:

  • We will use causal inference technology to measure the effectiveness of new features and measures for apps that facilitate free WiFi connection, and improve the KPIs of our services by establishing a system for data-driven management decisions.

Other projects:

  • We will apply data analysis and machine learning technology to support cryptocurrency trading, fraud detection, etc.

[Research and development work]:

  • While working on projects, we will all spend a certain amount of time researching cutting-edge machine learning methods and new machine learning applications
  • In addition, members selected every quarter will focus on research and development

Requirements

【利用技術】:

解析手法:

機械学習:

  • Transformer系(大規模言語モデル他)、グラフニューラルネットワーク(GNN)、多層パーセプトロン(MLP)、アンサンブル学習/勾配ブースティング(Gradient Boost Tree + LR, Random Forest, ExtraTree , Ada Boost, XGBoost, LightGBM)、PCA、FP-Growth、Word2Vec、Doc2Vec、協調フィルタリング、ベイズ推定、HMMモデル(隠れマルコフモデル)

統計分析:

  • t検定、カイ二乗検定、F検定、二項検定、コルモゴロフ・スミルノフ検定、シャピロウィルク検定、サンプリング(MCMC,ブートストラップ法など)、分散分析、因果推論(差分の差分法など)

開発技術/環境:

プログラミング/フレームワーク:

  • Python、PyData(numpy、scipy、pandasなど)、Streamlit
  • PyTorch、TensorFlow、LangChain、Spark(PySpark)

クラウド/オンプレ(ミドルウェア):

  • Google Cloud(GCS、BigQuery、VertexAI、Dataflowなど)
  • AWS(S3、Athena、EMR/Serverless、StepFunction、SageMaker、Bedrockなど)
  • MySQL、MariaDB、Percona Server、PostgreSQL、Galera Cluster、Oracle、Hive、Hadoop/HDFS
  • ConoHa(GPUサーバー)
  • 大規模言語モデル(LLM)関連
  • OpenAI API、Llama3、LangChain、HuggingFace

開発ツール:

  • Atlassian(Jira、Confluence)、Trello
  • VS Code、PyCharm、Jupyter
  • GitHub(Copilot)
  • Tableau、Looker Studio、metabase
  • ChatGPT、Gemini、Claude

開発手法:

  • アジャイル開発(scrumベース)

[Technologies used]:

Analysis Methods:

Machine learning:

  • Transformer series (large-scale language models, etc.), graph neural network (GNN), multi-layer perceptron (MLP), ensemble learning/gradient boosting (Gradient Boost Tree + LR, Random Forest, ExtraTree, Ada Boost, XGBoost, LightGBM), PCA, FP-Growth, Word2Vec, Doc2Vec, collaborative filtering, Bayesian inference, HMM model (hidden Markov model)

Statistical analysis:

  • t-test, chi-square test, F-test, binomial test, Kolmogorov-Smirnov test, Shapiro-Wilk test, sampling (MCMC, bootstrap method, etc.), analysis of variance, causal inference (difference of differences method, etc.)

Development Technology/Environment:

ProgrammingFframework:

  • Python, PyData (numpy, scipy, pandas, etc.), Streamlit
  • PyTorch, TensorFlow, LangChain, Spark (PySpark)

Cloud/on-premise (Middleware):

  • Google Cloud (GCS, BigQuery, VertexAI, Dataflow, etc.)
  • AWS (S3, Athena, EMR/Serverless, StepFunction, SageMaker, Bedrock, etc.)
  • MySQL, MariaDB, Percona Server, PostgreSQL, Galera Cluster, Oracle, Hive, Hadoop/HDFS
  • ConoHa (GPU server)
  • Large-scale language model (LLM) related
  • OpenAI API, Llama3, LangChain, HuggingFace

Development Tools:

  • Atlassian (Jira, Confluence), Trello
  • VS Code, PyCharm, Jupyter
  • GitHub (Copilot)
  • Tableau, Looker Studio, metabase
  • ChatGPT, Gemini, Claude

Development Methods:

  • Agile development (scrum-based)

Job Tags

Similar Jobs

MHM Publishing Inc

Base Pilot Job at MHM Publishing Inc

Pilot-Rotary Wing | Henderson Nevada, USAJob RequirementsMinimum QualificationsCurrent FAA Commercial Rotorcraft Certificate.Helicopter...  ...of 500 hours external load operations.Ability to work in remote locations away from the base.Strong communication skills (English... 

Touchstone Communities

Certified Medication Aide Job at Touchstone Communities

Certified Medication Aide at Touchstone Communities summary:A Certified Medication Aide is responsible for providing compassionate nursing care in accordance with care plans while executing CNA tasks. The position requires valid TX CNA and Medication Aide certifications... 

2iSolutions Inc.

SAP Field Service Management Consultant Job at 2iSolutions Inc.

 ...Job Title: SAP Field Service Management Consultant Contract Duration: 6 Months Location: 100% Remote Key Responsibilities: Provide expert consulting on SAP Field Service Management implementation and optimization for clients. Manage service orders in SAP S/... 

University of Maryland Global Campus

Building Monitor, Iwakuni Job at University of Maryland Global Campus

 ...ded by students and that all University labs/rooms are clean and presentable. Perform other job-related duties a...  ...ma Available for a flexible work schedule including early mornings, days, late evenings, weekends, and holidays Basic ... 

Adecco

User Experience Designer Job at Adecco

 ...Adecco Creative and Marketing seeking an Experience Designer to help create intuitive, engaging, and efficient digital experiences for internal users. This role requires a user-centered design mindset , strong collaboration with cross-functional teams, and the ability...