In the multi-label problem there is no constraint on how many of the classes the instance can be assigned to. Formally, multi-label classification is the problem of finding a model that maps inputs x to binary vectors y (assigning a value of 0 or 1 for each element (label) in y).
Problem transformation methods Transformation into binary classification problems
binary relevance
amounts to independently training one binary classifier for each label.
Python and Java Service
Python和Java服务器通信实现的理解和比较 Python的WSGI和Java的Servlet API Python的WSGI 最近在学习使用Pyt
Machine Learning Crash Course Courses
Machine Learning Crash Course Courses ML Concepts Introduction to ML First, it gives you a tool to reduce the time you spend programming. Second, it will allow you to customize your products, making them better for specific groups of people. Third, machine learning lets you solve problems that you, as a programmer, have no idea how to do by hand. machine learning changes the way you think about a problem.
notes of deep learning with python
Deep Learning with Python PART 1 - FUNDAMENTALS OF DEEP LEARNING 1.What is deep learning? Artificial intelligence, machine learning, and deep learning Before deep learning: a brief history of machine learning Why deep learning? Why now? 2.Before we begin: the mathematical building blocks of neural networks A first look at a neural network Data representations for neural networks tensor: it’s a container for numbers. You may be already familiar with matrices, which are 2D tensors: tensors are a generalization of matrices to an arbitrary number of dimensions (note that in the context of tensors, a dimension is often called an axis).
scikit-learn
scikit-learn sklearn
preprocessing
MultiLabelBinarizer
Transforms between iterable of iterables and a multilabel format (binary matrix)
utils
class_weight
compute_class_weight
Estimate class weights for unbalanced datasets.
Solution for imbalanced data
FAQ imbalanced data https://www.tensorflow.org/tutorials/structured_data/imbalanced_data https://stackoverflow.com/questions/43481490/keras-class-weights-class-weight-for-one-hot-encoding https://github.com/jbjorne/TEES/blob/master/Detectors/KerasDetectorBase.py https://towardsdatascience.com/practical-tips-for-class-imbalance-in-binary-classification-6ee29bcdb8a7 https://stackoverflow.com/questions/48485870/multi-label-classification-with-class-weights-in-keras Class weight degrades Multi Label Classification Performance https://datascience.stackexchange.com/questions/28040/class-weight-degrades-multi-label-classification-performance What is the difference between multilabel and multiclass classification? https://www.quora.com/What-is-the-difference-between-multilabel-and-multiclass-classification https://stats.stackexchange.com/questions/260505/should-i-use-a-categorical-cross-entropy-or-binary-cross-entropy-loss-for-binary
Few shot learning
Reference https://github.com/AnthonyMRios/multi-label-zero-shot Few-Shot and Zero-Shot Multi-Label Learning for Structured Label Spaces Deep Neural Networks for Multi-Label Text Classification https://github.com/YujiaBao/Distributional-Signatures Induction Networks for Few-Shot Text Classification https://github.com/laohur/LearnToCompareText
Learning Path for Machine Learning
Learning Path for Machine Learning Basics of machine learning with TensorFlow Step 1: Understand what ML is all about The book ‘Deep Learning in Python’ by Francois Chollet, creator of Keras, is a great place to get started. Read chapters 1-4 to understand the fundamentals of ML from a programmer’s perspective.
Google Developers Machine Learning Crash Course
Machine Learning Glossary
Step 2: Beyond the basics Take the TensorFlow in Practice Specialization, which takes you beyond the basics into introductory Computer Vision, NLP, and Sequence modelling.
Async I/O
non-blocking I/O, where normally slow system calls like open, accept, read, and write return immediately instead of blocking, select, epoll_\* set of functions, libev and libuv open source libraries.
Caching
caching, memory hierarchies, registers, L1, L2 and L3 caches, main memory, disk, swap space, data locality, temporal locality, custom caches, cache replacement policies (LRU, MRU, TLRU, and so forth), profiling tools, **callgrind**, **cachegrind**, and alternatives
Concurrent Programming
sequential programming, VLIW concept, desire to emulate the real world with parallel threads, free-of-charge exploitation of multiple cores (eight per myth machine, eight per rice machine), pros and cons of **thread**ing versus **fork**ing. C++ **thread**s, thread construction using function pointers, blocks, functors, join, detach, race conditions, mutex, IA32 implementation of lock and unlock, spin-lock, busy waiting, preemptive versus cooperative multithreading, yield, **sleep****_for**. condition variables, rendezvous and thread communication, **lock_guard**, wait, notify_one, notify_all, deadlock, busy waiting.
CS103: Mathematical Foundations of Computing
Description 计算机科学所需的数学基础。主题包括证明技术和逻辑;感应; 集合,功能和关系;正式语言简介;DFA,NFA和正则表达式;上下文无关文法,图灵机
CS106AX: Programming Methodologies in JavaScript and Python
JavaScript JavaScript Basics JavaScript Graphics Interactive Graphics Timer-Based Animation Advanced Animation Strings Arrays Objects Binary Representation Cryptography Enigma Machine Overview Python Strings Lists Dictionaries Objects Object-Oriented Programming Objects and Abstraction Data-Driven Programs Adventure Web Programming Interactors CSS Asynchronous Requests HTTP Building Web Applications
CS106X: Programming Abstractions in C++
CS106X: Programming Abstractions in C++ Description 使用C ++介绍了基本的编程概念和软件工程技术。解决问题的技能,基本的抽象数据类型和递归。常规主题包括基本编程方法(工程,模块化,
CS109: Introduction to Probability for Computer Scientists
CS109: Introduction to Probability for Computer Scientists Description 计算机科学相关的概率和统计资料。主题包括组合学;概率论 有条件的概率和独立性;概率分布;贝叶斯定理,大数定律和中心极限定理;和
CS140: Operating Systems and Systems Programming
CS140: Operating Systems and Systems Programming Description 操作系统: 死锁的条件与算法,进程的状态,进程和线程的区别,内存换页算法,同步与互斥等,参考书籍:现代操作系统 Prerequisite Principles of Computer Systems Readings https://web.stanford.edu/~ouster/cgi-bin/cs140-spring20/lectures.php Operating Systems:
CS144: Introduction to Computer Networking
CS144: Introduction to Computer Networking Description 计算机网络:OSI模型,三次握手,TCP状态转换图等,参考书籍:计算机网络与UNIX网络编程第一卷 Prerequisites Principles of Computer Systems Readings 《计算机网络:自顶
CS145: Data Management and Data Systems
CS145: Data Management and Data Systems Description SQL语句的基本写法,如Select, groupby having,数据库的范式,数据库的索引(实现是哪种数据结构)和存储过程等,数据库的A
CS161: Design and Analysis of Algorithms
CS161: Design and Analysis of Algorithms 算法复杂度分析(Big Oh,Omega,Theta),递归关系和主方法。包括随机算法,分治策略,贪婪算法,hasing,堆,图算