pysastrawi's People

Contributors

gsarwohadi, har07, khrlimam, prasastoadi, sanspa, widnyana

pysastrawi's Issues

Stopword Removal Problem

kalimat = 'Saya dan Kamu adalah Manusia'
stop = stopword.remove(kalimat)
print(stop)
Output: Saya Kamu Manusia

kalimat = 'saya dan kamu adalah Manusia'
stop = stopword.remove(kalimat)
print(stop)
Output: dan kamu Manusia

Is this the intended behavior, or am I misunderstanding something?
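
One pattern that would reproduce both outputs above is a remover that deletes words from a list while looping over that same list, and that compares words case-sensitively. The sketch below is illustrative only: `STOPWORDS`, `remove_buggy`, and `remove_fixed` are hypothetical names, and the three-word list is a stand-in, not Sastrawi's dictionary or source code.

```python
# Hypothetical stand-in stopword list; NOT Sastrawi's actual dictionary.
STOPWORDS = {"saya", "dan", "adalah"}


def remove_buggy(text):
    """Delete from the list being iterated: the word after each hit is skipped."""
    words = text.split(" ")
    for word in words:
        if word in STOPWORDS:  # case-sensitive lookup
            words.remove(word)  # shifts later items left during iteration
    return " ".join(words)


def remove_fixed(text):
    """Build a new list and compare in lowercase instead of mutating in place."""
    return " ".join(w for w in text.split(" ") if w.lower() not in STOPWORDS)


print(remove_buggy("Saya dan Kamu adalah Manusia"))  # Saya Kamu Manusia
print(remove_buggy("saya dan kamu adalah Manusia"))  # dan kamu Manusia
print(remove_fixed("saya dan kamu adalah Manusia"))  # kamu Manusia
```

If the library behaves this way, lowercasing the text before calling `stopword.remove` would only address the capitalization half of the problem; the skipped-word half needs a filter that builds a new list.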

stemmer error

The word "relawan" is stemmed to "rawan", when it should remain "relawan".
In this case, I was stemming a large amount of text from one CSV file into another to see how much the output differs.

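import csv
from Sastrawi.Stemmer.StemmerFactory import StemmerFactory

stemmer = StemmerFactory().create_stemmer()

# Copy every row, writing the stemmed text next to the original column.
with open('asdasd.csv', 'r', encoding='utf8') as f:
    thereader = csv.reader(f)
    with open('asdasd_stem.csv', 'w', newline='', encoding='utf8') as w:
        thewriter = csv.writer(w)
        for row in thereader:
            thewriter.writerow([row[0], row[1], stemmer.stem(row[1]), row[2]])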


Search method

Which function implements the dictionary lookup? I am planning to improve that part. Thank you.

Adding stopwords

Is there any documentation on extending the stopword list? If not, how can I add words to it?

Speed improvement

Stemming 87,000 paragraphs on an i7 takes about 24 hours.
That works out to roughly 500-1000 ms per paragraph.

There seems to be room for optimization, but I have not found where yet. :D
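
One possible direction, sketched under assumptions: if most of the time goes into stemming individual words and the corpus repeats many words, memoizing per-word results means each distinct word is stemmed only once. `make_cached_stemmer` and `toy_stem` are hypothetical names, the toy stemmer is a stand-in rather than Sastrawi's algorithm, and whether this helps depends on what caching the library already does internally.

```python
from functools import lru_cache


def make_cached_stemmer(stem_word, maxsize=100_000):
    """Wrap a per-word stemming function so each distinct word is stemmed once."""

    @lru_cache(maxsize=maxsize)
    def cached(word):
        return stem_word(word)

    def stem_text(text):
        # Lowercase and split on whitespace, then reuse cached results.
        return " ".join(cached(word) for word in text.lower().split())

    return stem_text


# Toy stand-in for a real stemmer (assumption): strips a trailing "nya".
calls = []


def toy_stem(word):
    calls.append(word)
    return word[:-3] if word.endswith("nya") and len(word) > 3 else word


stem_text = make_cached_stemmer(toy_stem)
print(stem_text("Bukunya bukunya rumahnya"))  # buku buku rumah
print(len(calls))  # 2: the repeated word was served from the cache
```

With the real library one could try passing `stemmer.stem` as `stem_word`. Note that this sketch splits on whitespace only, so punctuation handling may differ from calling the stemmer on whole paragraphs.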

Stemming city names

Stemming the city name "bekasi" produces "bekas".
run: print(stemmer.stem("bekasi barat"))
out: bekas barat
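
A possible workaround on the caller's side, as a sketch: keep a set of words (place names, for example) that should never be stemmed and bypass the stemmer for them. `make_protected_stemmer` and `toy_stem` are hypothetical names; the toy stemmer only mimics the reported behavior and is not Sastrawi's algorithm.

```python
def make_protected_stemmer(stem_word, protected):
    """Return a text stemmer that leaves words in `protected` unchanged."""
    protected = {word.lower() for word in protected}

    def stem_text(text):
        out = []
        for word in text.lower().split():
            out.append(word if word in protected else stem_word(word))
        return " ".join(out)

    return stem_text


# Toy stand-in stemmer (assumption): strips a trailing "i", as in the report.
def toy_stem(word):
    return word[:-1] if word.endswith("i") else word


stem_text = make_protected_stemmer(toy_stem, {"bekasi"})
print(toy_stem("bekasi"))         # bekas
print(stem_text("Bekasi barat"))  # bekasi barat
```

The trade-off is that the protected list has to be maintained by hand, and a word that is both a place name and an inflected form will always be left unstemmed.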

Stemming

The stemmer turns "pengering" into "ering"; the result should be "kering".

AttributeError when wrapping the stemmer in a function

When: when the stemmer created by the factory is used inside a function
Expected output: normal stemming results
Actual output: AttributeError: 'list' object has no attribute 'lower'
Code:

factory = StemmerFactory()
stemmer = factory.create_stemmer()

def stemm(text):
  stemmed = stemmer.stem(text)  # <-- the error occurs here
  return stemmed

news['stems'] = news['clean_sw'].apply(lambda x: stemm(x))
news.head()

Additional info: Sastrawi was installed with
pip install PySastrawi
on Google Colab.
Any help would be appreciated.
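
The message `'list' object has no attribute 'lower'` suggests that the values in `clean_sw` are lists of tokens rather than strings. That is an assumption, since the DataFrame itself is not shown. Under that assumption, a small helper that accepts either form is sketched below; `stem_value` and `toy_stem` are hypothetical names, and the toy function stands in for `stemmer.stem`.

```python
def stem_value(value, stem):
    """Stem a string directly, or stem each token when given a list of tokens."""
    if isinstance(value, str):
        return stem(value)
    return [stem(token) for token in value]


# Toy stand-in for stemmer.stem (assumption): it only lowercases its input,
# which is enough to reproduce the AttributeError on list input.
def toy_stem(text):
    return text.lower()


try:
    toy_stem(["Berita", "Terbaru"])
except AttributeError as err:
    print(err)  # 'list' object has no attribute 'lower'

print(stem_value("Berita Terbaru", toy_stem))
print(stem_value(["Berita", "Terbaru"], toy_stem))
```

Applied to the code above, this would read `news['stems'] = news['clean_sw'].apply(lambda x: stem_value(x, stemmer.stem))`. Alternatively, joining the tokens with `' '.join(x)` before stemming yields a single string per row.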
