rstudio / db.rstudio.com

Website dedicated to all things R and Databases

Home Page: http://db.rstudio.com

License: Creative Commons Attribution Share Alike 4.0 International

HTML 88.00% CSS 11.58% JavaScript 0.41%

r databases dplyr dplyr-sql-backends odbc

db.rstudio.com's Issues

Best Practices Article - Improving Shiny apps

Put together an article that discusses steps to take while building, and after building, a Shiny app that uses a database as its source. Using an example, the plan is to show how to use the following tools to improve the app's performance:

The pool package
Use profvis, and possibly shinyloadtest, after the app is complete
How to use show_query() and explain() to troubleshoot long running queries
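
As a rough sketch of the troubleshooting step, the snippet below inspects the SQL and the query plan behind a lazy dplyr query. It assumes the DBI, dplyr, dbplyr and RSQLite packages are installed; the in-memory SQLite database and the `slow_query` name are stand-ins for the app's real database and query.

```r
library(DBI)
library(dplyr)
library(dbplyr)

# In-memory SQLite stands in for the app's real database (assumption).
con <- dbConnect(RSQLite::SQLite(), ":memory:")
copy_to(con, mtcars, "mtcars")

# Nothing is sent to the database yet; this only builds a lazy query.
slow_query <- tbl(con, "mtcars") %>%
  filter(cyl == 4) %>%
  group_by(gear) %>%
  summarise(avg_mpg = mean(mpg, na.rm = TRUE))

show_query(slow_query) # prints the SQL that dbplyr generates
explain(slow_query)    # prints the SQL together with the database's query plan

dbDisconnect(con)
```

The plan printed by explain() is database specific, so its layout will differ between SQLite and a production backend.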

The article should also include links to these pages:

@bborgesr
@jcheng5

Fix broken link in the odbc page

https://db.rstudio.com/ does not work

Using https instead of http makes the site render incorrectly

https://db.rstudio.com/

http://db.rstudio.com/

Add article about custom functions using rlang

Link to Drivers section leads to "Page Not found" on the Redshift and PostgreSQL pages

Under the "Connection Settings" header, then the "Driver" bullet.

In "See the Drivers section for setup information", the word Driver is a hyperlink to https://db.rstudio.com/redshift/drivers which loads a "Page Not found" page.

It is not entirely clear to me if the expected behavior is to lead to the anchor tag above https://db.rstudio.com/redshift/#driver-options or if there was supposed to be a page in the location it links to.

The PostgreSQL page has identical behavior, except the link that isn't found is https://db.rstudio.com/postgresql/drivers and I do not see an anchor tag on the page that would make sense for it to jump to.

Have an article explaining when (or how to decide) to use DBI vs dplyr

This is something that comes up whenever I talk about databases, and it would be good to have "official" guidelines that we can direct people to. Since DBI and dplyr overlap a lot in their database functionality, users (especially newbies) are often at a loss on the topic of which one to choose.

In the olden times, when dplyr only supported SELECT-power statements, the rule was easy. Now, not so much. However, that may still be a good rule of thumb as non-SELECT statements in dplyr have not been time-tested yet.

Whatever the official recommendation is, we should have one, so that the answer to "Which package to use?" is not completely the opinion of whoever happens to be answering it.
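
To make the overlap concrete, here is a minimal sketch of the same aggregate written both ways, followed by a non-SELECT statement sent through DBI. It is an illustration rather than an official guideline, and it assumes DBI, dplyr, dbplyr and RSQLite are installed, with an in-memory SQLite database as a stand-in.

```r
library(DBI)
library(dplyr)

con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "mtcars", mtcars)

# DBI: you write the SQL yourself; any statement the database accepts will work.
via_dbi <- dbGetQuery(con, "SELECT cyl, COUNT(*) AS n FROM mtcars GROUP BY cyl")

# dplyr (through dbplyr): you write R verbs, which are translated to a SELECT.
via_dplyr <- tbl(con, "mtcars") %>%
  count(cyl) %>%
  collect()

# A non-SELECT statement goes through DBI directly.
rows_changed <- dbExecute(con, "UPDATE mtcars SET mpg = mpg + 1 WHERE cyl = 4")

dbDisconnect(con)
```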

cc @hadley

Change references from RPostgres to RPostgreSQL

Most references point to the RPostgreSQL package; make sure all of them point to that package instead of RPostgres.

Incorporate RViews Dashboard article info

Fix the snippets page

The content of this page: http://db.rstudio.com/advanced/snippets/ , needs to be sourced from here: https://rstudio.github.io/rstudio-extensions/rstudio-connections.html

Add HANA connection page

Ref tidyverse/dplyr#3343

Add database page - Teradata

Add an example of "Invalidate Metadata" for Impala

Add a section to Impala about cleaning up your metadata. If you create a table in Impala and then drop the Hive metadata, you will need to invalidate the Impala metadata.

library(DBI)

impala_con <- dbConnect(odbc::odbc(), "Impala")
dbWriteTable(impala_con, "mtcars", mtcars)

# Drop the table through Hive; Impala's own metadata is not dropped with it
hive_con <- dbConnect(odbc::odbc(), "Hive")
dbRemoveTable(hive_con, "mtcars")

dbReadTable(impala_con, "mtcars") # succeeds
dbExistsTable(impala_con, "mtcars") # fails
dbGetQuery(impala_con, "INVALIDATE METADATA mtcars") # refresh Impala's metadata
dbExistsTable(impala_con, "mtcars") # succeeds

This happens because dropping the Hive metadata does not drop the Impala metadata. More information can be found here.

Possibly update drivers installation instruction

Look into updating the ODBC drivers installation instructions for Linux: rstudio/rstudio#2463

Add database page - MS Access

Add documentation for supported data types

Add a table that lists supported data types for each data source. For example:

Supported Teradata Data Types

R type	Teradata type
factor	VARCHAR(255)
time	VARCHAR(255)
date	VARCHAR(255)
binary	BLOB
integer	INTEGER
double	BINARY_DOUBLE
character	VARCHAR(255)
logical	DECIMAL
list	VARCHAR(255)

Unsupported types will throw an error. See r-dbi/odbc#238.
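
One possible way to generate such a table is to ask DBI which SQL type it would pick for a sample value of each R type. The sketch below uses DBI's generic `ANSI()` dummy connection, so the types it reports are generic ones and not Teradata's; passing a live connection instead should give the driver-specific mapping. The `samples` list and the `type_map` name are illustrative.

```r
library(DBI)

# One sample value per R type of interest (illustrative choice).
samples <- list(
  integer   = 1L,
  double    = 1.5,
  character = "a",
  logical   = TRUE,
  date      = Sys.Date()
)

# ANSI() is DBI's generic dummy connection. Swap in a live connection,
# for example one created with dbConnect(odbc::odbc(), ...), for real mappings.
type_map <- data.frame(
  r_type   = names(samples),
  sql_type = vapply(samples, function(x) dbDataType(ANSI(), x), character(1)),
  row.names = NULL,
  stringsAsFactors = FALSE
)

type_map
```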

Look into adding the config pattern from dbtest as examples

https://github.com/rstudio/dbtest#yaml-file

Typo in dbplyr github repo on the Redshift page

Under the Availability header the code block for installing dbplyr reads:

devtools::install_github("tidyvers/dbplyr")

It should instead read:

devtools::install_github("tidyverse/dbplyr")

Big Query page update

Update http://db.rstudio.com/databases/big-query/ with info from the updated README in bigrquery: r-dbi/bigrquery#202

RSC Env Vars

Add the Environment Variables feature in RSC as an option in this page: http://db.rstudio.com/best-practices/portable-code/

Ref: https://blog.rstudio.com/2018/04/12/rstudio-connect-1-6-0-a-year-in-the-making/

Troubleshooting Connections page

Provide some FAQ level answers to common issues users have reported when connecting and using a database. It should consist mostly of links to other articles, such as:

https://support.rstudio.com/hc/en-us/articles/115011860827-System-Requirements-for-ODBC-Connections-and-RStudio-Professional-Drivers

And links to the Known Issues sections of those databases we have documented issues for.

The main idea is to have a very visible "Troubleshooting" link on the page that users can follow. We can update it as users report to us that they couldn't find what they were looking for.

Another idea is to include the link to community.rstudio.com to also cover non-customers who are still having issues.

@jimhester
@nwstephens

Update list of available RStudio Drivers

https://www.rstudio.com/products/drivers/

Add a "how to connect to a db" page

^^ Should have been self evident :p

Avoid .connection_string

And instead use named parameters to dbConnect()
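
A sketch of what the two styles look like side by side. The driver name, server, database and user are hypothetical placeholders, and `connect_sales()` is only a wrapper invented for this example so that nothing connects until it is called.

```r
library(DBI)

# Discouraged: every setting packed into one string.
# con <- dbConnect(
#   odbc::odbc(),
#   .connection_string = "Driver={PostgreSQL};Server=db.example.com;Database=sales;UID=analyst;PWD=...;"
# )

# Preferred: one named argument per setting (all values are placeholders).
connect_sales <- function() {
  dbConnect(
    odbc::odbc(),
    Driver   = "PostgreSQL",
    Server   = "db.example.com",
    Database = "sales",
    UID      = "analyst",
    PWD      = rstudioapi::askForPassword("Database password"),
    Port     = 5432
  )
}
```

Named arguments keep each setting visible and easy to change, and the password no longer has to be pasted into a string.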

Article to expand connectivity options

Expand on the ideas on this thread:

https://community.rstudio.com/t/why-does-rstudio-documentation-recommend-odbc-vs-jdbc-drivers/2381/5

cc @hadley

Add an article about adding and updating existing records in a database through R

I think it would be really helpful to have an article about adding, and more importantly updating, existing records in a database through R, along with the best practices to use. There are many excellent resources on downloading, cleaning and transforming data in R, but data warehousing is a crucial piece missing from the data pipeline.
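
As a starting point for such an article, here is a minimal sketch of appending and updating rows with DBI. It assumes DBI (1.1 or later, for dbAppendTable()) and RSQLite are installed; the `customers` table and its contents are made up for the example.

```r
library(DBI)

con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(
  con, "customers",
  data.frame(id = 1:2, city = c("Boston", "Denver"), stringsAsFactors = FALSE)
)

# Add new records to the existing table.
dbAppendTable(
  con, "customers",
  data.frame(id = 3L, city = "Seattle", stringsAsFactors = FALSE)
)

# Update an existing record; values are passed as parameters, not pasted into the SQL.
rows_updated <- dbExecute(
  con,
  "UPDATE customers SET city = ? WHERE id = ?",
  params = list("Austin", 2L)
)

customers <- dbReadTable(con, "customers")
dbDisconnect(con)
```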

PostgreSQL - Add SSL certificate how-to

Update/refine Big Query page

r-dbi/bigrquery#202

Consistent Section Headers

Right now we have "Best Practices" as a section header, but no other section headers, so the sidebar feels inconsistent. We should change the structure to something like this:

I added "Examples" and "Packages" as section headers

This should be a relatively simple change to config.toml

Parameterized queries not working with "?"

See: r-dbi/RPostgres#201
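
For context, placeholder syntax depends on the backend. The sketch below runs a "?" placeholder against an in-memory SQLite database (assuming DBI and RSQLite are installed); the commented line shows the numbered "$1" form that, to my understanding, RPostgres expects, with `pg_con` as a hypothetical PostgreSQL connection.

```r
library(DBI)

con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "mtcars", mtcars)

# SQLite accepts "?" as a positional placeholder.
four_cyl <- dbGetQuery(
  con,
  "SELECT * FROM mtcars WHERE cyl = ?",
  params = list(4)
)

# With RPostgres the same query would use a numbered placeholder (assumption):
# dbGetQuery(pg_con, "SELECT * FROM mtcars WHERE cyl = $1", params = list(4))

dbDisconnect(con)
```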

Expand on Pool and Shiny best practices and include custom SQL example

The current pool page doesn't go into much detail about where and when to use different pool functions.

For example:

where/when in your shiny app to create a pool object
where/when to checkout connections from the pool with poolCheckout()
where/when to return connections to the pool with poolReturn()
where/when to close the pool using poolClose()

The current page also only has guidance when using dplyr to query, but not using custom SQL.

Addresses: https://community.rstudio.com/t/pool-and-shiny-best-practices-for-getting-custom-queries-from-database/3707/4
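
A minimal sketch of the four steps above with custom SQL, outside of Shiny so that it can be run on its own. It assumes the pool, DBI and RSQLite packages are installed and uses a temporary SQLite file as a stand-in database. In a Shiny app the pool would typically be created once outside the server function and closed from shiny::onStop().

```r
library(DBI)
library(pool)

# Create the pool once, at app start-up, so all sessions share it.
pool <- dbPool(RSQLite::SQLite(), dbname = tempfile(fileext = ".sqlite"))

# Check out a connection only when several statements must share it,
# for example inside a transaction, and return it as soon as you are done.
con <- poolCheckout(pool)
dbWriteTable(con, "mtcars", mtcars)
dbWithTransaction(con, {
  dbExecute(con, "UPDATE mtcars SET mpg = mpg + 1 WHERE cyl = 4")
  dbExecute(con, "UPDATE mtcars SET mpg = mpg - 1 WHERE cyl = 8")
})
poolReturn(con)

# One-off custom SQL can go straight to the pool, which borrows and
# returns a connection behind the scenes.
result <- dbGetQuery(pool, "SELECT COUNT(*) AS n FROM mtcars")

# Close the pool when the app stops.
poolClose(pool)
```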

Managing credentials section

I think you need a short section on managing credentials in between "run queries safely" and "deploying content". This would basically say that you should never store a password in your R script and never type it into the console.

For now, this section would discuss three options to avoid this:

Use rstudioapi::askForPassword()
Use keyring
Use an environment variable
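
The three options could look roughly like this. The service name, user name and the `DB_PASSWORD` variable are hypothetical, and `get_db_password()` is a helper written for this sketch, not part of any package.

```r
# Option 1: prompt for the password (works inside the RStudio IDE).
# pwd <- rstudioapi::askForPassword("Database password")

# Option 2: read it from the operating system's credential store.
# pwd <- keyring::key_get("my-database", "analyst")

# Option 3: read it from an environment variable, failing loudly if it is missing.
get_db_password <- function(var = "DB_PASSWORD") {
  value <- Sys.getenv(var, unset = NA)
  if (is.na(value) || !nzchar(value)) {
    stop("Environment variable ", var, " is not set", call. = FALSE)
  }
  value
}
```

Calling get_db_password() inside dbConnect() keeps the password out of the script and out of the console history.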

Fix favicon

Add MonetDB database

@hannesmuehleisen - I think that we should add MonetDB and MonetDBLite-R to the Databases section. Any suggestions or objections to that? If none, would you have any recommendations for connecting that are different from what is in the README in the repo (https://github.com/hannesmuehleisen/MonetDBLite-R)?

Typo still present in website

I was looking to correct an error but it already has been corrected here cddf2dc

but it is not yet online.
https://db.rstudio.com/databases/postgresql/

html file has not been rebuilt and deployed

db.rstudio.com/content/databases/postgresql.html

Line 44 in 589937a

PWD = rstudioapi::askForPassword("Database password")

cheers

Broken link on https://db.rstudio.com/best-practices/dashboards/#full-example

The following links, which can be found on https://db.rstudio.com/best-practices/dashboards/#full-example, are broken:

Connection examples for new drivers

It looks like we need to add connection pages, such as this one for MS SQL (https://db.rstudio.com/databases/microsoft-sql-server/), for the new drivers we offer:

Apache Cassandra
Amazon Athena
MongoDB
Google BigQuery
IBM Netezza

Can you add examples as comments to this Issue? I'll be glad to create each page

Fix google analytics tracking

Add:

To a new footer-custom.html file inside the layouts/partial folder

Consider mentioning bcp for bulk copying to Azure

https://community.rstudio.com/t/dbi-dbwritetable-is-slow-for-writing-to-azure/19262/21

Add call out to Kerberos in the Hive and Impala pages

cc @nwstephens

Incorporate materials from ODSC talk

Remove rstudio conf mention from education

MySQL does not support boolean

MySQL does not have a boolean type and uses TINYINT instead. Therefore, if you write logical values they will get returned as integers. Please make a note in the documentation.
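
A note in the documentation could show how to restore the type after reading. The sketch below simulates the round trip with a plain data frame, since it does not connect to MySQL; the `from_mysql` data and column names are made up.

```r
# Simulated result of reading a logical column back from MySQL:
# the values arrive as 0/1 integers instead of TRUE/FALSE.
from_mysql <- data.frame(id = 1:3, is_active = c(1L, 0L, NA))

# Convert the column back to logical after reading.
restored <- from_mysql
restored$is_active <- as.logical(restored$is_active)
```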

Database switching

Add page to explain how to switch between databases using dbplyr and DBI/odbc.

It should include how to use our functions and function arguments, such as in_schema() and dbListTables(...schema_name=""), and some DB specific commands like Oracle's SET SCHEMA. Also, cross reference those DB specific commands to their respective page in the site.
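
As a small illustration of the dbplyr and DBI sides, the sketch below reads a table that lives in a non-default schema. It assumes DBI, dplyr, dbplyr and RSQLite are installed; an attached in-memory SQLite database named `sales` plays the role of the schema, and the `orders` table is made up.

```r
library(DBI)
library(dplyr)
library(dbplyr)

con <- dbConnect(RSQLite::SQLite(), ":memory:")

# An attached database behaves like a schema in SQLite (stand-in for a real schema).
dbExecute(con, "ATTACH DATABASE ':memory:' AS sales")
dbExecute(con, "CREATE TABLE sales.orders (id INTEGER, amount REAL)")
dbExecute(con, "INSERT INTO sales.orders VALUES (1, 9.5), (2, 20.0)")

# dbplyr: point tbl() at the schema-qualified table with in_schema().
orders <- tbl(con, in_schema("sales", "orders")) %>% collect()

# DBI: qualify the table name in the SQL itself.
orders_dbi <- dbGetQuery(con, "SELECT * FROM sales.orders")

dbDisconnect(con)
```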

cc.
@jimhester
@nwstephens
@hadley

The link to "Integrated Security with DSN" goes to http://127.0.0.1:4321/managing-credentials/#integrated-security-with-dsn
This should instead go to https://db.rstudio.com/best-practices/managing-credentials/#integrated-security-with-dsn

Visualization

Add reference to dbplot in the best practice visualization article
Add dbplot to the packages page
