rstudio / db.rstudio.com Goto Github PK

View Code? Open in Web Editor NEW

14.0 12.0 15.0 16 MB

Website dedicated to all things R and Databases

Home Page: http://db.rstudio.com

License: Creative Commons Attribution Share Alike 4.0 International

HTML 88.00% CSS 11.58% JavaScript 0.41%

r databases dplyr dplyr-sql-backends odbc

db.rstudio.com's Introduction

db.rstudio.com

This repo (and resulting website) is licensed as CC BY-SA.

This is a blogdown site. To make changes, you can add a new .Rmd under /content.

blogdown::serve_site() will automatically re-render any pages needed from .Rmd to html and will let you view a preview.

db.rstudio.com's People

Contributors

Stargazers

Watchers

Forkers

cderv julianflowers colearendt carneybill mungojam blairj09 markderry hdykiel badjouras ricardofandrade edgararuiz ashleyhenry15 isabella232

db.rstudio.com's Issues

Managing credentials section

I think you need a short section on managing credentials in between "run queries safely" and "deploying content". This would basically say that you should never store a password in your R script and never type it into the console.

For now, this section would discuss three options to avoid this:

Use rstudioapi::askForPassword()
Use keyring
Use an environment variable

Add database page - Teradata

Typo in dbplyr github repo on the Redshift page

Under the Availability header the code block for installing dbplyr reads:

devtools::install_github("tidyvers/dbplyr")

It should instead read:

devtools::install_github("tidyverse/dbplyr")

Add MonetDB database

@hannesmuehleisen - I think that we should include MonetDB and MonetDBLite-R to the Databases section. Any suggestions or objections to that? If none, would you have any recommendations for connecting that are different from what in the README in the repo (https://github.com/hannesmuehleisen/MonetDBLite-R)?

Broken link on https://db.rstudio.com/best-practices/dashboards/#full-example

The following links, which can be found on https://db.rstudio.com/best-practices/dashboards/#full-example, are broken:

Possibly update drivers installation instruction

Look into updating the ODBC drivers installation instructions for Linux: rstudio/rstudio#2463

Big Query page update

Update http://db.rstudio.com/databases/big-query/ with info from the updated README in bigrquery: r-dbi/bigrquery#202

Article to expand connectivity options

Expand on the ideas on this thread:

https://community.rstudio.com/t/why-does-rstudio-documentation-recommend-odbc-vs-jdbc-drivers/2381/5

cc @hadley

Add xml indexing for posts

@carneybill

Fix google analytics tracking

Add:

To a new footer-custom.html file inside the layouts/partial folder

Have an article explaining when (or how to decide) to use DBI vs dplyr

This is something that comes up whenever I talk about databases, and it would be good to have "official" guidelines that we can direct people to. Since DBI and dplyr overlap a lot in their database functionality, users (especially newbies) are often at a loss on the topic of which one to choose.

In the olden times, when dplyr only supported SELECT-power statements, the rule was easy. Now, not so much. However, that may still be a good rule of thumb as non-SELECT statements in dplyr have not been time-tested yet.

Whatever the official recommendation is, we should have one, so that the answer to "Which package to use?" is not completely the opinion of whomever happens to be answering it.

cc @hadley

Look into adding the config pattern from dbtest as examples

https://github.com/rstudio/dbtest#yaml-file

PostgreSQL - Add SSL certificate how-to

Best Practices Article - Improving Shiny apps

Put together an article that discusses steps to take during and after a Shiny app that uses a database as it's source is built. Using an example, the plan is to show how to use the following tools to improve the app's performance:

The pool package
Use profvis, and possibly shinyloadtest, after the app is complete
How to use show_query() and explain() to troubleshoot long running queries

The article should also include links to these pages:

@bborgesr
@jcheng5

Add glue_sql to best practices

https://twitter.com/joelgombin/status/928195603947048960

Connection examples for new drivers

It looks like we need to add connection pages, such as this one for MS SQL (https://db.rstudio.com/databases/microsoft-sql-server/), for the new drivers we offer:

Apache Cassandra
Amazon Athena
MongoDB
Google BigQuery
IBM Netezza

Can you add examples as comments to this Issue? I'll be glad to create each page

Add an example of "Invalidate Metadata" for Impala

Add a section to Impala about cleaning up your metadata. If you create a table in Impala and then drop the Hive metadata, you will need to invalidate the Impala metadata.

impala_con <- dbConnect(odbc::odbc(), "Impala")
dbWriteTable(impala_con, "mtcars", mtcars)
hive_con <- dbConnect(odbc::odbc(), "Hive")
dbRemoveTable(hive_con, "mtcars")
dbReadTable(impala_con, "mtcars") # succeeds
dbExistsTable(impala_con, "mtcars") # fails
dbGetQuery(odbcCon, "INVALIDATE METADATA mtcars")
dbExistsTable(impala_con, "mtcars") # succeeds

This happens because dropping the Hive metadata does not drop the Impala metadata. More information can be found here.

Add a "how to connect to a db" page

^^ Should have been self evident :p

Add article about custom functions using rlang

Add rstudioconf videos

Incorporate RViews Dashboard article info

Update list of available RStudio Drivers

https://www.rstudio.com/products/drivers/

Add an article about adding and updating existing records in a database through R

I think it would be really helpful, if there was an article about adding and more importantly updating existing records in a database through R, and the best practices to use. There are many excellent resources on downloading, cleaning and transforming data in R, but data warehousing is a crucial piece missing in the data pipeline.

Add documentation for supported data types

Add a table that lists supported data types for each data source. For example:

Supported Teradata Data Types

R type	Teradata type
factor	VARCHAR(255)
time	VARCHAR(255)
date	VARCHAR(255)
binary	BLOB
integer	INTEGER
double	BINARY_DOUBLE
character	VARCHAR(255)
logical	DECIMAL
list	VARCHAR(255)

Unsupported types will throw an error. See r-dbi/odbc#238.

Fix broken link in the odbc page

MySQL does not support boolean

MySQL does not have a boolean type and uses TINYINT instead. Therefore, if you write logical values they will get returned as integers. Please make a note in the documentation.

Troubleshooting Connections page

Provide some FAQ level answers to common issues users have reported when connecting and using a database. It should consist mostly of links to other articles, such as:

https://support.rstudio.com/hc/en-us/articles/115011860827-System-Requirements-for-ODBC-Connections-and-RStudio-Professional-Drivers

And links to the Known Issues sections of those databases we have documented issues for.

The main idea is to have a very visible "Troubleshooting" link in the page that users can go in. We can update it as users report to use that they couldn't find what they were looking for.

Another idea is to include the link to community.rstudio.com to also cover non-customers who are still having issues.

@jimhester
@nwstephens

RSC Env Vars

Add the Environment Variables feature in RSC as an option in this page: http://db.rstudio.com/best-practices/portable-code/

Ref: https://blog.rstudio.com/2018/04/12/rstudio-connect-1-6-0-a-year-in-the-making/

Avoid .connection_string

And instead use named parameters to dbConnect()

Add drivers page to the RStudio section

https://db.rstudio.com/ does not work

Using https instead of http makes the site render incorrectly

https://db.rstudio.com/

http://db.rstudio.com/

Fix favicon

Consistent Section Headers

Right now we have "Best Practices" as a section header, but no other section headers, so the sidebar feels inconsistent. We should change the structure to something like this:

I added "Examples" and "Packages" as section headers

This should be a relatively simple change to config.toml

Fix the snippets page

The content of this page: http://db.rstudio.com/advanced/snippets/ , needs to be sourced from here: https://rstudio.github.io/rstudio-extensions/rstudio-connections.html

Expand on Pool and Shiny best practices and include custom SQL example

The current pool page doesn't go into much detail about where and when to use different pool functions.

For example:

where/when in your shiny app to create a pool object
where/when to checkout connections from the pool with poolCheckout()
where/when to when to return connections to the pool with poolReturn()
where/when to close the pool using poolClose()

The current page also only has guidance when using dplyr to query, but not using custom SQL.

Addresses: https://community.rstudio.com/t/pool-and-shiny-best-practices-for-getting-custom-queries-from-database/3707/4

Add HANA connection page

Ref tidyverse/dplyr#3343

Database switching

Add page to explain how to switch between databases using dbplyr and DBI/odbc.

It should include how to use our functions and function arguments, such as in_schema() and dbListTables(...schema_name=""), and some DB specific commands like Oracle's SET SCHEMA. Also, cross reference those DB specific commands to their respective page in the site.

cc.
@jimhester
@nwstephens
@hadley

Incorporate materials from ODSC talk

Add call out to Kerberos in the Hive and Impala pages

cc @nwstephens

Broken link on Connections pane page

Looks like there's a broken link on this page - https://db.rstudio.com/rstudio/connections/

the link to "Integrated Security with DSN" goes to - http://127.0.0.1:4321/managing-credentials/#integrated-security-with-dsn
This should instead go to https://db.rstudio.com/best-practices/managing-credentials/#integrated-security-with-dsn

Add database page - MS Access

Switch the the path argument with dbname

In the /dplyr page

cc @krlmlr

Consider mentioning bpc for bulk copying to Azure

https://community.rstudio.com/t/dbi-dbwritetable-is-slow-for-writing-to-azure/19262/21

Typo still present in website

I was looking to correct an error but it already has been corrected here cddf2dc

but it is not yet online.
https://db.rstudio.com/databases/postgresql/

html file has not been rebuilt and deployed

db.rstudio.com/content/databases/postgresql.html

Line 44 in 589937a

PWD = rstudioapi::askForPassword("Database password")

cheers

Remove rstudio conf mention from education

Parameterized queries not working with "?"

See: r-dbi/RPostgres#201

Visualization

Add reference to dbplot in the best practice visualization article
Add dbplot to the packages page

Update/refine Big Query page

r-dbi/bigrquery#202

Link to Drivers section leads to "Page Not found" on the Redshift and PostgreSQL pages

Under the "Connection Settings" header, then the "Driver" bullet.

In "See the Drivers section for setup information", the word Driver is a hyperlink to https://db.rstudio.com/redshift/drivers which loads a "Page Not found" page.

It is not entirely clear to me if the expected behavior is to lead to the anchor tag above https://db.rstudio.com/redshift/#driver-options or if there was supposed to be a page in the location it links to.

The PostgreSQL has identical behavior, except the link that isn't found is https://db.rstudio.com/postgresql/drivers and I do not seem to see an anchor tag on the page that would make sense for it to jump to.

Change references from RPostgres to RPostgreSQL

Most references are pointed to RPostgreSQL package, need to make sure all are pointed to that package instead of RPostgres